Gene EcSMS35_1705 details

Gene Information

Locus tag: EcSMS35_1705
Symbol: narU
ID: 6143194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1709920
End bp: 1711308
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 49%
IMG OID: 641616581
Product: nitrite extrusion protein 2
Protein accession: YP_001743759
Protein GI: 170682066
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00886] nitrite extrusion protein (nitrite facilitator)
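
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


# Values copied from the record above.
span = inclusive_span(1709920, 1711308)
print(span)                           # 1389
print(expected_protein_length(span))  # 462
```

Both results agree with the Gene Length (1389 bp) and Protein Length (462 aa) fields.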


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0635768
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTGC AAAATGAGAA AAATAGTCGT TATCTTTTAC GCGACTGGAA ACCAGAAAAT 
CCGGCCTTCT GGGAAAATAA AGGAAAACAT ATTGCACGAA GAAACCTCTG GATATCAGTC
AGTTGTCTAC TTCTTGCCTT CTGTGTCTGG ATGCTATTTA GCGCAGTTAC TGTTAATCTC
AATAAAATCG GTTTTAATTT CACTACCGAT CAACTCTTTT TATTAACCGC ATTACCCTCC
GTTTCTGGCG CATTATTGCG TGTCCCCTAC TCCTTTATGG TGCCTATATT CGGTGGACGC
CGATGGACGG TTTTTAGTAC TGCAATCCTG ATTATTCCTT GCGTCTGGCT CGGAATTGCC
GTGCAGAATC CGAATACTCC TTTTGGGATA TTTATCGTTA TCGCTTTGCT ATGCGGTTTT
GCAGGTGCGA ACTTTGCTTC GAGCATGGGC AATATCAGTT TCTTCTTCCC AAAAGCCAAA
CAAGGGAGCG CACTTGGGAT TAATGGCGGA TTAGGAAACT TAGGTGTAAG TGTGATGCAG
CTGGTTGCAC CGCTGGTCAT TTTTGTACCC GTATTTGCCT TTCTCGGCGT CAATGGCGTA
CCGCAGGCCG ACGGTTCGGT AATGTCGCTG GCGAATGCCG CATGGATTTG GGTGCCATTA
CTGGCGATTG CCACGATCGC CGCCTGGTCA GGGATGAATG ATATCGCCAG TTCACGCGCG
TCAATTGCCG ACCAGCTGCC AGTGTTACAA CGCCTGCATC TCTGGCTGCT GAGCCTGCTT
TACCTTGCCA CCTTCGGTTC GTTTATCGGT TTTTCTGCGG GTTTTGCCAT GCTGGCGAAA
ACTCAGTTCC CGGATGTGAA TATTCTGCGC CTGGCGTTCT TTGGCCCATT TATCGGTGCC
ATCGCGCGAT CGCTTGGTGG TGCTATTTCC GATAAATTCG GCGGCGTGCG GGTGACGTTG
ATCAACTTCA TTTTTATGGC GATTTTCAGC GCCCTGCTGT TCCTTACCTT ACCGGGCACA
GGCTCCGGTA ATTTCATCGC ATTTTACGCC GTATTTATGG GGCTGTTTCT GACCGCGGGT
CTGGGAAGTG GTTCTACTTT CCAGATGATC GCCGTCATCT TTCGCCAGAT AACCATTTAT
CGGGTGAAGA TGAAAGGCGG TAGTGATGAG CAAGCTCAAA GAGAAGCCGT CACCGAAACG
GCAGCCGCTC TGGGCTTTAT CTCAGCCATT GGCGCAGTGG GCGGCTTTTT TATTCCGCAG
GCGTTTGGCA TGTCGCTCAA TATGACCGGC TCTCCGGTGG GCGCGATGAA AGTGTTTTTA
ATCTTCTACA TCGTTTGTGT GCTGCTGACC TGGCTGGTTT ATGGTCGGCG GAAGTTCAGC
CAAAAATAA
 
Protein sequence
MALQNEKNSR YLLRDWKPEN PAFWENKGKH IARRNLWISV SCLLLAFCVW MLFSAVTVNL 
NKIGFNFTTD QLFLLTALPS VSGALLRVPY SFMVPIFGGR RWTVFSTAIL IIPCVWLGIA
VQNPNTPFGI FIVIALLCGF AGANFASSMG NISFFFPKAK QGSALGINGG LGNLGVSVMQ
LVAPLVIFVP VFAFLGVNGV PQADGSVMSL ANAAWIWVPL LAIATIAAWS GMNDIASSRA
SIADQLPVLQ RLHLWLLSLL YLATFGSFIG FSAGFAMLAK TQFPDVNILR LAFFGPFIGA
IARSLGGAIS DKFGGVRVTL INFIFMAIFS ALLFLTLPGT GSGNFIAFYA VFMGLFLTAG
LGSGSTFQMI AVIFRQITIY RVKMKGGSDE QAQREAVTET AAALGFISAI GAVGGFFIPQ
AFGMSLNMTG SPVGAMKVFL IFYIVCVLLT WLVYGRRKFS QK
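
The protein sequence can be derived from the gene sequence by reading it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for elongation; table 11 differs in the start codons it allows, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order (standard codon assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGCACTGC AAAATGAGAA AAATAGTCGT"))  # MALQNEKNSR
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields of this record.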