Gene ECH74115_2079 details

Gene Information

Locus tag: ECH74115_2079
Symbol: narU
ID: 6969977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1976508
End bp: 1977896
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 49%
IMG OID: 643385984
Product: nitrite extrusion protein 2
Protein accession: YP_002270473
Protein GI: 209399656
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00886] nitrite extrusion protein (nitrite facilitator)
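
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1976508
end_bp = 1977896
gene_length_bp = 1389
protein_length_aa = 462

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons,
# the last of which is a stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

coordinates_consistent = span_bp == gene_length_bp
frame_consistent = remainder == 0
protein_consistent = expected_protein_aa == protein_length_aa

print(coordinates_consistent, frame_consistent, protein_consistent)
```

Under these assumptions all three checks hold for this record: the span is 1389 bp, which is 463 codons, giving 462 residues once the stop codon is excluded.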


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTGC AAAATGAGAA AAATAGTCGT TATCTTTTAC GCGACTGGAA ACCAGAAAAT 
CCGGCCTTCT GGAAAAATAA AGGAAAACAT ATTGCTCGAA GAAACCTCTG GATATCAGTC
AGTTGTCTAC TTCTTGCCTT CTGTGTCTGG ATGCTATTTA GCGCAGTTAC CGTTAATCTC
AATAAAATCG GTTTTAATTT CACTACCGAT CAACTCTTTT TATTAACCGC ATTACCCTCC
GTTTCTGGCG CATTATTACG TGTCCCCTAC TCCTTTATGG TGCCTATATT CGGTGGACGT
CGATGGACGG TTTTTAGTAC TGCAATCCTG ATTATTCCTT GCGTCTGGCT CGGAATTGCC
GTGCAGAATC CGAATACTCC TTTTGGGATA TTTATCGTTA TCGCTTTGCT ATGCGGTTTT
GCAGGTGCGA ACTTTGCTTC GAGCATGGGC AATATCAGTT TCTTCTTTCC AAAAGCCAGG
CAAGGTAGCG CTCTTGGGAT TAATGGCGGA TTAGGAAACT TAGGTGTAAG TGTAATGCAG
CTGGTTGCAC CGCTGGTCAT TTTTGTACCC GTATTTGCCT TTCTCGGCGT CAATGGAGTA
CCGCAGGCCG ACGGTTCGGT GATGTCGCTG GCGAATGCCG CATGGATTTG GGTACCGCTA
CTAGCGATTG CCACGATCGC CGCCTGGTCA GGGATGAATG ATATCGCCAG TTCACGCGCG
TCAATTGCCG ACCAGCTCCC TGTCTTACAA CGCCTGCATC TCTGGCTGCT GAGCCTGCTT
TACCTTGCCA CCTTCGGTTC GTTTATCGGT TTTTCTGCGG GTTTCGCCAT GCTGGCAAAA
ACCCAGTTCC CGGATGTGAA TATTCTGCGC CTGGCGTTCT TTGGCCCATT TATCGGTGCC
ATCGCGCGGT CGGTGGGTGG TGCTATTTCC GATAAATTCG GCGGCGTGCG GGTGACGTTG
ATCAACTTTA TTTTTATGGC GATTTTCAGC GCCCTGCTGT TCCTTACCTT ACCGGGCACA
GGCTCCGGTA ACTTCATCGC CTTTTACGCC GTATTTATGG GGCTGTTTCT GACCGCGGGT
CTGGGAAGTG GTTCTACTTT CCAGATGATC GCCGTCATCT TTCGCCAGAT AACCATTTAT
CGGATGAAGA TGAAAGGCGG TAGTGATGAG CAAGCTCAGA GAGAAGCCGT CACCGAAACG
GCGGCGGCTC TGGGCTTTAT CTCAGCCATC GGCGCAGTGG GCGGCTTTTT TATTCCGCAG
GCATTTGGCA TGTCACTCAA TATGACCGGC TCTCCAGTGG GCGCGATGAA AGTGTTTTTA
ATCTTCTACA TCGTTTGTGT GCTACTGACC TGGCTGGTTT ATGGTCGGCG GAAGTTCAGC
CAAAAATAA
 
Protein sequence
MALQNEKNSR YLLRDWKPEN PAFWKNKGKH IARRNLWISV SCLLLAFCVW MLFSAVTVNL 
NKIGFNFTTD QLFLLTALPS VSGALLRVPY SFMVPIFGGR RWTVFSTAIL IIPCVWLGIA
VQNPNTPFGI FIVIALLCGF AGANFASSMG NISFFFPKAR QGSALGINGG LGNLGVSVMQ
LVAPLVIFVP VFAFLGVNGV PQADGSVMSL ANAAWIWVPL LAIATIAAWS GMNDIASSRA
SIADQLPVLQ RLHLWLLSLL YLATFGSFIG FSAGFAMLAK TQFPDVNILR LAFFGPFIGA
IARSVGGAIS DKFGGVRVTL INFIFMAIFS ALLFLTLPGT GSGNFIAFYA VFMGLFLTAG
LGSGSTFQMI AVIFRQITIY RMKMKGGSDE QAQREAVTET AAALGFISAI GAVGGFFIPQ
AFGMSLNMTG SPVGAMKVFL IFYIVCVLLT WLVYGRRKFS QK