Gene ECH74115_1707 details

Gene Information

Locus tag: ECH74115_1707
Symbol: narK
ID: 6972216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1644799
End bp: 1646190
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 53%
IMG OID: 643385664
Product: nitrite extrusion protein 1
Protein accession: YP_002270158
Protein GI: 209400219
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00886] nitrite extrusion protein (nitrite facilitator)
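
The start, end, and length fields in this record can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1644799
end_bp = 1646190

# Inclusive span: both the start and the end position are counted.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the final codon is the stop and yields no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1392
print(protein_length)  # 463
```

Both values agree with the Gene Length and Protein Length fields.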


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0135678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCACT CATCCGCCCC CGAAAGGGCT ACTGGAGCTG TCATTACAGA TTGGCGACCG 
GAAGATCCTG CGTTCTGGCA ACAACGCGGT CAACGTATTG CCAGCCGCAA CCTGTGGATT
TCCGTTCCCT GTCTGCTGCT GGCGTTTTGC GTATGGATGT TGTTCAGCGC TGTTGCGGTG
AACTTACCGA AAGTCGGTTT TAATTTTACG ACCGATCAGC TATTTATGTT GACTGCGTTG
CCTTCGGTTT CTGGCGCGTT ATTACGTGTT CCATACTCCT TTATGGTTCC TATCTTCGGT
GGTCGTCGCT GGACGGCGTT CAGCACCGGT ATTCTGATTA TTCCTTGTGT CTGGCTGGGT
TTTGCCGTGC AGGATACCTC CACGCCTTAT AGCGTCTTCA TCATCATCTC TCTGCTGTGC
GGCTTTGCTG GCGCGAACTT CGCATCCAGT ATGGCAAACA TCAGCTTCTT CTTTCCGAAA
CAGAAGCAGG GTGGCGCGCT GGGCCTGAAT GGTGGTCTGG GCAACATGGG CGTTAGCGTT
ATGCAGTTGG TTGCTCCGCT GGTGGTATCA CTGTCGATTT TCGCAGTATT TGGTAGCCAG
GGCGTCAAAC AGCCGGATGG GACTGAGCTG TATCTGGCGA ATGCGTCCTG GATATGGGTG
CCGTTCCTTG CCATCTTCAC CATTGCGGCG TGGTTTGGCA TGAACGATCT TGCTACCTCG
AAAGCCTCCA TCAAGGAGCA GTTGCCGGTA CTCAAACGGG GTCATCTGTG GATTATGAGC
CTGCTGTATC TGGCAACCTT CGGTTCCTTC ATCGGCTTCT CCGCGGGCTT TGCGATGCTG
TCAAAAACGC AGTTCCCGGA TGTTCAGATT CTGCAATACG CTTTCTTCGG GCCGTTTATT
GGTGCGCTGG CGCGTTCTGC AGGTGGTGCA TTATCTGACC GTCTGGGCGG AACTCGTGTC
ACGCTGGTGA ACTTTATTCT GATGGCGATT TTCAGCGGCC TGCTGTTCCT GACCTTACCG
ACTGACGGGC AGGGCGGAAG CTTCATGGCG TTCTTCGCGG TCTTCCTGGC GCTGTTCCTG
ACAGCTGGGC TGGGTAGTGG TTCCACTTTC CAGATGATTT CCGTGATCTT CCGTAAACTG
ACAATGGATC GCGTGAAAGC AGAAGGGGGT TCTGACGAAC GTGCGATGCG TGAAGCGGCA
ACCGACACGG CGGCGGCGTT GGGTTTCATC TCTGCGATTG GCGCGATTGG TGGCTTCTTT
ATCCCGAAAG CGTTCGGTAG CTCGCTGGCA TTAACGGGTT CGCCAGTCGG CGCAATGAAA
GTATTTTTGA TTTTCTATAT CGCCTGCGTA GTGATTACCT GGGCGGTATA TGGTCGGCAT
TCTAAAAAAT AA
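
The GC content field can be recomputed from a sequence printed in this spaced, ten-base-block layout. This is a minimal sketch: `gc_percent` is a hypothetical helper, not a tool used by the database, and it assumes the sequence holds only unambiguous A, C, G, and T characters plus whitespace.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # drop block spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)

# First ten-base block of the gene sequence: 2 G + 2 C out of 10 bases.
print(gc_percent("ATGAGTCACT"))  # 40.0
```

Passing the whole gene sequence to the same function would be expected to give a value near the 53% listed in the record, though that figure is not recomputed here.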
 
Protein sequence
MSHSSAPERA TGAVITDWRP EDPAFWQQRG QRIASRNLWI SVPCLLLAFC VWMLFSAVAV 
NLPKVGFNFT TDQLFMLTAL PSVSGALLRV PYSFMVPIFG GRRWTAFSTG ILIIPCVWLG
FAVQDTSTPY SVFIIISLLC GFAGANFASS MANISFFFPK QKQGGALGLN GGLGNMGVSV
MQLVAPLVVS LSIFAVFGSQ GVKQPDGTEL YLANASWIWV PFLAIFTIAA WFGMNDLATS
KASIKEQLPV LKRGHLWIMS LLYLATFGSF IGFSAGFAML SKTQFPDVQI LQYAFFGPFI
GALARSAGGA LSDRLGGTRV TLVNFILMAI FSGLLFLTLP TDGQGGSFMA FFAVFLALFL
TAGLGSGSTF QMISVIFRKL TMDRVKAEGG SDERAMREAA TDTAAALGFI SAIGAIGGFF
IPKAFGSSLA LTGSPVGAMK VFLIFYIACV VITWAVYGRH SKK
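
The relationship between the gene sequence and the protein sequence can be illustrated with a small translator. This sketch builds the codon table by hand rather than using a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGTCACT CATCCGCCCC CGAAAGGGCT"))  # MSHSSAPERA
# The final line of the gene sequence ends with the TAA stop codon.
print(translate("TCTAAAAAAT AA"))  # SKK
```

Each line of the gene sequence holds 60 bases (20 codons), so every line starts in frame and can be translated on its own for a spot check.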