Gene ECH74115_1673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1673 
SymbolnhaB 
ID6968270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1611333 
End bp1612874 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content53% 
IMG OID643385632 
Productsodium/proton antiporter 
Protein accessionYP_002270126 
Protein GI209400416 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3067] Na+/H+ antiporter 
TIGRFAM ID[TIGR00774] Na+/H+ antiporter NhaB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000094034 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.63338 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATCT CCTGGGGCCG CGCGCTATGG CGCAATTTTT TGGGCCAGTC CCCCGACTGG 
TACAAACTCG CCCTCATCAT TTTCTTAATC GTAAATCCGT TAATTTTCCT CATCAGCCCT
TTCGTCGCTG GCTGGTTGCT GGTAGCGGAA TTTATTTTCA CTCTGGCGAT GGCCCTGAAA
TGCTACCCGC TGCTCCCCGG TGGCCTGTTG GCTATCGAAG CGGTATTCAT CGGCATGACC
AGCGCGGAAC ACGTCCGTGA AGAGGTGGCG GCAAATCTTG AAGTCTTGCT GTTACTGATG
TTTATGGTGG CGGGTATCTA TTTTATGAAA CAGCTATTGC TGTTCATATT TACCCGTTTG
CTGTTAAGCA TTCGCTCCAA AATGCTGCTG TCGCTCTCTT TTTGCGTAGC GGCAGCGTTC
CTCTCCGCGT TCCTCGATGC CTTAACCGTC GTGGCGGTGG TGATCAGCGT TGCCGTCGGT
TTTTATGGTA TTTATCATCG CGTAGCCTCT TCCCGTACCG AAGACACCGA CCTGCAAGAC
GATAGTCATA TCGACAAGCA TTACAAAGTG GTTCTGGAAC AGTTCCGTGG CTTTCTGCGT
AGCCTGATGA TGCATGCCGG TGTCGGCACC GCATTAGGCG GCGTAATGAC CATGGTGGGC
GAACCACAGA ACCTGATCAT CGCTAAAGCG GCTGGCTGGC ATTTTGGCGA TTTCTTCCTG
CGCATGTCGC CGGTGACCGT TCCGGTTCTG ATTTGTGGCC TGTTAACCTG CCTGCTGGTA
GAGAAGCTGC GTTGGTTTGG CTACGGTGAA ACGCTGCCGG AGAAAGTCCG CGAAGTGTTG
CAACAGTTTG ACGATCAAAG CCGCCACCAG CGTACCCGTC AGGATAAAAT CCGTCTGATT
GTCCAGGCGA TTATTGGCGT CTGGCTGGTG ACTGCGCTGG CGTTGCATCT GGCGGAAGTT
GGCTTGATTG GTTTGTCAGT CATTATTCTG GCAACATCAT TGACCGGTGT CACCGATGAA
CATGCTATCG GTAAAGCCTT CACCGAATCT CTGCCATTCA CCGCACTGTT GACGGTGTTT
TTCTCGGTAG TCGCGGTGAT TATCGACCAA CAACTGTTTT CGCCGATTAT TCAGTTTGTG
TTGCAGGCAT CGGAACATGC TCAGCTGTCG CTGTTCTATA TTTTCAACGG TCTGCTGTCA
TCCATTTCGG ATAACGTCTT CGTGGGGACG ATTTATATCA ACGAAGCGAA AGCGGCAATG
GAAAGTGGCG CTATCACGTT GAAGCAATAC GAGCTGCTGG CGGTCGCCAT TAATACCGGT
ACCAATCTGC CCTCTGTCGC TACGCCGAAC GGTCAGGCTG CGTTCCTGTT CCTGCTGACC
TCTGCACTCG CGCCATTGAT TCGCCTCTCT TATGGCCGCA TGGTGTGGAT GGCCCTGCCT
TACACCCTCG TCCTGACACT CGTCGGACTG CTCTGCGTCG AGTTTACGCT TGCCCCTGTA
ACCGAATGGT TTATGCAAAT GGGCTGGATA GCAACGCTTT GA
 
Protein sequence
MEISWGRALW RNFLGQSPDW YKLALIIFLI VNPLIFLISP FVAGWLLVAE FIFTLAMALK 
CYPLLPGGLL AIEAVFIGMT SAEHVREEVA ANLEVLLLLM FMVAGIYFMK QLLLFIFTRL
LLSIRSKMLL SLSFCVAAAF LSAFLDALTV VAVVISVAVG FYGIYHRVAS SRTEDTDLQD
DSHIDKHYKV VLEQFRGFLR SLMMHAGVGT ALGGVMTMVG EPQNLIIAKA AGWHFGDFFL
RMSPVTVPVL ICGLLTCLLV EKLRWFGYGE TLPEKVREVL QQFDDQSRHQ RTRQDKIRLI
VQAIIGVWLV TALALHLAEV GLIGLSVIIL ATSLTGVTDE HAIGKAFTES LPFTALLTVF
FSVVAVIIDQ QLFSPIIQFV LQASEHAQLS LFYIFNGLLS SISDNVFVGT IYINEAKAAM
ESGAITLKQY ELLAVAINTG TNLPSVATPN GQAAFLFLLT SALAPLIRLS YGRMVWMALP
YTLVLTLVGL LCVEFTLAPV TEWFMQMGWI ATL