Gene EcSMS35_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1963 
SymbolnhaB 
ID6145508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1986997 
End bp1988538 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content52% 
IMG OID641616839 
Productsodium/proton antiporter 
Protein accessionYP_001744015 
Protein GI170680470 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3067] Na+/H+ antiporter 
TIGRFAM ID[TIGR00774] Na+/H+ antiporter NhaB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000140006 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATCT CCTGGGGCCG TGCCCTATGG CGCAACTTTT TGGGCCAGTC CCCCGACTGG 
TACAAACTCG CCCTCATTAT TTTCTTAATT CTTAATCCGT TAATTTTCAT CATCAGCCCC
TTTGTTGCCG GTTGGCTGCT GGTGGCGGAA TTTATTTTCA CTCTGGCGAT GGCCCTCAAA
TGCTACCCGC TACTCCCCGG TGGCCTGTTG GCTATCGAAG CGGTATTCAT CGGCATGACC
AGCGCGGAAC ATGTTCGTGA AGAGGTGGCG GCGAATCTTG AAGTCTTGCT GTTACTGATG
TTTATGGTGG CGGGTATCTA TTTTATGAAA CAGCTGTTGC TGTTCATATT TACCCGTTTG
CTGCTAAGCA TTCGCTCCAA AATGCTGCTG TCGCTCTCTT TTTGCATGGC GGCGGCGTTC
CTCTCCGCGT TCCTCGATGC CTTAACCGTC GTGGCGGTGG TGATCAGCGT CGCAGTCGGT
TTTTATGGTA TTTATCATCG TGTTGCCTCT TCCCGTACTG AAGATACCGA CCTGCAAGAC
GATAGTCATA TCGACAAGCA TTACAAAGTG GTTCTGGAAC AGTTCCGTGG CTTTTTGCGC
AGCCTGATGA TGCATGCCGG TGTTGGCACC GCATTAGGCG GCGTGATGAC CATGGTGGGC
GAACCACAGA ACTTGATCAT CGCTAAAGCA GCTGGCTGGC ATTTTGGCGA TTTCTTCCTG
CGCATGTCGC CAGTGACCGT TCCGGTTCTG ATTTGTGGCC TGTTGACCTG CCTGCTGGTG
GAGAAGCTGC GTTGTTTTGG CTACGGCGAA ACGCTGCCGG AGAAAGTCCG CGAAGTGCTG
CAACAGTTTG ACGATCAAAG TCGCCACCAG CGTACCCGTC AGGATAAAAT TCGTCTGATT
GTCCAGGCGA TTATCGGCGT CTGGCTGGTC ACCGCGCTGG CGTTGCATCT GGCGGAGGTT
GGCCTGATTG GTTTGTCGGT CATTATTCTG GCGACATCAT TGACTGGTGT CACTGATGAA
CATGCTATCG GTAAAGCCTT CACCGAATCT CTGCCATTCA CCGCACTGTT GACTGTCTTT
TTCTCGGTCG TCGCGGTAAT TATTGATCAG CAACTGTTCT CACCAATCAT TCATTTTGTC
CTACAGGCAT CGGAACACGC CCAGCTGTCG TTGTTCTATA TTTTTAACGG CCTGCTGTCA
TCCATTTCGG ATAATGTCTT CGTGGGGACG ATTTATATCA ACGAAGCGAA AGCGGCAATG
GAAAGTGGCG CTATCACGTT GAAGCAATAC GAGCTGCTGG CGGTCGCCAT TAATACCGGT
ACCAATCTGC CCTCTGTCGC TACACCGAAC GGTCAGGCTG CGTTCCTGTT CCTGCTGACC
TCTGCACTCG CGCCATTGAT TCGCCTCTCT TATGGCCGCA TGGTGTGGAT GGCCCTGCCT
TACACCCTCG TACTGACACT CGTCGGACTG CTCTGCGTCG AGTTTACGCT TGCCCCTGTA
ACCGAATGGT TTATGCAAAT GGGCTGGATA GCAACGCTTT GA
 
Protein sequence
MEISWGRALW RNFLGQSPDW YKLALIIFLI LNPLIFIISP FVAGWLLVAE FIFTLAMALK 
CYPLLPGGLL AIEAVFIGMT SAEHVREEVA ANLEVLLLLM FMVAGIYFMK QLLLFIFTRL
LLSIRSKMLL SLSFCMAAAF LSAFLDALTV VAVVISVAVG FYGIYHRVAS SRTEDTDLQD
DSHIDKHYKV VLEQFRGFLR SLMMHAGVGT ALGGVMTMVG EPQNLIIAKA AGWHFGDFFL
RMSPVTVPVL ICGLLTCLLV EKLRCFGYGE TLPEKVREVL QQFDDQSRHQ RTRQDKIRLI
VQAIIGVWLV TALALHLAEV GLIGLSVIIL ATSLTGVTDE HAIGKAFTES LPFTALLTVF
FSVVAVIIDQ QLFSPIIHFV LQASEHAQLS LFYIFNGLLS SISDNVFVGT IYINEAKAAM
ESGAITLKQY ELLAVAINTG TNLPSVATPN GQAAFLFLLT SALAPLIRLS YGRMVWMALP
YTLVLTLVGL LCVEFTLAPV TEWFMQMGWI ATL