Gene EcSMS35_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0017 
SymbolnhaA 
ID6146923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp19950 
End bp21116 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content51% 
IMG OID641614918 
ProductpH-dependent sodium/proton antiporter 
Protein accessionYP_001742134 
Protein GI170684120 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3004] Na+/H+ antiporter 
TIGRFAM ID[TIGR00773] Na+/H+ antiporter NhaA 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.453914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACATC TGCATCGATT CTTTAGCAGT GATGCCTCGG GAGGCATTAT TCTTATCATT 
GCCGCTATCC TGGCGATGAT TATGGCCAAT AGCGGCGCAA CCAGTGGATG GTATCACGAC
TTTCTGGAGA CGCCGGTTCA GCTCCGGGTT GGTTCACTCG AAATCAACAA AAACATGCTG
TTATGGATAA ATGACGCGCT GATGGCGGTA TTTTTCCTGT TAGTCGGTCT GGAAGTTAAA
CGTGAACTGA TGCAAGGATC GCTAGCCAGC TTACGCCAGG CCGCATTTCC AGTTATCGCC
GCTATTGGTG GGATGATTGT GCCGGCATTG CTCTATCTGG CCTTTAACTA TGCCGATCCG
ATTACCCGCG AAGGGTGGGC GATCCCGGCG GCTACTGACA TTGCCTTTGC ACTTGGTGTG
CTGGCGCTGT TGGGAAGTCG TGTTCCGTTA GCGCTGAAGA TCTTTTTGAT GGCTCTGGCT
ATTATCGACG ATCTTGGGGC CATCATTATC ATCGCATTGT TCTACACTAA TGACTTATCG
ATGGCGTCTC TCGGCGTTGC GGCAGTAGCG ATTGCCGTAC TCGCGGTACT GAATATCTGT
GGTGTTCGTC GCACGGGCGT TTATATTCTG GTTGGCGTGG TACTGTGGAC AGCGGTGTTG
AAATCAGGTG TCCATGCAAC GCTGGCTGGC GTCATTGTCG GCTTCTTTAT TCCGTTAAAA
GAGAAGCATG GGCGCTCTCC GGCTAAACGT CTGGAGCATG TTTTGCATCC GTGGGTGGCA
TATCTGATTT TGCCGTTGTT TGCGTTTGCC AATGCGGGGG TATCTCTGCA AGGCGTGACG
CTGGATGGCC TGACCTCCAT TCTGCCATTA GGGATCATCG CCGGTTTGCT GATTGGCAAG
CCACTGGGTA TTAGTCTGTT CTGCTGGTTG GCGCTGCGTT TGAAACTGGC GCATCTGCCT
GAGGGAACGA CTTATCAGCA AATTATGGCG GTGGGGATCC TATGCGGTAT CGGTTTTACT
ATGTCTATCT TTATTGCCAG CCTTGCCTTT GGTAGCGTAG ATCCAGAACT GATTAACTGG
GCGAAACTCG GTATCCTGGT CGGTTCTATC TCTTCGGCGG TAATTGGATA CAGCTGGTTA
CGCGTTCGTT TGCGTCCATC AGTTTGA
 
Protein sequence
MKHLHRFFSS DASGGIILII AAILAMIMAN SGATSGWYHD FLETPVQLRV GSLEINKNML 
LWINDALMAV FFLLVGLEVK RELMQGSLAS LRQAAFPVIA AIGGMIVPAL LYLAFNYADP
ITREGWAIPA ATDIAFALGV LALLGSRVPL ALKIFLMALA IIDDLGAIII IALFYTNDLS
MASLGVAAVA IAVLAVLNIC GVRRTGVYIL VGVVLWTAVL KSGVHATLAG VIVGFFIPLK
EKHGRSPAKR LEHVLHPWVA YLILPLFAFA NAGVSLQGVT LDGLTSILPL GIIAGLLIGK
PLGISLFCWL ALRLKLAHLP EGTTYQQIMA VGILCGIGFT MSIFIASLAF GSVDPELINW
AKLGILVGSI SSAVIGYSWL RVRLRPSV