Gene EcolC_3638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3638 
SymbolnhaA 
ID6066352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3983646 
End bp3984812 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content52% 
IMG OID641603053 
ProductpH-dependent sodium/proton antiporter 
Protein accessionYP_001726576 
Protein GI170021622 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3004] Na+/H+ antiporter 
TIGRFAM ID[TIGR00773] Na+/H+ antiporter NhaA 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACATC TGCATCGATT CTTTAGCAGT GATGCCTCGG GAGGCATTAT TCTTATCATT 
GCCGCTATCC TGGCGATGAT GATGGCCAAC AGCGGCGCAA CCAGTGGATG GTATCACGAC
TTTCTGGAGA CGCCGGTTCA GCTCCGGGTT GGTTCACTCG AAATCAACAA AAACATGCTG
TTATGGATAA ATGACGCGCT GATGGCGGTA TTTTTCCTGT TAGTCGGTCT GGAAGTTAAA
CGTGAACTGA TGCAAGGATC GCTAGCCAGC TTACGCCAGG CCGCATTTCC AGTTATCGCC
GCTATTGGTG GGATGATTGT GCCGGCATTA CTCTATCTGG CCTTTAACTA TGCCGATCCG
ATTACCCGCG AAGGGTGGGC GATCCCGGCG GCTACTGACA TTGCCTTTGC ACTTGGTGTG
CTGGCGCTGT TGGGAAGTCG TGTTCCGTTA GCACTGAAGA TCTTTTTGAT GGCTCTGGCT
ATTATCGACG ATCTTGGGGC CATCATTATC ATCGCATTGT TCTACACTAA TGACTTATCG
ATGGCCTCTC TTGGCGTCGC GGCTGTAGCA ATTGCGGTAC TCGCGGTATT GAATCTGTGT
GGTGTACGCC GCACGGGCGT CTATATTCTG GTTGGCGTAG TGTTGTGGAC TGCGGTGTTG
AAATCGGGGG TTCACGCAAC CCTGGCGGGG GTAATTGTCG GCTTCTTTAT TCCTTTGAAA
GAGAAGCATG GGCGTTCTCC AGCGAAGCGA CTGGAGCATG TGTTGCACCC GTGGGTGGCG
TATCTGATTT TACCGCTGTT TGCATTTGCT AATGCTGGCG TTTCACTGCA AGGTGTCACG
CTGGATGGCT TGACCTCCAT TCTGCCATTG GGGATCATCG CTGGCTTGCT GATTGGCAAA
CCGCTGGGGA TTAGTCTGTT CTGCTGGTTG GCGCTGCGTT TGAAACTGGC GCATCTGCCT
GAGGGAACGA CTTATCAGCA AATTATGGCG GTGGGGATCC TGTGCGGTAT CGGTTTTACT
ATGTCTATCT TTATTGCCAG CCTGGCCTTT GGTAGCGTAG ATCCAGAACT GATTAACTGG
GCGAAACTCG GTATCCTGGT CGGTTCTATC TCTTCGGCGG TAATTGGATA CAGCTGGTTA
CGCGTTCGTT TGCGTCCATC AGTGTGA
 
Protein sequence
MKHLHRFFSS DASGGIILII AAILAMMMAN SGATSGWYHD FLETPVQLRV GSLEINKNML 
LWINDALMAV FFLLVGLEVK RELMQGSLAS LRQAAFPVIA AIGGMIVPAL LYLAFNYADP
ITREGWAIPA ATDIAFALGV LALLGSRVPL ALKIFLMALA IIDDLGAIII IALFYTNDLS
MASLGVAAVA IAVLAVLNLC GVRRTGVYIL VGVVLWTAVL KSGVHATLAG VIVGFFIPLK
EKHGRSPAKR LEHVLHPWVA YLILPLFAFA NAGVSLQGVT LDGLTSILPL GIIAGLLIGK
PLGISLFCWL ALRLKLAHLP EGTTYQQIMA VGILCGIGFT MSIFIASLAF GSVDPELINW
AKLGILVGSI SSAVIGYSWL RVRLRPSV