Gene EcDH1_3578 details


Gene Information

Locus tag: EcDH1_3578
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3852722
End bp: 3853888
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: Na+/H+ antiporter NhaA
Protein accession: ACX41191
Protein GI: 260450769
COG category:
COG ID:
TIGRFAM ID:
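The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length; the function names are illustrative only and do not come from the record.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is not counted."""
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields.
gene_len = cds_length(3852722, 3853888)
print(gene_len)                  # 1167
print(protein_length(gene_len))  # 388
```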


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00256476
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACATC TGCATCGATT CTTTAGCAGT GATGCCTCGG GAGGCATTAT TCTTATCATT 
GCCGCTATCC TGGCGATGAT TATGGCCAAC AGCGGCGCAA CCAGTGGATG GTATCACGAC
TTTCTGGAGA CGCCGGTTCA GCTCCGGGTT GGTTCACTCG AAATCAACAA AAACATGCTG
TTATGGATAA ATGACGCGCT GATGGCGGTA TTTTTCCTGT TAGTCGGTCT GGAAGTTAAA
CGTGAACTGA TGCAAGGATC GCTAGCCAGC TTACGCCAGG CCGCATTTCC AGTTATCGCC
GCTATTGGTG GGATGATTGT GCCGGCATTA CTCTATCTGG CTTTTAACTA TGCCGATCCG
ATTACCCGCG AAGGGTGGGC GATCCCGGCG GCTACTGACA TTGCTTTTGC ACTTGGTGTA
CTGGCGCTGT TGGGAAGTCG TGTTCCGTTA GCGCTGAAGA TCTTTTTGAT GGCTCTGGCT
ATTATCGACG ATCTTGGGGC CATCATTATC ATCGCATTGT TCTACACTAA TGACTTATCG
ATGGCCTCTC TTGGCGTCGC GGCTGTAGCA ATTGCGGTAC TCGCGGTATT GAATCTGTGT
GGTGCACGCC GCACGGGCGT CTATATTCTT GTTGGCGTGG TGTTGTGGAC TGCGGTGTTG
AAATCGGGGG TTCACGCAAC TCTGGCGGGG GTAATTGTCG GCTTCTTTAT TCCTTTGAAA
GAGAAGCATG GGCGTTCTCC AGCGAAGCGA CTGGAGCATG TGTTGCACCC GTGGGTGGCG
TATCTGATTT TGCCGCTGTT TGCATTTGCT AATGCTGGCG TTTCACTGCA AGGCGTCACG
CTGGATGGCT TGACCTCCAT TCTGCCATTG GGGATCATCG CTGGCTTGCT GATTGGCAAA
CCGCTGGGGA TTAGTCTGTT CTGCTGGTTG GCGCTGCGTT TGAAACTGGC GCATCTGCCT
GAGGGAACGA CTTATCAGCA AATTATGGTG GTGGGGATCC TGTGCGGTAT CGGTTTTACT
ATGTCTATCT TTATTGCCAG CCTGGCCTTT GGTAGCGTAG ATCCAGAACT GATTAACTGG
GCGAAACTCG GTATCCTGGT CGGTTCTATC TCTTCGGCGG TAATTGGATA CAGCTGGTTA
CGCGTTCGTT TGCGTCCATC AGTTTGA
 
Protein sequence
MKHLHRFFSS DASGGIILII AAILAMIMAN SGATSGWYHD FLETPVQLRV GSLEINKNML 
LWINDALMAV FFLLVGLEVK RELMQGSLAS LRQAAFPVIA AIGGMIVPAL LYLAFNYADP
ITREGWAIPA ATDIAFALGV LALLGSRVPL ALKIFLMALA IIDDLGAIII IALFYTNDLS
MASLGVAAVA IAVLAVLNLC GARRTGVYIL VGVVLWTAVL KSGVHATLAG VIVGFFIPLK
EKHGRSPAKR LEHVLHPWVA YLILPLFAFA NAGVSLQGVT LDGLTSILPL GIIAGLLIGK
PLGISLFCWL ALRLKLAHLP EGTTYQQIMV VGILCGIGFT MSIFIASLAF GSVDPELINW
AKLGILVGSI SSAVIGYSWL RVRLRPSV
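
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11, which match the standard code for internal codons, and reads an initial GTG as methionine. The start-codon set used here is a simplification (table 11 permits further initiation codons), and the names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplified set of initiation codons (assumption; table 11 allows more).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "GTGAAACATC TGCATCGATT CTTTAGCAGT GATGCCTCGG GAGGCATTAT TCTTATCATT"
print(translate_cds(first_60))  # MKHLHRFFSSDASGGIILII
```

The output matches the first 20 residues of the listed protein sequence. Passing the full gene sequence should reproduce the whole protein under the same assumptions.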