Gene EcDH1_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2026 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2184215 
End bp2185588 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content52% 
IMG OID 
Productsugar (Glycoside-Pentoside-Hexuronide) transporter 
Protein accessionACX39683 
Protein GI260449261 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.232812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAAC AACTCTCCTG GCGCACCATC GTCGGCTACA GCCTCGGTGA CGTCGCCAAT 
AACTTCGCCT TCGCAATGGG GGCGCTCTTC CTGTTGAGTT ACTACACCGA CGTCGCTGGC
GTCGGTGCCG CTGCGGCGGG CACCATGCTG TTACTGGTGC GGGTATTCGA TGCCTTCGCC
GACGTCTTTG CCGGACGAGT GGTGGACAGT GTGAATACCC GCTGGGGAAA ATTCCGCCCG
TTTTTACTCT TCGGTACTGC GCCGTTAATG ATCTTCAGCG TGCTGGTATT CTGGGTGCTG
ACCGACTGGA GCCATGGTAG CAAAGTGGTG TATGCATATT TGACCTACAT GGGCCTCGGG
CTTTGCTACA GCCTGGTGAA TATTCCTTAT GGTTCACTTG CTACCGCGAT GACCCAACAA
CCACAATCCC GCGCCCGTCT GGGCGCGGCT CGTGGGATTG CCGCTTCATT GACCTTTGTC
TGCCTGGCAT TTCTGATAGG ACCGAGCATT AAGAACTCCA GCCCGGAAGA GATGGTGTCG
GTATACCATT TCTGGACAAT TGTGCTGGCG ATTGCCGGAA TGGTGCTTTA CTTCATCTGC
TTCAAATCGA CGCGTGAGAA TGTGGTACGT ATCGTTGCGC AGCCGTCATT GAATATCAGT
CTGCAAACCC TGAAACGGAA TCGCCCGCTG TTTATGTTGT GCATCGGTGC GCTGTGTGTG
CTGATTTCGA CCTTTGCGGT CAGCGCCTCG TCGTTGTTCT ACGTGCGCTA TGTGTTAAAT
GATACCGGGC TGTTCACTGT GCTGGTACTG GTGCAAAACC TGGTTGGTAC TGTGGCATCG
GCACCGCTGG TGCCGGGGAT GGTCGCGAGG ATCGGTAAAA AGAATACCTT CCTGATTGGC
GCTTTGCTGG GAACCTGCGG TTATCTGCTG TTCTTCTGGG TTTCCGTCTG GTCACTGCCG
GTGGCGTTGG TTGCGTTGGC CATCGCTTCA ATTGGTCAGG GCGTTACCAT GACCGTGATG
TGGGCGCTGG AAGCTGATAC CGTAGAATAC GGTGAATACC TGACCGGCGT GCGAATTGAA
GGGCTCACCT ATTCACTATT CTCATTTACC CGTAAATGCG GTCAGGCAAT CGGAGGTTCA
ATTCCTGCCT TTATTTTGGG GTTAAGCGGA TATATCGCCA ATCAGGTGCA AACGCCGGAA
GTTATTATGG GCATCCGCAC ATCAATTGCC TTAGTACCTT GCGGATTTAT GCTACTGGCA
TTCGTTATTA TCTGGTTTTA TCCGCTCACG GATAAAAAAT TCAAAGAAAT CGTGGTTGAA
ATTGATAATC GTAAAAAAGT GCAGCAGCAA TTAATCAGCG ATATCACTAA TTAA
 
Protein sequence
MNQQLSWRTI VGYSLGDVAN NFAFAMGALF LLSYYTDVAG VGAAAAGTML LLVRVFDAFA 
DVFAGRVVDS VNTRWGKFRP FLLFGTAPLM IFSVLVFWVL TDWSHGSKVV YAYLTYMGLG
LCYSLVNIPY GSLATAMTQQ PQSRARLGAA RGIAASLTFV CLAFLIGPSI KNSSPEEMVS
VYHFWTIVLA IAGMVLYFIC FKSTRENVVR IVAQPSLNIS LQTLKRNRPL FMLCIGALCV
LISTFAVSAS SLFYVRYVLN DTGLFTVLVL VQNLVGTVAS APLVPGMVAR IGKKNTFLIG
ALLGTCGYLL FFWVSVWSLP VALVALAIAS IGQGVTMTVM WALEADTVEY GEYLTGVRIE
GLTYSLFSFT RKCGQAIGGS IPAFILGLSG YIANQVQTPE VIMGIRTSIA LVPCGFMLLA
FVIIWFYPLT DKKFKEIVVE IDNRKKVQQQ LISDITN