Gene EcDH1_3335 details


Gene Information

Locus tag: EcDH1_3335
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3585377
End bp: 3586759
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: sugar (Glycoside-Pentoside-Hexuronide) transporter
Protein accession: ACX40957
Protein GI: 260450535
COG category:
COG ID:
TIGRFAM ID:
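
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3585377
END_BP = 3586759
GENE_LENGTH_BP = 1383
PROTEIN_LENGTH_AA = 460


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1383
print(coding_length_aa(GENE_LENGTH_BP))  # 460
```

Both results match the Gene Length and Protein Length fields reported above.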


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.736057
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCAAT TAACCATGAA AGACAAAATT GGCTACGGGC TGGGAGACAC CGCCTGCGGC 
TTCGTCTGGC AGGCCACGAT GTTCCTGCTG GCCTATTTCT ACACCGACGT CTTCGGCCTG
TCGGCGGGGA TTATGGGCAC GCTGTTTTTG GTCTCCCGCG TGCTCGACGC CGTCACCGAC
CCGCTGATGG GGCTGCTGGT AGACCGCACC CGCACGCGGC ACGGCCAGTT CCGCCCGTTC
CTGCTGTGGG GGGCCATCCC GTTCGGCATC GTCTGCGTGC TGACCTTCTA CACGCCGGAC
TTCTCCGCAC AGGGCAAGAT CATCTACGCC TGCGTGACCT ACATTCTCCT GACCCTGGTC
TACACCTTCG TTAACGTGCC GTACTGCGCC ATGCCGGGCG TCATCACCGC CGACCCGAAA
GAGCGTCACG CCCTGCAGTC CTGGCGCTTC TTCCTGGCGG CGGCGGGCTC GCTCGCTATC
AGCGGCATCG CGCTGCCGCT GGTGAGCATC ATCGGCAAAG GGGACGAGCA GGTGGGCTAC
TTCGGCGCCA TGTGCGTGCT GGGGCTGAGC GGCGTGGTGC TGCTCTACGT CTGCTTCTTC
ACGACCAAAG AGCGCTACAC CTTTGAGGTG CAGCCGGGCT CGTCGGTGGC GAAAGACCTT
AAGCTGCTGC TGGGCAACAG CCAGTGGCGC ATCATGTGCG CGTTCAAGAT GATGGCGACC
TGCTCCAACG TGGTGCGCGG CGGGGCGACG CTCTACTTCG TGAAATACGT GATGGATCAC
CCGGAGTTGG CGACCCAGTT TTTACTTTAC GGCAGCCTCG CCACCATGTT CGGCTCGCTT
TGCTCCTCAC GCCTGCTGGG CCGCTTCGAC CGCGTCACCG CCTTCAAGTG GATCATCGTC
GCCTACTCGC TGATCAGCCT GCTGATTTTC GTCACCCCGG CGGAGCACAT CGCGCTCATT
TTTGCCCTCA ACATCCTGTT CCTGTTCGTC TTTAATACCA CCACGCCGCT GCAGTGGCTG
ATGGCTTCTG ACGTGGTGGA CTACGAGGAG AGCCGCAGCG GTCGCCGCCT CGACGGGCTG
GTGTTCTCCA CCTACCTGTT CAGCCTGAAG ATTGGCCTGG CGATTGGCGG GGCGGTGGTG
GGCTGGATCC TGGCGTACGT CAACTATTCC GCCAACAGCA GCGTGCAGCC GGTTGAGGTG
CTCACCACCA TCAAAATTCT GTTCTGCGTG GTGCCGGTGG TGCTCTACGC GGGCATGTTC
ATCATGCTGT CGCTCTACAA GCTCACCGAT GCCCGCGTGG AGGCCATCAG CCGGCAGCTG
ATTAAGCACC GCGCGGCGCA GGGCGAGGCC GTTCCCGACG CCGCGACAGC CGCATCCCAT
TAA
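
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be cleaned before any statistic such as GC content is recomputed. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the page does not state how its own GC content figure was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Usage on the first two blocks of the gene sequence above.
first_blocks = "ATGACGCAAT TAACCATGAA"
print(clean_sequence(first_blocks))  # ATGACGCAATTAACCATGAA
print(gc_fraction(first_blocks))     # 0.35
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.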
 
Protein sequence
MTQLTMKDKI GYGLGDTACG FVWQATMFLL AYFYTDVFGL SAGIMGTLFL VSRVLDAVTD 
PLMGLLVDRT RTRHGQFRPF LLWGAIPFGI VCVLTFYTPD FSAQGKIIYA CVTYILLTLV
YTFVNVPYCA MPGVITADPK ERHALQSWRF FLAAAGSLAI SGIALPLVSI IGKGDEQVGY
FGAMCVLGLS GVVLLYVCFF TTKERYTFEV QPGSSVAKDL KLLLGNSQWR IMCAFKMMAT
CSNVVRGGAT LYFVKYVMDH PELATQFLLY GSLATMFGSL CSSRLLGRFD RVTAFKWIIV
AYSLISLLIF VTPAEHIALI FALNILFLFV FNTTTPLQWL MASDVVDYEE SRSGRRLDGL
VFSTYLFSLK IGLAIGGAVV GWILAYVNYS ANSSVQPVEV LTTIKILFCV VPVVLYAGMF
IMLSLYKLTD ARVEAISRQL IKHRAAQGEA VPDAATAASH
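
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that translation is given below. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the permitted start codons); the function name `translate` is illustrative.

```python
from itertools import product

# Codon assignments shared by the standard code and NCBI table 11,
# listed in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage on the first three blocks of the gene sequence above.
print(translate("ATGACGCAAT TAACCATGAA AGACAAAATT"))  # MTQLTMKDKI
```

The output matches the first block of the protein sequence listed above.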