Gene EcDH1_0012 details

Gene Information

Locus tag: EcDH1_0012
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 12032
End bp: 13324
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: d-galactonate transporter
Protein accession: ACX37713
Protein GI: 260447291
COG category:
COG ID:
TIGRFAM ID:
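
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption; the record does not state its coordinate convention). A minimal Python sketch of that arithmetic follows. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(12032, 13324)
print(length, protein_length(length))  # 1293 430
```

Both values match the Gene Length and Protein Length fields reported in the record.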


Plasmid Coverage information

Num covering plasmid clones: 71
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG 
ATCTTTATTA CGGTAGTCAT TTGTTATGTC GACCGCGCCA ACCTGGCCGT GGCTTCCGCC
CATATTCAGG AAGAGTTCGG CATTACCAAA GCGGAAATGG GCTATGTATT TTCGGCCTTC
GCCTGGCTTT ATACGCTATG TCAGATCCCC GGCGGTTGGT TTTTAGATCG CGTAGGTTCT
CGCGTGACTT ATTTTATTGC GATATTTGGC TGGTCAGTGG CGACTTTATT CCAGGGCTTT
GCCACGGGCT TAATGTCATT AATTGGTCTG CGCGCGATAA CCGGTATTTT CGAAGCGCCT
GCTTTCCCGA CCAATAACCG GATGGTGACC AGCTGGTTCC CGGAACATGA ACGCGCTTCT
GCCGTTGGTT TTTATACGTC TGGTCAGTTT GTCGGTCTGG CGTTTCTGAC GCCGCTGCTG
ATCTGGATTC AGGAGATGTT GAGCTGGCAC TGGGTGTTCA TTGTCACTGG TGGTATCGGC
ATTATCTGGT CGCTGATTTG GTTTAAGGTT TATCAGCCGC CGCGCCTGAC CAAAGGTATC
AGCAAAGCTG AACTGGATTA CATTCGTGAT GGCGGCGGTC TGGTGGATGG TGATGCGCCG
GTGAAGAAAG AGGCGCGTCA GCCGTTAACA GCCAAAGACT GGAAACTGGT GTTCCATCGT
AAACTGATCG GCGTTTATCT TGGGCAATTT GCGGTGGCTT CTACACTGTG GTTTTTCTTA
ACCTGGTTCC CGAACTATTT AACCCAGGAA AAAGGAATCA CGGCGCTGAA AGCAGGCTTT
ATGACCACGG TGCCATTCCT CGCGGCGTTT GTCGGCGTCC TGCTCTCTGG CTGGGTAGCG
GATCTGCTGG TACGTAAGGG CTTTTCACTG GGCTTTGCGC GTAAAACGCC GATTATCTGC
GGCTTGCTGA TCTCCACCTG CATTATGGGC GCTAACTACA CTAACGATCC GATGATGATT
ATGTGCCTGA TGGCGCTGGC ATTCTTCGGT AACGGTTTTG CTTCGATTAC CTGGTCGCTG
GTTTCTTCTC TGGCACCGAT GCGCCTGATT GGTTTAACCG GCGGCGTGTT TAACTTCGCC
GGTGGTCTGG GCGGCATCAC CGTTCCGCTG GTGGTGGGGT ACCTGGCGCA GGGTTACGGT
TTCGCACCTG CACTGGTTTA TATCTCCGCC GTCGCGTTGA TTGGCGCGCT CTCTTATATC
CTGCTGGTGG GCGATGTGAA GCGCGTTGGC TAA
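
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before the length or GC content can be compared with the fields in the record. The following is a hedged Python sketch of that cleanup. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so its GC value is not expected to equal the 53% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC content of this line: {gc_fraction(seq):.0%}")
```

Passing the full sequence text to `clean_sequence` should yield a string whose length can be checked against the Gene Length field.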
 
Protein sequence
MDIPVNAAKP GRRRYLTLVM IFITVVICYV DRANLAVASA HIQEEFGITK AEMGYVFSAF 
AWLYTLCQIP GGWFLDRVGS RVTYFIAIFG WSVATLFQGF ATGLMSLIGL RAITGIFEAP
AFPTNNRMVT SWFPEHERAS AVGFYTSGQF VGLAFLTPLL IWIQEMLSWH WVFIVTGGIG
IIWSLIWFKV YQPPRLTKGI SKAELDYIRD GGGLVDGDAP VKKEARQPLT AKDWKLVFHR
KLIGVYLGQF AVASTLWFFL TWFPNYLTQE KGITALKAGF MTTVPFLAAF VGVLLSGWVA
DLLVRKGFSL GFARKTPIIC GLLISTCIMG ANYTNDPMMI MCLMALAFFG NGFASITWSL
VSSLAPMRLI GLTGGVFNFA GGLGGITVPL VVGYLAQGYG FAPALVYISA VALIGALSYI
LLVGDVKRVG
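
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acid to each codon as the standard code and differs in which codons may act as start codons. This gene begins with ATG, so a plain codon lookup is enough here. The sketch below is an illustration under those assumptions, not the pipeline that produced the record. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino acid assignments are
# shared by translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGATATTC CCGTTAATGC AGCAAAGCCG"))  # MDIPVNAAKP
```

The output matches the first block of the protein sequence shown above. The final codons of the gene, GTT GGC TAA, translate to VG followed by a stop, which agrees with the last two residues of the protein.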