Gene EcDH1_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0032 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp30978 
End bp32666 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content56% 
IMG OID 
Productacetolactate synthase, large subunit, biosynthetic type 
Protein accessionACX37730 
Protein GI260447308 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones80 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGTT CGGGCACAAC ATCGACGCGT AAGCGCTTTA CCGGCGCAGA ATTTATCGTT 
CATTTCCTGG AACAGCAGGG CATTAAGATT GTGACAGGCA TTCCGGGCGG TTCTATCCTG
CCTGTTTACG ATGCCTTAAG CCAAAGCACG CAAATCCGCC ATATTCTGGC CCGTCATGAA
CAGGGCGCGG GCTTTATCGC TCAGGGAATG GCGCGCACCG ACGGTAAACC GGCGGTCTGT
ATGGCCTGTA GCGGACCGGG TGCGACTAAC CTGGTGACCG CCATTGCCGA TGCGCGGCTG
GACTCCATCC CGCTGATTTG CATCACTGGT CAGGTTCCCG CCTCGATGAT CGGCACCGAC
GCCTTCCAGG AAGTGGACAC CTACGGCATC TCTATCCCCA TCACCAAACA CAACTATCTG
GTCAGACATA TCGAAGAACT CCCGCAGGTC ATGAGCGATG CCTTCCGCAT TGCGCAATCA
GGCCGCCCAG GCCCGGTGTG GATAGACATT CCTAAGGATG TGCAAACGGC AGTTTTTGAG
ATTGAAACAC AGCCCGCTAT GGCAGAAAAA GCCGCCGCCC CCGCCTTTAG CGAAGAAAGC
ATTCGTGACG CAGCGGCGAT GATTAACGCT GCCAAACGCC CGGTGCTTTA TCTGGGCGGC
GGTGTGATCA ATGCGCCCGC ACGGGTGCGT GAACTGGCGG AGAAAGCGCA ACTGCCTACC
ACCATGACTT TAATGGCGCT GGGCATGTTG CCAAAAGCGC ATCCGTTGTC GCTGGGTATG
CTGGGGATGC ACGGCGTGCG CAGCACCAAC TATATTTTGC AGGAGGCGGA TTTGTTGATA
GTGCTCGGTG CGCGTTTTGA TGACCGGGCG ATTGGCAAAA CCGAGCAGTT CTGTCCGAAT
GCCAAAATCA TTCATGTCGA TATCGACCGT GCAGAGCTGG GTAAAATCAA GCAGCCGCAC
GTGGCGATTC AGGCGGATGT TGATGACGTG CTGGCGCAGT TGATCCCGCT GGTGGAAGCG
CAACCGCGTG CAGAGTGGCA CCAGTTGGTA GCGGATTTGC AGCGTGAGTT TCCGTGTCCA
ATCCCGAAAG CGTGCGATCC GTTAAGCCAT TACGGCCTGA TCAACGCCGT TGCCGCCTGT
GTCGATGACA ATGCAATTAT CACCACCGAC GTTGGTCAGC ATCAGATGTG GACCGCGCAA
GCTTATCCGC TCAATCGCCC ACGCCAGTGG CTGACCTCCG GTGGGCTGGG CACGATGGGT
TTTGGCCTGC CTGCGGCGAT TGGCGCTGCG CTGGCGAACC CGGATCGCAA AGTGTTGTGT
TTCTCCGGCG ACGGCAGCCT GATGATGAAT ATTCAGGAGA TGGCGACCGC CAGTGAAAAT
CAGCTGGATG TCAAAATCAT TCTGATGAAC AACGAAGCGC TGGGGCTGGT GCATCAGCAA
CAGAGTCTGT TCTACGAGCA AGGCGTTTTT GCCGCCACCT ATCCGGGCAA AATCAACTTT
ATGCAGATTG CCGCCGGATT CGGCCTCGAA ACCTGTGATT TGAATAACGA AGCCGATCCG
CAGGCTTCAT TGCAGGAAAT CATCAATCGC CCTGGCCCGG CGCTGATCCA TGTGCGCATT
GATGCCGAAG AAAAAGTTTA CCCGATGGTG CCGCCAGGTG CGGCGAATAC TGAAATGGTG
GGGGAATAA
 
Protein sequence
MASSGTTSTR KRFTGAEFIV HFLEQQGIKI VTGIPGGSIL PVYDALSQST QIRHILARHE 
QGAGFIAQGM ARTDGKPAVC MACSGPGATN LVTAIADARL DSIPLICITG QVPASMIGTD
AFQEVDTYGI SIPITKHNYL VRHIEELPQV MSDAFRIAQS GRPGPVWIDI PKDVQTAVFE
IETQPAMAEK AAAPAFSEES IRDAAAMINA AKRPVLYLGG GVINAPARVR ELAEKAQLPT
TMTLMALGML PKAHPLSLGM LGMHGVRSTN YILQEADLLI VLGARFDDRA IGKTEQFCPN
AKIIHVDIDR AELGKIKQPH VAIQADVDDV LAQLIPLVEA QPRAEWHQLV ADLQREFPCP
IPKACDPLSH YGLINAVAAC VDDNAIITTD VGQHQMWTAQ AYPLNRPRQW LTSGGLGTMG
FGLPAAIGAA LANPDRKVLC FSGDGSLMMN IQEMATASEN QLDVKIILMN NEALGLVHQQ
QSLFYEQGVF AATYPGKINF MQIAAGFGLE TCDLNNEADP QASLQEIINR PGPALIHVRI
DAEEKVYPMV PPGAANTEMV GE