Gene EcolC_0028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0028 
Symbol 
ID6068661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp28583 
End bp30271 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content56% 
IMG OID641599432 
Productacetolactate synthase catalytic subunit 
Protein accessionYP_001723042 
Protein GI170018088 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGTT CGGGCACAAC ATCGACGCGT AAGCGCTTTA CCGGCGCAGA ATTTATCGTT 
CATTTCCTGG AACAGCAGGG CATTAAGATT GTGACGGGCA TTCCGGGCGG TTCTATCCTG
CCTGTTTACG ATGCCTTAAG CCAAAGTACG CAAATCCGCC ATATTCTGGC TCGCCATGAA
CAGGGCGCGG GATTTATCGC TCAGGGAATG GCGCGCACCG ACGGTAAACC GGCGGTCTGT
ATGGCCTGTA GCGGACCGGG TGCGACTAAC CTGGTGACCG CCATTGCCGA TGCGCGGCTG
GACTCCATCC CGCTGATTTG CATCACTGGT CAGGTTCCCG CCTCGATGAT CGGCACCGAC
GCCTTCCAGG AAGTCGACAC CTACGGCATC TCTATCCCCA TCACCAAACA CAACTATCTG
GTCAGACATA TCGAAGAACT CCCGCAGGTC ATGAGCGATG CCTTCCGCAT TGCGCAATCA
GGCCGCCCAG GCCCGGTGTG GATAGACATT CCTAAGGATG TGCAAACGGC GGTTTTTGAG
ATTGAAGCTC AGCCCGCGGT GGCAGAAAAA GCCGCTGCAC CCGCCTTTAG CGAAGAAAGC
ATTCGTGACG CAGCTACAAT GATTAACGCT GCCAAACGCC CGGTGCTTTA TCTGGGTGGT
GGTGTGATCA ATGCGCCTGC GCGGGTGCGT GAACTGGCGG AGAAAGCGCA ACTGCCTACC
ACCATGACTT TAATGGCGCT GGGCATGCTG CCAAAAGCGC ATCCGTTGTC GCTGGGTATG
CTGGGGATGC ACGGCGTGCG CAGCACTAAC TATATCTTGC AGGAGGCGGA TTTACTGATT
GTGCTCGGTG CGCGTTTTGA TGACCGGGCG ATTGGCAAAA CCGAGCAGTT CTGTCCGAAT
GCCAAAATCA TTCATGTCGA TATCGACCGT GCAGAGCTGG GTAAAATCAA GCAGCCGCAT
GTGGCGATTC AGGCGGATGT TGATGACGTG CTGGCGCAGT TGATCCCGCT GGTGGAAGCG
CAACCGCGTG CAGAGTGGCA CCAGTTGGTA GCGGATTTGC AGCGTGAGTT TCCGTGTCCA
ATCCCGAAAG CGTGCGATCC ATTAAGCCAT TACGGCCTGA TCAACGCCGT TGCCGCCTGT
GTCGATGACA ATGCGATTAT CACCACCGAT GTGGGGCAGC ATCAGATGTG GACCGCGCAA
GCTTATCCGC TCAATCGCCC ACGCCAGTGG CTGACCTCCG GTGGGCTGGG CACGATGGGT
TTTGGCCTGC CTGCGGCGAT TGGCGCGGCG CTGGCGAACC CGGATCGCAA AGTGTTGTGT
TTCTCCGGCG ACGGCAGCCT GATGATGAAT ATTCAGGAGA TGGCGACCGC CAGTGAAAAT
CAGCTGGATG TCAAAATCAT TCTGATGAAC AACGAAGCGC TGGGGCTGGT GCATCAGCAA
CAGAGTCTGT TCTACGAGCA AGGCGTTTTT GCCGCCACCT ATCCGGGCAA AATCAACTTT
ATGCAGATTG CCGCCGGATT CGGCCTCGAA ACCTGTGATT TGAATAACGA AGCCGATCCG
CAGGCTGCAT TGCAGGAAAT CATCAATCGC CCTGGCCCGG CGCTGATCCA TGTGCGCATT
GATGCCGAAG AAAAAGTTTA CCCGATGGTG CCGCCAGGTG CGGCGAATAC TGAAATGGTG
GGGGAATAA
 
Protein sequence
MASSGTTSTR KRFTGAEFIV HFLEQQGIKI VTGIPGGSIL PVYDALSQST QIRHILARHE 
QGAGFIAQGM ARTDGKPAVC MACSGPGATN LVTAIADARL DSIPLICITG QVPASMIGTD
AFQEVDTYGI SIPITKHNYL VRHIEELPQV MSDAFRIAQS GRPGPVWIDI PKDVQTAVFE
IEAQPAVAEK AAAPAFSEES IRDAATMINA AKRPVLYLGG GVINAPARVR ELAEKAQLPT
TMTLMALGML PKAHPLSLGM LGMHGVRSTN YILQEADLLI VLGARFDDRA IGKTEQFCPN
AKIIHVDIDR AELGKIKQPH VAIQADVDDV LAQLIPLVEA QPRAEWHQLV ADLQREFPCP
IPKACDPLSH YGLINAVAAC VDDNAIITTD VGQHQMWTAQ AYPLNRPRQW LTSGGLGTMG
FGLPAAIGAA LANPDRKVLC FSGDGSLMMN IQEMATASEN QLDVKIILMN NEALGLVHQQ
QSLFYEQGVF AATYPGKINF MQIAAGFGLE TCDLNNEADP QAALQEIINR PGPALIHVRI
DAEEKVYPMV PPGAANTEMV GE