Gene ECH74115_5104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5104 
SymbolilvB 
ID6966647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4744675 
End bp4746363 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content56% 
IMG OID643388776 
Productacetolactate synthase catalytic subunit 
Protein accessionYP_002273202 
Protein GI209396868 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.127246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGTT CGGGCACATC ATCGACGCGT AAGCGCTTTA CCGGCGCAGA ATTTATCGTT 
CATTTCCTGG AACAGCAGGG CATTAAGATT GTGACGGGCA TTCCGGGCGG TTCTATCCTG
CCTGTTTACG ATGCCTTAAG CCAAAGCACG CAAATCCGCC ATATTCTGGC TCGCCATGAA
CAGGGCGCGG GGTTTATCGC TCAGGGAATG GCGCGCACCG ACGGTAAACC GGCGGTCTGT
ATGGCCTGTA GCGGACCGGG GGCGACTAAC CTGGTGACCG CCATTGCTGA TGCGCGGCTG
GACTCCATCC CGCTGATTTG CATCACTGGT CAGGTTCCCG CCTCAATGAT CGGCACTGAC
GCCTTCCAGG AAGTCGACAC CTACGGCATC TCTATCCCCA TCACCAAACA CAACTATCTG
GTCAGACATA TCGAAGAACT CCCGCAGGTC ATGAGCGATG CCTTCCGTAT TGCGCAATCA
GGCCGCCCTG GCCCGGTGTG GATAGACATT CCTAAGGATG TGCAAACGGC CGTTTTTGAG
ATTGAAGCAC AGCCCGCGGT GGCAGAAAAA GCCGTCGCGC CCGCCTTTAG CGAAGAAAGC
ATTCGTGACG CAGCGGCAAT GATTAATGCT GCCAAACGCC CGGTGCTTTA TCTGGGCGGT
GGTGTGATCA ATGCGCCTGC GAGAGTGCGT GAACTGGCGG AGAAAGCGCA GCTGCCTACC
ACAATGACTT TAATGGCGCT GGGCATGTTG CCAAAAGCGC ATCCGTTGTC GCTGGGTATG
TTGGGGATGC ACGGTGTGCG CAGCACCAAC TATATTTTGC AGGAGGCGGA TTTGTTGATT
GTGCTCGGTG CGCGTTTTGA TGACCGGGCG ATTGGCAAAA CCGAGCAGTT CTGCCCGAAT
GCCAAAATTA TTCATGTCGA TATCGACCGC GCAGAGCTGG GTAAAATCAA GCAGCCGCAC
GTAGCGATTC AGGCGGATGT TGATGACGTG CTGGCGCAGC TGATCCCGCA GGTGGAAGCG
CAACCGCGTG CAGAGTGGCA CCAGTTGGTA GCGGATTTGC AGCGCGAATT CCCGTGTCCA
ATCCCGAAAG CGTGCGATCC GTTAAGCCAT TACGGCCTGA TCAACGCCGT TGCCGCCTGT
GTCGATGACA ATGCGATTAT CACCACCGAT GTGGGTCAGC ATCAGATGTG GACGGCGCAG
GCTTATCCGC TCAATCGCCC ACGCCAGTGG CTGACCTCCG GCGGGCTGGG CACGATGGGT
TTTGGCCTGC CTGCGGCGAT TGGCGCGGCG CTGGCGAACC CGGATCGCAA AGTGTTGTGT
TTCTCCGGCG ATGGCAGCCT GATGATGAAT ATTCAGGAGA TGGCGACCGC CAGCGAAAAC
CAGCTGGATG TTAAAATCAT TCTGATGAAC AACGAAGCGC TGGGGCTGGT GCATCAGCAA
CAGAGTCTGT TCTACAAGCA GGGCGTTTTT GCCGCTACCT ATCCGGGAAA AATCAACTTT
ATGCAGATTG CCGCCGGATT CGGCCTCGAA ACCTGTGATT TGAATAACGA AGCCGATCCG
CAGGCTGCAT TGCAGGAAAT CATCAATCGC CCTGGTCCGG CGCTGATCCA TGTGCGCATT
GATGCCGAAG AAAAAGTGTA CCCGATGGTG CCGCCAGGTG CGGCGAATAC AGAAATGGTG
GGGGAATAA
 
Protein sequence
MASSGTSSTR KRFTGAEFIV HFLEQQGIKI VTGIPGGSIL PVYDALSQST QIRHILARHE 
QGAGFIAQGM ARTDGKPAVC MACSGPGATN LVTAIADARL DSIPLICITG QVPASMIGTD
AFQEVDTYGI SIPITKHNYL VRHIEELPQV MSDAFRIAQS GRPGPVWIDI PKDVQTAVFE
IEAQPAVAEK AVAPAFSEES IRDAAAMINA AKRPVLYLGG GVINAPARVR ELAEKAQLPT
TMTLMALGML PKAHPLSLGM LGMHGVRSTN YILQEADLLI VLGARFDDRA IGKTEQFCPN
AKIIHVDIDR AELGKIKQPH VAIQADVDDV LAQLIPQVEA QPRAEWHQLV ADLQREFPCP
IPKACDPLSH YGLINAVAAC VDDNAIITTD VGQHQMWTAQ AYPLNRPRQW LTSGGLGTMG
FGLPAAIGAA LANPDRKVLC FSGDGSLMMN IQEMATASEN QLDVKIILMN NEALGLVHQQ
QSLFYKQGVF AATYPGKINF MQIAAGFGLE TCDLNNEADP QAALQEIINR PGPALIHVRI
DAEEKVYPMV PPGAANTEMV GE