Gene EcSMS35_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4038 
SymbolilvB 
ID6143610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4127897 
End bp4129585 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content56% 
IMG OID641618863 
Productacetolactate synthase catalytic subunit 
Protein accessionYP_001746001 
Protein GI170683518 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.725928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.773319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGTT CGGGCACAAC ATCGACGCGT AAGCGCTTTA CCGGCGCAGA ATTTATCGTT 
CATTTCCTGG AACAGCAGGG CATTAAGATT GTGACGGGCA TTCCGGGCGG TTCTATCCTG
CCTGTTTACG ATGCCTTAAG CCAAAGCACG CAAATCCGCC ATATTCTGGC TCGCCATGAA
CAGGGAGCGG GGTTTATCGC TCAGGGAATG GCGCGCACCG ACGGTAAACC GGCGGTCTGT
ATGGCCTGTA GCGGACCGGG TGCGACTAAC CTGGTGACCG CCATTGCCGA TGCGCGGCTG
GATTCCATCC CGCTGATTTG CATCACCGGT CAGGTTCCTG CCTCGATGAT CGGCACCGAC
GCCTTCCAGG AAGTGGACAC CTACGGCATC TCTATCCCCA TCACCAAACA CAACTATCTG
GTCAGACATA TCGAAGAGCT CCCACAGGTC ATGAGCGACG CCTTTCGTAT TGCGCAATCA
GGCCGCCCTG GCCCGGTGTG GATAGACATT CCTAAGGATG TGCAAACGGC GGTTTTTGAG
ATTGAAGCAC AGGCCGCTAT GGCAGAAAAA GCTGCCGCGC CTGCGTTTAG TGAAGAGAGT
ATTCGTGACG CAGCTGCAAT GATTAACGCC GCCAAACGCC CGGTGCTTTA TCTGGGCGGC
GGTGTGATCA ATGCGCCCGC ACGGGTGCGT GAACTGGCAG AGAAAGCGCA ATTGCCAACT
ACCATGACCT TAATGGCGCT GGGCATGCTG CCAAAAGCGC ATCCGCTGTC GCTGGGTATG
CTGGGGATGC ACGGCGTGCG CAGCACCAAC TATATACTGC AGGAGGCGGA TCTGCTGATT
GTTCTCGGTG CGCGTTTTGA TGACCGGGCG ATTGGCAAAA CCGAGCAGTT CTGCCCGAAT
GCGAAAATCA TTCATGTCGA TATCGACCGT GCAGAGCTGG GCAAAATCAA GCAACCGCAC
GTAGCGATTC AGGCGGATGT TGATGACGTG CTGGCGCAAT TGATCCCGCA GGTGGAAGCG
CAACCGCGTG CAGAGTGGCA CCAGTTGGTG GCGGATTTGC AGCGCGAATT CCCGTATCCA
ATCCCGAAAG CGTGCGATCC GTTAACGCAT TACGGCCTGA TCAACGCTGT TGCCGCCTGT
GTCGATGACA ATGCGATTAT CACCACCGAC GTGGGTCAGC ATCAGATGTG GACCGCGCAA
GCCTATCCGC TCAATCGCCC GCGCCAGTGG CTGACCTCCG GCGGGCTGGG CACGATGGGC
TTTGGCCTGC CTGCGGCGAT TGGCGCGGCG CTGGCGAACC CGGATCGCAA AGTGTTGTGT
TTCTCCGGCG ACGGCAGCCT GATGATGAAT ATTCAGGAGA TGGCGACCGC CAGTGAAAAT
CAGCTGGATG TCAAAATCAT TCTGATGAAC AACGAAGCGC TGGGGCTGGT ACATCAGCAA
CAGAGTCTGT TCTACAAGCA GGGCGTTTTT GCCGCTACCT ATCCGGGAAA AATCAACTTT
ATGCAGATTG CCGCCGGATT CGGCCTCGAA ACCTGTGATT TGAATAACGA AACCGATCCG
CAGGCTGCAT TACAGGAAAT CATCAATCGC CCTGGTCCGG CGCTGATCCA TGTGCGTATT
GATGCCGAAG AAAAAGTGTA CCCGATGGTG CCGCCAGGTG CGGCGAATAC TGAAATGGTG
GGGGAATAA
 
Protein sequence
MASSGTTSTR KRFTGAEFIV HFLEQQGIKI VTGIPGGSIL PVYDALSQST QIRHILARHE 
QGAGFIAQGM ARTDGKPAVC MACSGPGATN LVTAIADARL DSIPLICITG QVPASMIGTD
AFQEVDTYGI SIPITKHNYL VRHIEELPQV MSDAFRIAQS GRPGPVWIDI PKDVQTAVFE
IEAQAAMAEK AAAPAFSEES IRDAAAMINA AKRPVLYLGG GVINAPARVR ELAEKAQLPT
TMTLMALGML PKAHPLSLGM LGMHGVRSTN YILQEADLLI VLGARFDDRA IGKTEQFCPN
AKIIHVDIDR AELGKIKQPH VAIQADVDDV LAQLIPQVEA QPRAEWHQLV ADLQREFPYP
IPKACDPLTH YGLINAVAAC VDDNAIITTD VGQHQMWTAQ AYPLNRPRQW LTSGGLGTMG
FGLPAAIGAA LANPDRKVLC FSGDGSLMMN IQEMATASEN QLDVKIILMN NEALGLVHQQ
QSLFYKQGVF AATYPGKINF MQIAAGFGLE TCDLNNETDP QAALQEIINR PGPALIHVRI
DAEEKVYPMV PPGAANTEMV GE