Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4038 |
Symbol | ilvB |
ID | 6143610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4127897 |
End bp | 4129585 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618863 |
Product | acetolactate synthase catalytic subunit |
Protein accession | YP_001746001 |
Protein GI | 170683518 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.725928 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.773319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGTT CGGGCACAAC ATCGACGCGT AAGCGCTTTA CCGGCGCAGA ATTTATCGTT CATTTCCTGG AACAGCAGGG CATTAAGATT GTGACGGGCA TTCCGGGCGG TTCTATCCTG CCTGTTTACG ATGCCTTAAG CCAAAGCACG CAAATCCGCC ATATTCTGGC TCGCCATGAA CAGGGAGCGG GGTTTATCGC TCAGGGAATG GCGCGCACCG ACGGTAAACC GGCGGTCTGT ATGGCCTGTA GCGGACCGGG TGCGACTAAC CTGGTGACCG CCATTGCCGA TGCGCGGCTG GATTCCATCC CGCTGATTTG CATCACCGGT CAGGTTCCTG CCTCGATGAT CGGCACCGAC GCCTTCCAGG AAGTGGACAC CTACGGCATC TCTATCCCCA TCACCAAACA CAACTATCTG GTCAGACATA TCGAAGAGCT CCCACAGGTC ATGAGCGACG CCTTTCGTAT TGCGCAATCA GGCCGCCCTG GCCCGGTGTG GATAGACATT CCTAAGGATG TGCAAACGGC GGTTTTTGAG ATTGAAGCAC AGGCCGCTAT GGCAGAAAAA GCTGCCGCGC CTGCGTTTAG TGAAGAGAGT ATTCGTGACG CAGCTGCAAT GATTAACGCC GCCAAACGCC CGGTGCTTTA TCTGGGCGGC GGTGTGATCA ATGCGCCCGC ACGGGTGCGT GAACTGGCAG AGAAAGCGCA ATTGCCAACT ACCATGACCT TAATGGCGCT GGGCATGCTG CCAAAAGCGC ATCCGCTGTC GCTGGGTATG CTGGGGATGC ACGGCGTGCG CAGCACCAAC TATATACTGC AGGAGGCGGA TCTGCTGATT GTTCTCGGTG CGCGTTTTGA TGACCGGGCG ATTGGCAAAA CCGAGCAGTT CTGCCCGAAT GCGAAAATCA TTCATGTCGA TATCGACCGT GCAGAGCTGG GCAAAATCAA GCAACCGCAC GTAGCGATTC AGGCGGATGT TGATGACGTG CTGGCGCAAT TGATCCCGCA GGTGGAAGCG CAACCGCGTG CAGAGTGGCA CCAGTTGGTG GCGGATTTGC AGCGCGAATT CCCGTATCCA ATCCCGAAAG CGTGCGATCC GTTAACGCAT TACGGCCTGA TCAACGCTGT TGCCGCCTGT GTCGATGACA ATGCGATTAT CACCACCGAC GTGGGTCAGC ATCAGATGTG GACCGCGCAA GCCTATCCGC TCAATCGCCC GCGCCAGTGG CTGACCTCCG GCGGGCTGGG CACGATGGGC TTTGGCCTGC CTGCGGCGAT TGGCGCGGCG CTGGCGAACC CGGATCGCAA AGTGTTGTGT TTCTCCGGCG ACGGCAGCCT GATGATGAAT ATTCAGGAGA TGGCGACCGC CAGTGAAAAT CAGCTGGATG TCAAAATCAT TCTGATGAAC AACGAAGCGC TGGGGCTGGT ACATCAGCAA CAGAGTCTGT TCTACAAGCA GGGCGTTTTT GCCGCTACCT ATCCGGGAAA AATCAACTTT ATGCAGATTG CCGCCGGATT CGGCCTCGAA ACCTGTGATT TGAATAACGA AACCGATCCG CAGGCTGCAT TACAGGAAAT CATCAATCGC CCTGGTCCGG CGCTGATCCA TGTGCGTATT GATGCCGAAG AAAAAGTGTA CCCGATGGTG CCGCCAGGTG CGGCGAATAC TGAAATGGTG GGGGAATAA
|
Protein sequence | MASSGTTSTR KRFTGAEFIV HFLEQQGIKI VTGIPGGSIL PVYDALSQST QIRHILARHE QGAGFIAQGM ARTDGKPAVC MACSGPGATN LVTAIADARL DSIPLICITG QVPASMIGTD AFQEVDTYGI SIPITKHNYL VRHIEELPQV MSDAFRIAQS GRPGPVWIDI PKDVQTAVFE IEAQAAMAEK AAAPAFSEES IRDAAAMINA AKRPVLYLGG GVINAPARVR ELAEKAQLPT TMTLMALGML PKAHPLSLGM LGMHGVRSTN YILQEADLLI VLGARFDDRA IGKTEQFCPN AKIIHVDIDR AELGKIKQPH VAIQADVDDV LAQLIPQVEA QPRAEWHQLV ADLQREFPYP IPKACDPLTH YGLINAVAAC VDDNAIITTD VGQHQMWTAQ AYPLNRPRQW LTSGGLGTMG FGLPAAIGAA LANPDRKVLC FSGDGSLMMN IQEMATASEN QLDVKIILMN NEALGLVHQQ QSLFYKQGVF AATYPGKINF MQIAAGFGLE TCDLNNETDP QAALQEIINR PGPALIHVRI DAEEKVYPMV PPGAANTEMV GE
|
| |