Gene Information |
Locus tag | EcHS_A3884 |
Symbol | ilvB |
ID | 5592983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3880677 |
End bp | 3882365 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640922994 |
Product | acetolactate synthase catalytic subunit |
Protein accession | YP_001460471 |
Protein GI | 157163153 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
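The coordinate and length fields in this table are consistent with one another, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3880677
END_BP = 3882365


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1689 562
```

Both results agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields.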
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGTT CGGGCACAAC ATCGACGCGT AAGCGCTTTA CCGGCGCAGA ATTTATCGTT CATTTCCTGG AACAGCAGGG CATTAAGATT GTGACGGGCA TTCCGGGCGG TTCTATCCTG CCTGTTTACG ATGCCTTAAG CCAAAGTACG CAAATCCGCC ATATTCTGGC TCGCCATGAA CAGGGCGCGG GATTTATCGC TCAGGGAATG GCGCGCACCG ACGGTAAACC GGCGGTCTGT ATGGCCTGTA GCGGACCGGG TGCGACTAAC CTGGTGACCG CCATTGCCGA TGCGCGGCTG GACTCCATCC CGCTGATTTG CATCACTGGT CAGGTTCCCG CCTCGATGAT CGGCACCGAC GCCTTCCAGG AAGTCGACAC CTACGGCATC TCTATCCCCA TCACCAAACA CAACTATCTG GTCAGACATA TCGAAGAACT CCCGCAGGTC ATGAGCGATG CCTTCCGCAT TGCGCAATCA GGCCGCCCAG GCCCGGTGTG GATAGACATT CCTAAGGATG TGCAAACGGC GGTTTTTGAG ATTGAAGCTC AGCCCGCGGT GGCAGAAAAA GCCGCTGCAC CCGCCTTTAG CGAAGAAAGC ATTCGTGACG CAGCTACAAT GATTAACGCT GCCAAACGCC CGGTGCTTTA TCTGGGTGGT GGTGTGATCA ATGCGCCTGC GCGGGTGCGT GAACTGGCGG AGAAAGCGCA ACTGCCTACC ACCATGACTT TAATGGCGCT GGGCATGCTG CCAAAAGCGC ATCCGTTGTC GCTGGGTATG CTGGGGATGC ACGGCGTGCG CAGCACTAAC TATATCTTGC AGGAGGCGGA TTTACTGATT GTGCTCGGTG CGCGTTTTGA TGACCGGGCG ATTGGCAAAA CCGAGCAGTT CTGTCCGAAT GCCAAAATCA TTCATGTCGA TATCGACCGT GCAGAGCTGG GTAAAATCAA GCAGCCGCAT GTGGCGATTC AGGCGGATGT TGATGACGTG CTGGCGCAGT TGATCCCGCT GGTGGAAGCG CAACCGCGTG CAGAGTGGCA CCAGTTGGTA GCGGATTTGC AGCGTGAGTT TCCGTGTCCA ATCCCGAAAG CGTGCGATCC ATTAAGCCAT TACGGCCTGA TCAACGCCGT TGCCGCCTGT GTCGATGACA ATGCGATTAT CACCACCGAT GTGGGGCAGC ATCAGATGTG GACCGCGCAA GCTTATCCGC TCAATCGCCC ACGCCAGTGG CTGACCTCCG GTGGGCTGGG CACGATGGGT TTTGGCCTGC CTGCGGCGAT TGGCGCGGCG CTGGCGAACC CGGATCGCAA AGTGTTGTGT TTCTCCGGCG ACGGCAGCCT GATGATGAAT ATTCAGGAGA TGGCGACCGC CAGTGAAAAT CAGCTGGATG TCAAAATCAT TCTGATGAAC AACGAAGCGC TGGGGCTGGT GCATCAGCAA CAGAGTCTGT TCTACGAGCA AGGCGTTTTT GCCGCCACCT ATCCGGGCAA AATCAACTTT ATGCAGATTG CCGCCGGATT CGGCCTCGAA ACCTGTGATT TGAATAACGA AGCCGATCCG CAGGCTGCAT TGCAGGAAAT CATCAATCGC CCTGGCCCGG CGCTGATCCA TGTGCGCATT GATGCCGAAG AAAAAGTTTA CCCGATGGTG CCGCCAGGTG CGGCGAATAC TGAAATGGTG GGGGAATAA
|
Protein sequence | MASSGTTSTR KRFTGAEFIV HFLEQQGIKI VTGIPGGSIL PVYDALSQST QIRHILARHE QGAGFIAQGM ARTDGKPAVC MACSGPGATN LVTAIADARL DSIPLICITG QVPASMIGTD AFQEVDTYGI SIPITKHNYL VRHIEELPQV MSDAFRIAQS GRPGPVWIDI PKDVQTAVFE IEAQPAVAEK AAAPAFSEES IRDAATMINA AKRPVLYLGG GVINAPARVR ELAEKAQLPT TMTLMALGML PKAHPLSLGM LGMHGVRSTN YILQEADLLI VLGARFDDRA IGKTEQFCPN AKIIHVDIDR AELGKIKQPH VAIQADVDDV LAQLIPLVEA QPRAEWHQLV ADLQREFPCP IPKACDPLSH YGLINAVAAC VDDNAIITTD VGQHQMWTAQ AYPLNRPRQW LTSGGLGTMG FGLPAAIGAA LANPDRKVLC FSGDGSLMMN IQEMATASEN QLDVKIILMN NEALGLVHQQ QSLFYEQGVF AATYPGKINF MQIAAGFGLE TCDLNNEADP QAALQEIINR PGPALIHVRI DAEEKVYPMV PPGAANTEMV GE
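The gene sequence above is shown in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand, so it can be translated directly. The sketch below is one way to strip the 10-base grouping, translate with the codon assignments of translation table 11, and compute GC content. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the translation does not handle alternative start codons such as GTG or TTG, which table 11 reads as Met at the initiation position; that is not an issue here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments for translation table 11, in NCBI base order T, C, A, G.
# The amino acid assignments are the same as in the standard code (table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCAAGTT CGGGCACAAC ATCGACGCGT"))  # MASSGTTSTR
print(translate("GGGGAATAA"))  # GE
```

The two printed fragments match the beginning (MASSGTTSTR) and the end (GE) of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.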