Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4134 |
Symbol | ilvG |
ID | 6144247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4230348 |
End bp | 4231994 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618957 |
Product | acetolactate synthase 2 catalytic subunit |
Protein accession | YP_001746089 |
Protein GI | 170684034 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.82212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.957886 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGCG CACAGTGGGT GGTACATGCG TTGCGGGCAC AGGGTGTGAA TACCGTTTTC GGTTATCCGG GTGGCGCAAT TATGCCGGTT TACGATGCAT TGTATGACGG CGGCATGGAG CACTTGCTGT GCCGACATGA ACAGGGTGCG GCAATGGCGG CTATCGGTTA TGCCCGTGCT ACTGGCAAAA CTGGCGTATG TATCGCCACG TCTGGTCCGG GCGCAACCAA CCTGATAACC GGGCTTGCGG ACGCACTGTT AGATTCCATC CCCGTTGTTG CTATCACCGG TCAGGTATCC GCACCGTTTA TCGGCACTGA CGCATTTCAG GAGGTGGATA TTCTGGGGTT GTCGTTAGCC TGCACCAAGC ACAGTTTCCT GGTGCAATCA CTGGAAGAGT TACCGCGTAT CATGGCCGAA GCGTTTGACG TCGCCAGCTC TGGTCGTCCT GGTCCGGTTT TGGTCGATAT TCCAAAAGAT ATCCAGTTAG CCAGCGGCGA TCTGGAACCG TGGTTCACCA CCGTTGAAAA CGAAGTGACT TTCCCCCATG CCGAAGTCGA GCAAGCGCGC CAGATGCTGG CAAAAGCGCA AAAACCGATG CTGTACGTTG GCGGTGGCGT TGGCATGGCG CAGGCAGTTC CGGCTTTGCG TGAATTTCTC GCTACCACAA AAATGCCTGC CACCTGCACG CTGAAAGGGC TGGGCGCAGT TGAAGCAGAT TATCCGTACT ATCTGGGCAT GCTGGGAATG CACGGCACCA AAGCGGCGAA CTTCGCGGTT CAGGAGTGCG ATCTACTGAT AGCCGTGGGC GCACGTTTTG ATGACCGGGT GACCGGCAAA CTGAACACCT TCGCACCACA CGCCAGCGTT ATCCATATGG ATATCGATCC GGCAGAAATG AACAAGCTGC GTCAGGCACA TGCGGCATTA CAGGGTGATT TAAATGCTCT GTTACCAGCA TTACAGCAGC CGTTAAATAT CGATGACTGG CAGCAACACT GCGCACAACT GCGTGATGAA CATGCCTGGC GTTACGACCA TCCTGGTGAC GCTATCTACG CGCCGTTGTT ATTAAAACAA CTGTCGGATC GTAAACCTGC GGATTGCATC GTGACCACAG ATGTGGGGCA GCACCAGATG TGGTCTGCCC AGCACATCGT CCACACTCGC CCGGAAAATT TCATCACCTC CAGCGGCTTA GGCACCATGG GTTTTGGTTT ACCGGCGGCG GTTGGCGCAC AAGTCGCGCG ACCGAACGAT ACCGTCGTCT GTATCTCCGG TGACGGCTCT TTCATGATGA ATGTGCAAGA GCTGGGCACC GTAAAACGCA AGCAGTTACC GTTGAAAATC GTCTTACTCG ATAACCAACG GTTAGGGATG GTTCGACAAT GGCAGCAACT GTTTTTCCAG GAACGATATA GCGAAACCAC CCTTACCGAT AATCCCGATT TCCTCATGTT AGCCAGCGCC TTCGGCATCC CTGGCCAACA CATCACCCGT AAAGACCAGG TTGAAGCGGC ACTCGACACC ATGCTGAACA GTGATGGGCC ATACCTGCTT CATGTCTCAA TCGACGAACT TGAGAACGTC TGGCCGCTGG TGCCGCCAGG TGCCAGTAAT TCAGAAATGT TGGAGAAATT ATCATGA
|
Protein sequence | MNGAQWVVHA LRAQGVNTVF GYPGGAIMPV YDALYDGGME HLLCRHEQGA AMAAIGYARA TGKTGVCIAT SGPGATNLIT GLADALLDSI PVVAITGQVS APFIGTDAFQ EVDILGLSLA CTKHSFLVQS LEELPRIMAE AFDVASSGRP GPVLVDIPKD IQLASGDLEP WFTTVENEVT FPHAEVEQAR QMLAKAQKPM LYVGGGVGMA QAVPALREFL ATTKMPATCT LKGLGAVEAD YPYYLGMLGM HGTKAANFAV QECDLLIAVG ARFDDRVTGK LNTFAPHASV IHMDIDPAEM NKLRQAHAAL QGDLNALLPA LQQPLNIDDW QQHCAQLRDE HAWRYDHPGD AIYAPLLLKQ LSDRKPADCI VTTDVGQHQM WSAQHIVHTR PENFITSSGL GTMGFGLPAA VGAQVARPND TVVCISGDGS FMMNVQELGT VKRKQLPLKI VLLDNQRLGM VRQWQQLFFQ ERYSETTLTD NPDFLMLASA FGIPGQHITR KDQVEAALDT MLNSDGPYLL HVSIDELENV WPLVPPGASN SEMLEKLS
|
| |