Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2714 |
Symbol | |
ID | 4810708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3202721 |
End bp | 3204388 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108133 |
Product | acetolactate synthase, large subunit |
Protein accession | YP_001039106 |
Protein GI | 125975196 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTA CCGGTTCGAA AATATTAATT GAGTGCCTTA AGGAACAAGG TGTGGATACA ATATTTGGAT TTCCGGGCGG AGCGGTATTG AATATATATG ATGAACTGTA CAGGTCGCAG AATGAAATCC GGCATATACT GACTTCCCAC GAACAGGGAG CTGCTCATGC CGCAGACGGC TATGCAAGGG CAACGGGAAA GGTTGGGGTG TGTCTTGCCA CATCCGGCCC CGGGGCCACA AATCTGGTTA CAGGAATAGC AACCGCTTAC ATGGATTCGG TTCCGATGGT TGCAATTACA GGACAGGTGG CGACGCCGCT CTTGGGCAAG GATTCCTTCC AGGAAGTTGA CATAACCGGA ATAACAATGC CCATAACAAA ACACAATTTT ATAGTAAAGG ATGTTAACAA GCTGGCTGAC ATAGTGCGAA GAGCCTTTTA CATAGCAAAA GAAGGAAGAC CAGGCCCGGT TCTGATTGAT ATATGCAAAG ATGTAACGGC GGCATATGCT GAATATGAAC CAAAGTCCCC TCAGGAATTG CCCGAGGTAC CTGTAAGAGT GGATGAAAAG TGTATTGATG AAGCGGCGGA GGCAATTAAC AAAGCGGAAA GACCGGTTAT TCTCGCCGGA GGAGGCGTAT CAATCGCGGG AGCAAATAAA GAACTTTTTG AGTTTGCAAC AAATGCCCAG ATACCAGTTA CCACAACTTT AATGGGCATG GGTGCTTTCC CGGGAACCCA TGAGCTGTTC ATGGGAATGA TTGGAATGCA CGGCACAAAG ACGACAAACA TGGCGGTTTC GGAATCGGAT CTTTTTATTG CGATTGGTGC AAGATTTAGT GACAGAGTGA TAAGCAATGT TCAGAGATTT GCACCTAAAG CAAGCATAAT GCATATAGAC ATTGACCCTG CCGAAATCGG AAAAAATATT AATGTTCAAT ATGCTCTTGA GGGAAACATC AAGAAAATAT TGCAGCTTCT GAACGAGAGA GTAAAGAAGA AAGAATGCAC TGACTGGGTT AGAAAAATCA ATGAGTGGAA GGAACTGTAT CCTCTTAAGT ATCCTCAGGA TGACAAGCTT CATCCGCAAT ATATTATTGA GAGAATGTAT GAACTTACCA AAGGAGAGGC AATAATAACT ACCGAGGTGG GTCAGCACCA GATGTGGGCC GCCCAGTTTT ACAAATACAC TTCTCCAAGA CAGTTCCTGT CCTCAGGTGG TCTGGGTACC ATGGGATATG GTCTTGGAGC ATGCATTGGC GCCCGGATTG GAAGACCCGA CAAAAAGGTA ATTAATGTTG CCGGTGACGG CAGCTTCAGA ATGAACTGCA ATGAGCTGGC CACAGCTGTT GAGTACAAGC TTCCGATAAT AGTTGCGATA TTCAACAATC ATGCTCTGGG AATGGTAAGA CAGTGGCAGC AGTTGTTCTA CGGCGGAAGG TATTCCTCAA CCTCGCTGGA CAGATGTACA GACTTTAAAG CTTTGGCGGA AGCTTACGGT GCAATCGGTA TAAATGTCAC AGCCAAAGAA GAAGTCGATG AAGCTTTAAA CAGAGCACTG GCGTCTGAGG ATACCCCTGT GGTAATCAAT TTTGAAATTG ACAAGGATGA AATGGTATTT CCTATTGTTC CGCCGGGAGC TCCTTTAAGC GAGCTTATTG AGGAGTAA
|
Protein sequence | MKLTGSKILI ECLKEQGVDT IFGFPGGAVL NIYDELYRSQ NEIRHILTSH EQGAAHAADG YARATGKVGV CLATSGPGAT NLVTGIATAY MDSVPMVAIT GQVATPLLGK DSFQEVDITG ITMPITKHNF IVKDVNKLAD IVRRAFYIAK EGRPGPVLID ICKDVTAAYA EYEPKSPQEL PEVPVRVDEK CIDEAAEAIN KAERPVILAG GGVSIAGANK ELFEFATNAQ IPVTTTLMGM GAFPGTHELF MGMIGMHGTK TTNMAVSESD LFIAIGARFS DRVISNVQRF APKASIMHID IDPAEIGKNI NVQYALEGNI KKILQLLNER VKKKECTDWV RKINEWKELY PLKYPQDDKL HPQYIIERMY ELTKGEAIIT TEVGQHQMWA AQFYKYTSPR QFLSSGGLGT MGYGLGACIG ARIGRPDKKV INVAGDGSFR MNCNELATAV EYKLPIIVAI FNNHALGMVR QWQQLFYGGR YSSTSLDRCT DFKALAEAYG AIGINVTAKE EVDEALNRAL ASEDTPVVIN FEIDKDEMVF PIVPPGAPLS ELIEE
|
| |