Gene Information |
Locus tag | Cthe_2516 |
Symbol | |
ID | 4809272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2984630 |
End bp | 2986258 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107932 |
Product | acetolactate synthase, large subunit |
Protein accession | YP_001038911 |
Protein GI | 125975001 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
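The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the listed Gene Length agrees with that convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2984630
END_BP = 2986258
GENE_LENGTH_BP = 1629
PROTEIN_LENGTH_AA = 542


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1629
print(expected_protein_length(GENE_LENGTH_BP))  # 542
```

Both results match the Gene Length and Protein Length rows, which is what one would expect for a gene that is neither spliced nor a pseudogene.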
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000744784 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTT CAGGTGCGCA GGCAATTGTG AAAGCTTTGG AATTGGAAGG CGTGGAAGTT GTTTTCGGTT ATCCGGGAGC TGCAATTTGT CCTTTCTATG ATGCGCTGAT GGAATCTAAA ATAAGGCATA TACTCACCAG GCATGAACAG GGAGCTGCCC ATGCCGCCAG CGGATATGCC AGAACAACGG GAAGGGTTGG TGTGTGTGTT GCCACATCAG GACCGGGAGC AACCAACCTT ATTACAGGCA TTGCTGACGC ATACATGGAC TCAATACCGT TGGTGGCCAT CACCGGGCAG GTTAATTCGG AGCTGATTGG AAGAGACGTG TTCCAGGAGG CTGACATTAC CGGCGCGACG GATCCTTTCT GCAAGCATAA TTATTTGGTA AAAAACGCAA AGGATTTGCC CCGGGTTTTA AAGGAAGCCT TTTACATAGC ATCCACCGGA AGGCCGGGCC CTGTTCTTAT CGACGTACCC ATAGACGTGC AGACAAAAGA GATTAATTTT GAGTACCCGG AAAGTGTTGA TATAAAAGGC TACAAGCCAA ATTTAAAAGG CCATTCTCTT CAGATAAAAA AGATTGCTCA AGCTATTGAA AAGGCAAAAA AACCGGTTAT TTGCGCCGGG GGAGGAGTGA TTAATTCCAA TGCATCCGAG GAGCTTTTAA CTCTTTCCCG AAAGTGCGGC ATACCTGTTG TTACAACTTT GATGGGTATC GGGTCCGTGC CGTATGATTA TGAGCTCAAT TTAGGAATGC TTGGGACTCA CGGAGTATAT ATTGCGAATT ATGCCGTAAA CAATGCTGAC CTTTTGATTA TCATCGGCGC GAGGGTTGCG GACAGGGCTA TAAGCAATCC CCAGCAGGTT GCAAAGAGAA AGCAGATAGT TCATATTGAC ATAGATCCTG CGGAGATAGG CAAAAATATC GATGTTTCAA TACCGGTGGT AGGAGATGTG AAGCAGGTAT TAAAAGAGCT TATAGATATT TCCCAAAAGG GAGATACGGA AGAATGGATA AAGACAACTC AAAAAGAAAG AGAAAAACAT GCCGAAAAAC CTGAACCAAG GCCCGGTATA GGTTTTGTGA ATCCCAAATA TTTGTTGTCT GTTTTGACCG GGCTTTTGGG TGATGATGAT ATAATTACAA CAGAAGTGGG ACAGAACCAG ATATGGGCTG CAAACTATTT TGGTGTCAAA AAGCCCCGGA CCTTTATAAC GTCCGGAGGT CTGGGTACCA TGGGATACGG GCTTCCCGCC GCGGTGGGGG CAAAAATTGG CTGTCCCGAC CGCAAAGTGG TATGTGTGGG AGGAGACGGG AGCTTCCAGA TGAACATGCA GGAGCTTGGC ACCATCAAGC AAAACAGGCT GGGAGTGAAA GTAATCTTAT TCAACAACTC AAGGCTGGGA ATGGTAAGGG AGCTGCAAAA GACAAAGTAC TGCGGCCGTT ATTTCCAGGT ATTTTTGGAC GACAATCCTG ACTTTATAAA GTTGTTTGAC GCTTATGGTT TCAAGGGCAG GAGAATAGAC GACGATTCCC AGGTGGAAGA TGCGTTGAAA GAGATGCTGT CGGACGACAA ACCTTACCTT CTCGAGTGCA AAATTGACCC GGAAGAATCA ACACTATGA
|
Protein sequence | MKLSGAQAIV KALELEGVEV VFGYPGAAIC PFYDALMESK IRHILTRHEQ GAAHAASGYA RTTGRVGVCV ATSGPGATNL ITGIADAYMD SIPLVAITGQ VNSELIGRDV FQEADITGAT DPFCKHNYLV KNAKDLPRVL KEAFYIASTG RPGPVLIDVP IDVQTKEINF EYPESVDIKG YKPNLKGHSL QIKKIAQAIE KAKKPVICAG GGVINSNASE ELLTLSRKCG IPVVTTLMGI GSVPYDYELN LGMLGTHGVY IANYAVNNAD LLIIIGARVA DRAISNPQQV AKRKQIVHID IDPAEIGKNI DVSIPVVGDV KQVLKELIDI SQKGDTEEWI KTTQKEREKH AEKPEPRPGI GFVNPKYLLS VLTGLLGDDD IITTEVGQNQ IWAANYFGVK KPRTFITSGG LGTMGYGLPA AVGAKIGCPD RKVVCVGGDG SFQMNMQELG TIKQNRLGVK VILFNNSRLG MVRELQKTKY CGRYFQVFLD DNPDFIKLFD AYGFKGRRID DDSQVEDALK EMLSDDKPYL LECKIDPEES TL
|
| |
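The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence. It is an illustration only: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Table 11 additionally allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAACTTT CAGGTGCGCA GGCAATTGTG"
print(translate(first_blocks))  # MKLSGAQAIV
```

The output agrees with the first block of the listed protein sequence. Passing the full Gene sequence string to `gc_fraction` and `translate` would allow a comparison with the GC content and Protein sequence rows of the record.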