Gene Information |
Locus tag | Moth_2256 |
Symbol | |
ID | 3830751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2360836 |
End bp | 2362518 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637830176 |
Product | acetolactate synthase, large subunit |
Protein accession | YP_431086 |
Protein GI | 83591077 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
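The Start bp, End bp, Gene Length, and Protein Length fields above are linked by simple arithmetic. The sketch below checks that relationship under two assumptions that the listed values are consistent with but that the record does not state explicitly: coordinates are 1-based and inclusive, and the CDS length includes one terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2360836, 2362518)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Both results match the Gene Length (1683 bp) and Protein Length (560 aa) fields.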
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGG AGACTAGCGG CCGCACTGGC GCCCGGGCCG TGGTTGAGGC CCTCCTGGAC GAGGGTGTGG AACTGGTCTT CGGTTATCCC GGCGGGGCGG TATTGCCCCT CTATCATGAG CTGGCCCGGA CGCCCATCCG CCATATCCTG GTTCGCCAGG AGCAGAACGC CGTTCATGCC GCCAGCGGTT ACGCCCGCGC CAGCGGCAGA ACAGGTGTCT GCTTTGCCAC CTCGGGTCCG GGGGCGACCA ACCTGGTCAC CGGTATTGCC ACCGCTTTTA TGGATTCTGT GCCGGTGGTT ATCTTTACCG GCCAGGTCCC GACGCGGATG GTGGGTAGCG ATGCCTTCCA GGAAACCGAC ATTACGGGCA TTACCATGCC TATTACCAAG CATAATTACC TGGTCAAGGA TGTAGAGGAA TTACCCCGAA TCGTCAAGGA GGCCTTCTAT ATTGCCGGCA CCGGTCGCCC GGGACCGGTG CTGGTAGATA TACCCAAGGA TGTGGCCCTG GCCCCCTGCC GGGCACCCTT ACCGGAAAGG GTGGAGCTGC GGGGTTACAA ACCCACCTAT CATGGCCATC CGGGCCAGCT TCGCTCCCTG GCGCGAATCT TAGGCGAGGC CGAGCGGCCG TTAATCTTCG CGGGGGGCGG GGTACAGGTT TCCCGAGCCG AGGATTATTT ACGCCAACTG GTAGAAAAGC TCCAGATACC GGTAGTAACC TCCCTCACGG GGCTGGGTTC CTTTCCCGAG GATAATCCTT TATCTTTAGG TATGGTCGGC CTCCATGGCA AGCCCTGCGC CAACCATGCC CTCATGGAGT GCGACCTCCT GGTGGGCCTG GGGGTACGCT TTGACGACCG GGTAACGGGA GCCCTGGATA AGTTCGCCCC CCGGGCCAGG ATTGCCCACC TGGATATTGA CCCGGCGGAA ATCGGCAAAA ACGTCCGGGT GGATTTACCT CTGGTAGGCG ATATCAGCTG TATCCTGAAG GAACTCCTGC CCCTGGTGGA ACCCGCCGGA CACGGCCCCT GGCTGCAGCG CATTAAAGAA TTGCGTAATC TCTACCCCCT GACCTATGGC CGCGGCGGCG AGGTGCGGCC CCAGTGGGTA ATCGAGCGCC TGGGGGAGAT GACCCGCGGC CAGGCGATTA TTACTACCGA TGTCGGCCAG CATCAGATGT GGGCAGCCCT CTTTTACGGT TTTACCGAAC CCCGCACCTT CATTTCTTCC TGTGGCCTGG GAACCATGGG TTACGGCCTG CCGGCAGCCG TGGGCGCCGC CCTGGCCCGG CCCGATAAAC AGGTGTGGTT GATAACCGGC GACGGCAGCT TCCAGATGAG CATGGCGGAA CTGGGTACAG CCAGGGAGCA GGGCGTACCT TTAAAGATTT TACTTTTCAA TAACCAAAGC CTGGCCATGG TGCGCCAGCT GCAGCACTTT TACTATGAAC GCCAGTATAC CGCCATCGAG TTTACCGGCA ACCCCGACTT TGTCCGCCTG GCGGAGTGCT ACGGGGCCGA GGGGTTGCGT ATAAGTAAGC AGGAAGAAGT GGTGCCAGTC CTGGCTCGGG CTATGGGCAA CGACCGCCTG ACATTGATTG AATGCCTGAT CAGTCCTGAA GAGATGGTAT ACCCCATGGT CCCGGAAGGG GCGGCCCTGG ACGAGATGAT TCTTCCGGAA TAA
|
Protein sequence | MAQETSGRTG ARAVVEALLD EGVELVFGYP GGAVLPLYHE LARTPIRHIL VRQEQNAVHA ASGYARASGR TGVCFATSGP GATNLVTGIA TAFMDSVPVV IFTGQVPTRM VGSDAFQETD ITGITMPITK HNYLVKDVEE LPRIVKEAFY IAGTGRPGPV LVDIPKDVAL APCRAPLPER VELRGYKPTY HGHPGQLRSL ARILGEAERP LIFAGGGVQV SRAEDYLRQL VEKLQIPVVT SLTGLGSFPE DNPLSLGMVG LHGKPCANHA LMECDLLVGL GVRFDDRVTG ALDKFAPRAR IAHLDIDPAE IGKNVRVDLP LVGDISCILK ELLPLVEPAG HGPWLQRIKE LRNLYPLTYG RGGEVRPQWV IERLGEMTRG QAIITTDVGQ HQMWAALFYG FTEPRTFISS CGLGTMGYGL PAAVGAALAR PDKQVWLITG DGSFQMSMAE LGTAREQGVP LKILLFNNQS LAMVRQLQHF YYERQYTAIE FTGNPDFVRL AECYGAEGLR ISKQEEVVPV LARAMGNDRL TLIECLISPE EMVYPMVPEG AALDEMILPE
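The sequence fields above are displayed in space-separated blocks of 10 characters. Before any downstream use, the blocks usually need to be joined back into one string. The sketch below does that and computes a GC fraction; the helper names are hypothetical, and the sample covers only the first three blocks of the gene sequence, so its GC value is not expected to equal the 60% reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the Gene sequence field.
sample = "ATGGCACAGG AGACTAGCGG CCGCACTGGC"
seq = clean_sequence(sample)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 2))  # 0.67
```

Passing the complete Gene sequence field through the same two helpers would give the gene-wide value for comparison with the GC content field.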