Gene Moth_2256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2256 
Symbol 
ID3830751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2360836 
End bp2362518 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content60% 
IMG OID637830176 
Productacetolactate synthase, large subunit 
Protein accessionYP_431086 
Protein GI83591077 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG AGACTAGCGG CCGCACTGGC GCCCGGGCCG TGGTTGAGGC CCTCCTGGAC 
GAGGGTGTGG AACTGGTCTT CGGTTATCCC GGCGGGGCGG TATTGCCCCT CTATCATGAG
CTGGCCCGGA CGCCCATCCG CCATATCCTG GTTCGCCAGG AGCAGAACGC CGTTCATGCC
GCCAGCGGTT ACGCCCGCGC CAGCGGCAGA ACAGGTGTCT GCTTTGCCAC CTCGGGTCCG
GGGGCGACCA ACCTGGTCAC CGGTATTGCC ACCGCTTTTA TGGATTCTGT GCCGGTGGTT
ATCTTTACCG GCCAGGTCCC GACGCGGATG GTGGGTAGCG ATGCCTTCCA GGAAACCGAC
ATTACGGGCA TTACCATGCC TATTACCAAG CATAATTACC TGGTCAAGGA TGTAGAGGAA
TTACCCCGAA TCGTCAAGGA GGCCTTCTAT ATTGCCGGCA CCGGTCGCCC GGGACCGGTG
CTGGTAGATA TACCCAAGGA TGTGGCCCTG GCCCCCTGCC GGGCACCCTT ACCGGAAAGG
GTGGAGCTGC GGGGTTACAA ACCCACCTAT CATGGCCATC CGGGCCAGCT TCGCTCCCTG
GCGCGAATCT TAGGCGAGGC CGAGCGGCCG TTAATCTTCG CGGGGGGCGG GGTACAGGTT
TCCCGAGCCG AGGATTATTT ACGCCAACTG GTAGAAAAGC TCCAGATACC GGTAGTAACC
TCCCTCACGG GGCTGGGTTC CTTTCCCGAG GATAATCCTT TATCTTTAGG TATGGTCGGC
CTCCATGGCA AGCCCTGCGC CAACCATGCC CTCATGGAGT GCGACCTCCT GGTGGGCCTG
GGGGTACGCT TTGACGACCG GGTAACGGGA GCCCTGGATA AGTTCGCCCC CCGGGCCAGG
ATTGCCCACC TGGATATTGA CCCGGCGGAA ATCGGCAAAA ACGTCCGGGT GGATTTACCT
CTGGTAGGCG ATATCAGCTG TATCCTGAAG GAACTCCTGC CCCTGGTGGA ACCCGCCGGA
CACGGCCCCT GGCTGCAGCG CATTAAAGAA TTGCGTAATC TCTACCCCCT GACCTATGGC
CGCGGCGGCG AGGTGCGGCC CCAGTGGGTA ATCGAGCGCC TGGGGGAGAT GACCCGCGGC
CAGGCGATTA TTACTACCGA TGTCGGCCAG CATCAGATGT GGGCAGCCCT CTTTTACGGT
TTTACCGAAC CCCGCACCTT CATTTCTTCC TGTGGCCTGG GAACCATGGG TTACGGCCTG
CCGGCAGCCG TGGGCGCCGC CCTGGCCCGG CCCGATAAAC AGGTGTGGTT GATAACCGGC
GACGGCAGCT TCCAGATGAG CATGGCGGAA CTGGGTACAG CCAGGGAGCA GGGCGTACCT
TTAAAGATTT TACTTTTCAA TAACCAAAGC CTGGCCATGG TGCGCCAGCT GCAGCACTTT
TACTATGAAC GCCAGTATAC CGCCATCGAG TTTACCGGCA ACCCCGACTT TGTCCGCCTG
GCGGAGTGCT ACGGGGCCGA GGGGTTGCGT ATAAGTAAGC AGGAAGAAGT GGTGCCAGTC
CTGGCTCGGG CTATGGGCAA CGACCGCCTG ACATTGATTG AATGCCTGAT CAGTCCTGAA
GAGATGGTAT ACCCCATGGT CCCGGAAGGG GCGGCCCTGG ACGAGATGAT TCTTCCGGAA
TAA
 
Protein sequence
MAQETSGRTG ARAVVEALLD EGVELVFGYP GGAVLPLYHE LARTPIRHIL VRQEQNAVHA 
ASGYARASGR TGVCFATSGP GATNLVTGIA TAFMDSVPVV IFTGQVPTRM VGSDAFQETD
ITGITMPITK HNYLVKDVEE LPRIVKEAFY IAGTGRPGPV LVDIPKDVAL APCRAPLPER
VELRGYKPTY HGHPGQLRSL ARILGEAERP LIFAGGGVQV SRAEDYLRQL VEKLQIPVVT
SLTGLGSFPE DNPLSLGMVG LHGKPCANHA LMECDLLVGL GVRFDDRVTG ALDKFAPRAR
IAHLDIDPAE IGKNVRVDLP LVGDISCILK ELLPLVEPAG HGPWLQRIKE LRNLYPLTYG
RGGEVRPQWV IERLGEMTRG QAIITTDVGQ HQMWAALFYG FTEPRTFISS CGLGTMGYGL
PAAVGAALAR PDKQVWLITG DGSFQMSMAE LGTAREQGVP LKILLFNNQS LAMVRQLQHF
YYERQYTAIE FTGNPDFVRL AECYGAEGLR ISKQEEVVPV LARAMGNDRL TLIECLISPE
EMVYPMVPEG AALDEMILPE