Gene Cthe_2714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2714 
Symbol 
ID4810708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3202721 
End bp3204388 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content45% 
IMG OID640108133 
Productacetolactate synthase, large subunit 
Protein accessionYP_001039106 
Protein GI125975196 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTA CCGGTTCGAA AATATTAATT GAGTGCCTTA AGGAACAAGG TGTGGATACA 
ATATTTGGAT TTCCGGGCGG AGCGGTATTG AATATATATG ATGAACTGTA CAGGTCGCAG
AATGAAATCC GGCATATACT GACTTCCCAC GAACAGGGAG CTGCTCATGC CGCAGACGGC
TATGCAAGGG CAACGGGAAA GGTTGGGGTG TGTCTTGCCA CATCCGGCCC CGGGGCCACA
AATCTGGTTA CAGGAATAGC AACCGCTTAC ATGGATTCGG TTCCGATGGT TGCAATTACA
GGACAGGTGG CGACGCCGCT CTTGGGCAAG GATTCCTTCC AGGAAGTTGA CATAACCGGA
ATAACAATGC CCATAACAAA ACACAATTTT ATAGTAAAGG ATGTTAACAA GCTGGCTGAC
ATAGTGCGAA GAGCCTTTTA CATAGCAAAA GAAGGAAGAC CAGGCCCGGT TCTGATTGAT
ATATGCAAAG ATGTAACGGC GGCATATGCT GAATATGAAC CAAAGTCCCC TCAGGAATTG
CCCGAGGTAC CTGTAAGAGT GGATGAAAAG TGTATTGATG AAGCGGCGGA GGCAATTAAC
AAAGCGGAAA GACCGGTTAT TCTCGCCGGA GGAGGCGTAT CAATCGCGGG AGCAAATAAA
GAACTTTTTG AGTTTGCAAC AAATGCCCAG ATACCAGTTA CCACAACTTT AATGGGCATG
GGTGCTTTCC CGGGAACCCA TGAGCTGTTC ATGGGAATGA TTGGAATGCA CGGCACAAAG
ACGACAAACA TGGCGGTTTC GGAATCGGAT CTTTTTATTG CGATTGGTGC AAGATTTAGT
GACAGAGTGA TAAGCAATGT TCAGAGATTT GCACCTAAAG CAAGCATAAT GCATATAGAC
ATTGACCCTG CCGAAATCGG AAAAAATATT AATGTTCAAT ATGCTCTTGA GGGAAACATC
AAGAAAATAT TGCAGCTTCT GAACGAGAGA GTAAAGAAGA AAGAATGCAC TGACTGGGTT
AGAAAAATCA ATGAGTGGAA GGAACTGTAT CCTCTTAAGT ATCCTCAGGA TGACAAGCTT
CATCCGCAAT ATATTATTGA GAGAATGTAT GAACTTACCA AAGGAGAGGC AATAATAACT
ACCGAGGTGG GTCAGCACCA GATGTGGGCC GCCCAGTTTT ACAAATACAC TTCTCCAAGA
CAGTTCCTGT CCTCAGGTGG TCTGGGTACC ATGGGATATG GTCTTGGAGC ATGCATTGGC
GCCCGGATTG GAAGACCCGA CAAAAAGGTA ATTAATGTTG CCGGTGACGG CAGCTTCAGA
ATGAACTGCA ATGAGCTGGC CACAGCTGTT GAGTACAAGC TTCCGATAAT AGTTGCGATA
TTCAACAATC ATGCTCTGGG AATGGTAAGA CAGTGGCAGC AGTTGTTCTA CGGCGGAAGG
TATTCCTCAA CCTCGCTGGA CAGATGTACA GACTTTAAAG CTTTGGCGGA AGCTTACGGT
GCAATCGGTA TAAATGTCAC AGCCAAAGAA GAAGTCGATG AAGCTTTAAA CAGAGCACTG
GCGTCTGAGG ATACCCCTGT GGTAATCAAT TTTGAAATTG ACAAGGATGA AATGGTATTT
CCTATTGTTC CGCCGGGAGC TCCTTTAAGC GAGCTTATTG AGGAGTAA
 
Protein sequence
MKLTGSKILI ECLKEQGVDT IFGFPGGAVL NIYDELYRSQ NEIRHILTSH EQGAAHAADG 
YARATGKVGV CLATSGPGAT NLVTGIATAY MDSVPMVAIT GQVATPLLGK DSFQEVDITG
ITMPITKHNF IVKDVNKLAD IVRRAFYIAK EGRPGPVLID ICKDVTAAYA EYEPKSPQEL
PEVPVRVDEK CIDEAAEAIN KAERPVILAG GGVSIAGANK ELFEFATNAQ IPVTTTLMGM
GAFPGTHELF MGMIGMHGTK TTNMAVSESD LFIAIGARFS DRVISNVQRF APKASIMHID
IDPAEIGKNI NVQYALEGNI KKILQLLNER VKKKECTDWV RKINEWKELY PLKYPQDDKL
HPQYIIERMY ELTKGEAIIT TEVGQHQMWA AQFYKYTSPR QFLSSGGLGT MGYGLGACIG
ARIGRPDKKV INVAGDGSFR MNCNELATAV EYKLPIIVAI FNNHALGMVR QWQQLFYGGR
YSSTSLDRCT DFKALAEAYG AIGINVTAKE EVDEALNRAL ASEDTPVVIN FEIDKDEMVF
PIVPPGAPLS ELIEE