Gene Cthe_2516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2516 
Symbol 
ID4809272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2984630 
End bp2986258 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content46% 
IMG OID640107932 
Productacetolactate synthase, large subunit 
Protein accessionYP_001038911 
Protein GI125975001 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000744784 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTT CAGGTGCGCA GGCAATTGTG AAAGCTTTGG AATTGGAAGG CGTGGAAGTT 
GTTTTCGGTT ATCCGGGAGC TGCAATTTGT CCTTTCTATG ATGCGCTGAT GGAATCTAAA
ATAAGGCATA TACTCACCAG GCATGAACAG GGAGCTGCCC ATGCCGCCAG CGGATATGCC
AGAACAACGG GAAGGGTTGG TGTGTGTGTT GCCACATCAG GACCGGGAGC AACCAACCTT
ATTACAGGCA TTGCTGACGC ATACATGGAC TCAATACCGT TGGTGGCCAT CACCGGGCAG
GTTAATTCGG AGCTGATTGG AAGAGACGTG TTCCAGGAGG CTGACATTAC CGGCGCGACG
GATCCTTTCT GCAAGCATAA TTATTTGGTA AAAAACGCAA AGGATTTGCC CCGGGTTTTA
AAGGAAGCCT TTTACATAGC ATCCACCGGA AGGCCGGGCC CTGTTCTTAT CGACGTACCC
ATAGACGTGC AGACAAAAGA GATTAATTTT GAGTACCCGG AAAGTGTTGA TATAAAAGGC
TACAAGCCAA ATTTAAAAGG CCATTCTCTT CAGATAAAAA AGATTGCTCA AGCTATTGAA
AAGGCAAAAA AACCGGTTAT TTGCGCCGGG GGAGGAGTGA TTAATTCCAA TGCATCCGAG
GAGCTTTTAA CTCTTTCCCG AAAGTGCGGC ATACCTGTTG TTACAACTTT GATGGGTATC
GGGTCCGTGC CGTATGATTA TGAGCTCAAT TTAGGAATGC TTGGGACTCA CGGAGTATAT
ATTGCGAATT ATGCCGTAAA CAATGCTGAC CTTTTGATTA TCATCGGCGC GAGGGTTGCG
GACAGGGCTA TAAGCAATCC CCAGCAGGTT GCAAAGAGAA AGCAGATAGT TCATATTGAC
ATAGATCCTG CGGAGATAGG CAAAAATATC GATGTTTCAA TACCGGTGGT AGGAGATGTG
AAGCAGGTAT TAAAAGAGCT TATAGATATT TCCCAAAAGG GAGATACGGA AGAATGGATA
AAGACAACTC AAAAAGAAAG AGAAAAACAT GCCGAAAAAC CTGAACCAAG GCCCGGTATA
GGTTTTGTGA ATCCCAAATA TTTGTTGTCT GTTTTGACCG GGCTTTTGGG TGATGATGAT
ATAATTACAA CAGAAGTGGG ACAGAACCAG ATATGGGCTG CAAACTATTT TGGTGTCAAA
AAGCCCCGGA CCTTTATAAC GTCCGGAGGT CTGGGTACCA TGGGATACGG GCTTCCCGCC
GCGGTGGGGG CAAAAATTGG CTGTCCCGAC CGCAAAGTGG TATGTGTGGG AGGAGACGGG
AGCTTCCAGA TGAACATGCA GGAGCTTGGC ACCATCAAGC AAAACAGGCT GGGAGTGAAA
GTAATCTTAT TCAACAACTC AAGGCTGGGA ATGGTAAGGG AGCTGCAAAA GACAAAGTAC
TGCGGCCGTT ATTTCCAGGT ATTTTTGGAC GACAATCCTG ACTTTATAAA GTTGTTTGAC
GCTTATGGTT TCAAGGGCAG GAGAATAGAC GACGATTCCC AGGTGGAAGA TGCGTTGAAA
GAGATGCTGT CGGACGACAA ACCTTACCTT CTCGAGTGCA AAATTGACCC GGAAGAATCA
ACACTATGA
 
Protein sequence
MKLSGAQAIV KALELEGVEV VFGYPGAAIC PFYDALMESK IRHILTRHEQ GAAHAASGYA 
RTTGRVGVCV ATSGPGATNL ITGIADAYMD SIPLVAITGQ VNSELIGRDV FQEADITGAT
DPFCKHNYLV KNAKDLPRVL KEAFYIASTG RPGPVLIDVP IDVQTKEINF EYPESVDIKG
YKPNLKGHSL QIKKIAQAIE KAKKPVICAG GGVINSNASE ELLTLSRKCG IPVVTTLMGI
GSVPYDYELN LGMLGTHGVY IANYAVNNAD LLIIIGARVA DRAISNPQQV AKRKQIVHID
IDPAEIGKNI DVSIPVVGDV KQVLKELIDI SQKGDTEEWI KTTQKEREKH AEKPEPRPGI
GFVNPKYLLS VLTGLLGDDD IITTEVGQNQ IWAANYFGVK KPRTFITSGG LGTMGYGLPA
AVGAKIGCPD RKVVCVGGDG SFQMNMQELG TIKQNRLGVK VILFNNSRLG MVRELQKTKY
CGRYFQVFLD DNPDFIKLFD AYGFKGRRID DDSQVEDALK EMLSDDKPYL LECKIDPEES
TL