Gene Cthe_2518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2518 
Symbol 
ID4809274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2986810 
End bp2987805 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content44% 
IMG OID640107934 
Productketol-acid reductoisomerase 
Protein accessionYP_001038913 
Protein GI125975003 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0059] Ketol-acid reductoisomerase 
TIGRFAM ID[TIGR00465] ketol-acid reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000352718 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGA TTTATTATGA CAGTGATTGC AATCTTGGCT TGCTTAAAGG AAAAACTGTG 
GCGATAATAG GCTATGGAAG CCAGGGACAC GCACATGCTC AAAACCTGAG GGATAGCGGA
ATAGATGTCG TTGTCGGTTT GCCTGAGGGT TCAAAATCAA GACAGAAGGC TGTTGAGGAT
GGCCTGAGAG TTGAAAATAC TGATGTTGCG GCAAAGATGG CCGATGTTGT AATGATGCTT
GTTCCTGACC AGCTTCAGCA GGATATCTAT GAAAACAGCA TCAAGCCGAA TTTGGAGGAA
GGAAATGTTC TTGCTTTTGC CCACGGCTTT GCAATTCACT TTAAAACCAT TGTTCCTCCA
CCGGAAGTCG ATGTTATCAT GATAGCACCA AAAGGTCCCG GACATACGGT AAGAAGCCAG
TATGTGGAGG GAAAAGGTGT TCCGTCTCTG ATTGCGGTTT ACCAGGATGC TTCCGGCAAA
GCAAAAGATT ATGCTCTTGC ATATGCGGCA GGAATAGGAG CAGGAAGAGC CGGAATCCTT
GAAACTACCT TCAAAGAAGA GACAGAAACC GATTTGTTCG GTGAACAGGC CGTATTGTGC
GGAGGAGTAA CCGAACTTAT GAAGGCCGGA TTTGAGACTC TGGTTGAAGC CGGATATCAG
CCTGAAATAG CTTATTTTGA ATGTATTCAC GAAATGAAGC TCATTGTAGA CCTTATAAAC
CAGGGCGGAT TCTCATACAT GAGATACTCC ATAAGTGATA CGGCAGAATA TGGTGACTAC
ATAACAGGAA AGAAGATAAT TACCGAAGAA ACAAGAAAAG CTATGAAGGC TGTATTGAAA
GACATCCAGG ACGGAGTTTT TGCTTCAAAA TGGATAACTG AGAACAAGGC GGGAGGAAGG
GCACAGTTCC TTGCCATGAG AAGAAATGAA GCAGAACACC AGCTGGAGAA AGTCGGCGCG
GAATTAAGAA AGATGATGAG CTGGCTTAAG AAATAA
 
Protein sequence
MAKIYYDSDC NLGLLKGKTV AIIGYGSQGH AHAQNLRDSG IDVVVGLPEG SKSRQKAVED 
GLRVENTDVA AKMADVVMML VPDQLQQDIY ENSIKPNLEE GNVLAFAHGF AIHFKTIVPP
PEVDVIMIAP KGPGHTVRSQ YVEGKGVPSL IAVYQDASGK AKDYALAYAA GIGAGRAGIL
ETTFKEETET DLFGEQAVLC GGVTELMKAG FETLVEAGYQ PEIAYFECIH EMKLIVDLIN
QGGFSYMRYS ISDTAEYGDY ITGKKIITEE TRKAMKAVLK DIQDGVFASK WITENKAGGR
AQFLAMRRNE AEHQLEKVGA ELRKMMSWLK K