Gene Cthe_2393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2393 
Symbol 
ID4811045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2858204 
End bp2859139 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content44% 
IMG OID640107806 
Productthiamine pyrophosphate enzyme-like TPP-binding 
Protein accessionYP_001038788 
Protein GI125974878 
COG category[C] Energy production and conversion 
COG ID[COG1013] Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00141499 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTATA ATTTGAAAGA AGTTGCAAAA AAACCGGAAA GACTTACGGG CGGACACAGA 
ATGTGTGCAG GCTGCGGAGC TCCGATAGTT GTAAGACAGG TTCTTAAGGC ATTAAAACCG
GAAGATCATG CGGTTATCTC AGCTGCAACA GGTTGTTTGG AAGTTTCAAC TTTCATTTAC
CCTTATACAG CATGGAAGGA TTCTTTCATT CACAGTGCGT TTGAAAATAC CGGTGCTACA
ATTTCCGGTG CGGAAGCGGC TTATAAAGTA TTGAAGAAAA AAGGAAAAAT TGAAGGGGAG
ACCAAGTTTA TTGCGTTCGG TGGTGACGGC GGAACATACG ACATAGGACT TCAGGCACTC
TCAGGAGCGA TGGAAAGAGG ACACGACATG GTTTATGTGT GCTACGACAA TGGAGCATAC
ATGAACACAG GTATCCAGAG GTCTTCTGCC ACTCCGAAAT ACGCTGATAC CACAACTTCA
CCTGTTGGAA AGAAGATACC CGGTAAAATG CAGCCAAGAA AAGACCTGAC AGAAGTATTG
GTAAATCATC GCATACCTTA TGTTGCTCAA ACCGCTCCTT TCGGGAACAT GAAGGACCTC
TATGAAAAAG CTGAAAAAGC TATTTATACA CCCGGTCCTG CGTTCCTGAA CGTGTTGGCA
CCGTGCCCGA GAGGATGGAG ATACAACACT CCTGATTTGA TGGAATTGAG CAAATTGGCG
GTTGAAACTT GCTTCTGGCC GCTTTATGAA GTAATTGACG GCAAATATAT AATAAACTAC
AAGCCGAAGG AAAAAGTTCC CGTCAAGGAA TTCTTGAAAC TTCAGGGAAG ATTTAAACAT
CTTTTCAAAG CCGGCAACGA ATATATGCTG GAAGAAATTC AGAAAGAAGT CGACTTAAGA
TGGGAGAGAC TCTTGAAGCT GGCCGGAGAG GCTTAA
 
Protein sequence
MAYNLKEVAK KPERLTGGHR MCAGCGAPIV VRQVLKALKP EDHAVISAAT GCLEVSTFIY 
PYTAWKDSFI HSAFENTGAT ISGAEAAYKV LKKKGKIEGE TKFIAFGGDG GTYDIGLQAL
SGAMERGHDM VYVCYDNGAY MNTGIQRSSA TPKYADTTTS PVGKKIPGKM QPRKDLTEVL
VNHRIPYVAQ TAPFGNMKDL YEKAEKAIYT PGPAFLNVLA PCPRGWRYNT PDLMELSKLA
VETCFWPLYE VIDGKYIINY KPKEKVPVKE FLKLQGRFKH LFKAGNEYML EEIQKEVDLR
WERLLKLAGE A