Gene Cthe_0601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0601 
Symbol 
ID4808203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp736904 
End bp737974 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content41% 
IMG OID640106015 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001037029 
Protein GI125973119 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTGG ACAATCTTTA CAGAGTGCTG GATGCCAATG TCAACAGGAC ATCGGAAGGG 
CTTAGGGTTT TGGAGGATTT GGCAAGATTC TGTTATAATG ACAGACTGCT TTCAAAGAGG
ATTAAGGAAC TAAGACACAG TGTAAGAAAG AATATTGCAG GATTGGTTCC AAACCTTATA
AGCAGCAGGG ATTCGGTGAA TGATGTGGGT TTAAAAACGT CAATGGAGAT GGATATCGAC
CGAAAAGCCT CACTTTTGGA TCTTGCCCGG GCAAATTTCA AGCGGGTTCA GGAAGCTTTG
AGGACTGTGG AGGAAAGCCT TAAAGTTTTG AATGAAAATG ACCTTTCCAA GTTTTATGAA
AGTTGCAGGT TTGAAACGTA CAGCATTGAG AAGGAATATT TTAAAGTTTT GACTTTCGAA
AATAAGAAAG GCAGATTGAA TGAAATAATA ACCGGTCTTT ATTGTATTAC ATCGGAAGAA
CACTCCAAAG GGCGCAGTAA TATTGAGGTT GTGGAGAAAA TGATAAAGGC CGGGGTAAAG
ATAATTCAAT ACAGGGAAAA AAAGAAGAGT CTTTTGGAAA AATACAACGA ATGCAAAAAA
ATAAGGGAAA TGACCTTAGA TTCGGGCGTT ACATTTATCG TAAACGACAA CATTGATATT
GCAATGATGG TAAAGGCCGA CGGAGTACAT ATAGGTCAGG ATGATCTTCC CATAGAAAAA
GTAAGAGAGC TTGTGGGGGA TGAGATGATT ATCGGGATAT CCACCCATTC TCCAACGCAG
GCGGAAGACG CGGTAAGACG CGGAGCTGAT TATATAGGAG TGGGTCCTCT TTACAGAACA
TATACAAAGG AGGATGTCTG CGAACCTGTA GGGCTTGAGT ACCTTGACTA TGTTGTGAAG
AACATAAATA TTCCCTATGT TGCCATAGGT GGCATAAAGG AACACAACAT GGATGAGGTT
TTGGCCCGGG GAGCCCGGTG TATAGCCATG GTTACAGAGA TTGTGGGCGC GGATGACATA
GAAGAAAAAA TTTCCAAAGT AAAATCAAAA TTTTCGAGAG GGGTTTTATA A
 
Protein sequence
MGLDNLYRVL DANVNRTSEG LRVLEDLARF CYNDRLLSKR IKELRHSVRK NIAGLVPNLI 
SSRDSVNDVG LKTSMEMDID RKASLLDLAR ANFKRVQEAL RTVEESLKVL NENDLSKFYE
SCRFETYSIE KEYFKVLTFE NKKGRLNEII TGLYCITSEE HSKGRSNIEV VEKMIKAGVK
IIQYREKKKS LLEKYNECKK IREMTLDSGV TFIVNDNIDI AMMVKADGVH IGQDDLPIEK
VRELVGDEMI IGISTHSPTQ AEDAVRRGAD YIGVGPLYRT YTKEDVCEPV GLEYLDYVVK
NINIPYVAIG GIKEHNMDEV LARGARCIAM VTEIVGADDI EEKISKVKSK FSRGVL