Gene Cthe_0828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0828 
Symbol 
ID4810446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1008397 
End bp1010280 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content41% 
IMG OID640106245 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_001037256 
Protein GI125973346 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGACAGTA TTAATTTCCC TGATGATATA AAAAAACTGA ATTTGGAACA GTTAAAGCAA 
CTTGCAGGCG AGATTAGGTC TTTTCTTATT GAAAAGGTCT CAAAAACAGG CGGGCATCTT
GCTTCCAACT TGGGAGTCGT AGAGTTGACC CTTGCGCTTC ACAGGGTGTT TAACACTCCG
GAGGATAAAA TTATATGGGA TGTAGGTCAT CAATGCTATG TTCATAAGAT TATCACCGGA
AGAAAAGACA GGTTTGACAC CATAAGAAAG CTTGGCGGAC TTTCCGGTTT TCCAAAATCG
GCGGAAAGTG AGTATGATGC TTTCAATACC GGGCACAGCA GTACTTCCAT ATCTGCAGCC
CTGGGTATTG CAAAAGCAAG GGATTTAAGA AAAGAAAAAT ATTCGGTTGT TGCCGTTATC
GGAGACGGTG CCCTGACCGG AGGAATGGCT TTTGAAGCAT TGAATGATGC GGGAAGGTCA
CCGAATAATC TTATTGTTGT ATTAAATGAT AATGAAATGT CAATTTCGAA AAATGTAGGA
GGGCTTTCAG TTTATTTGAG CAAAATTCGA ACAGAACCCT TCTATTTTAA AGTTAAAGAA
GATATAGACA TTATTTTAAA CAAAATACCG GCAATCGGAA AAAGCGCGGT CAAAGCACTT
GGCAGGGTCA AAGGCACCAT AAAATACATG ATTATGCCGG GAATAGTGTT TGAAGAACTC
GGTTTTAAAT ATTTAGGACC TATTGACGGA CATAATATTG CCGAACTGGA AAACGTTCTT
ACAAGAGCCA AAAACACCAA AGGACCTGTA CTGGTGCATG TATGTACCCA AAAAGGAAGA
GGTTACACTT ACGCGGAAAA AAATCCGGCT GTTTTTCACG GCATCTCGCC CTTTGAGGTT
GAGACGGGGG AGGTTATTGC TAATAAAGTT CCGGGATATT CCGATGTATT TGGAAGTGAA
ATTGTCAGGA TTGCTGAAAA AGAAGAAAGG GTTGTTGCTC TTACGGCTGC AATGCCTCAT
GGAACAGGTC TTATCAAATT TTCAAAGAGA TTTCCGGAAA GGTTTTTTGA CGTTGGCATA
GCCGAGCAAC ATGCGGTAAC TTTTGGTGCC GGGCTTGCAA AAAACGGGAT GATTCCGGTC
ATAGCTCTTT ATTCGTCTTT TCTCCAGAGA GCCTATGACC AGGTAGTGCA TGATGTGGCT
CTTCAAAATC TGCATGTGGT TTTTGCGATA GACAGGGCCG GAATAGTCGG GGAAGACGGG
GAGACACATC AGGGAATTTA TGACATATCT TTTTTAAGAC ATATACCAAA TATGACCATT
CTTGCTCCCT GTGATTATAA TGAGCTTGCC AAAATGCTTG AGTATGCCGT ACTGGAGCAT
AGCGGTCCGA TAGCGATAAG GTACCCGAGA GGAGCAGGAC CTGAAAAGCT TTTTGACACC
CCTGACATAA AGTTGGGACA ATCTCTGCTT ATAAGTGAAG GAAATGATGT TACCATTGCG
GCTGTCGGCA ACAAGGTGGA AGTGGCCATG AAGGTTGCCG AAAAGCTTAA GGAGACAGGT
TTGTCTGCGG ATGTGATTTA TTGCAGATTT ATAAAGCCCC TTGATTCAAA TACCATTATA
AATTCCGTAC TTAAAACAAA AAGACTTGTA ACAATAGAGG ATAATACCGT TGAGGGTGGA
TTTGGAAGCA GAGTTTTGGA AACAATAAAC CAGAAGGGGA TAAATGTCAC TACAAGAATG
TTTGGATATC CGGATGCTTT TATTCCTCAT GGCTCTATCA AAGAACTGGT GCATATGTAC
AGACTGGATC CGGATTCCAT TTTCAATGAT GTTTTAAAAC TGATAAATAA AAGCAAAGTG
AAAGAATTCC GAGCCATAAG ATAA
 
Protein sequence
MDSINFPDDI KKLNLEQLKQ LAGEIRSFLI EKVSKTGGHL ASNLGVVELT LALHRVFNTP 
EDKIIWDVGH QCYVHKIITG RKDRFDTIRK LGGLSGFPKS AESEYDAFNT GHSSTSISAA
LGIAKARDLR KEKYSVVAVI GDGALTGGMA FEALNDAGRS PNNLIVVLND NEMSISKNVG
GLSVYLSKIR TEPFYFKVKE DIDIILNKIP AIGKSAVKAL GRVKGTIKYM IMPGIVFEEL
GFKYLGPIDG HNIAELENVL TRAKNTKGPV LVHVCTQKGR GYTYAEKNPA VFHGISPFEV
ETGEVIANKV PGYSDVFGSE IVRIAEKEER VVALTAAMPH GTGLIKFSKR FPERFFDVGI
AEQHAVTFGA GLAKNGMIPV IALYSSFLQR AYDQVVHDVA LQNLHVVFAI DRAGIVGEDG
ETHQGIYDIS FLRHIPNMTI LAPCDYNELA KMLEYAVLEH SGPIAIRYPR GAGPEKLFDT
PDIKLGQSLL ISEGNDVTIA AVGNKVEVAM KVAEKLKETG LSADVIYCRF IKPLDSNTII
NSVLKTKRLV TIEDNTVEGG FGSRVLETIN QKGINVTTRM FGYPDAFIPH GSIKELVHMY
RLDPDSIFND VLKLINKSKV KEFRAIR