Gene Cthe_0999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0999 
Symbol 
ID4811293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1196978 
End bp1198123 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content43% 
IMG OID640106417 
Product1-deoxy-D-xylulose 5-phosphate reductoisomerase 
Protein accessionYP_001037424 
Protein GI125973514 
COG category[I] Lipid transport and metabolism 
COG ID[COG0743] 1-deoxy-D-xylulose 5-phosphate reductoisomerase 
TIGRFAM ID[TIGR00243] 1-deoxy-D-xylulose 5-phosphate reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000542837 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAACA GGATTTCGAT TCTTGGCTCT ACAGGTTCCA TAGGTGTTCA GACCCTGGAT 
GTGGCCAGAA ATTTAAATAT AAAGGTTGAT GGACTGGCGG CAAATAAAAA CATAGATTTG
CTTGAAAAAC AGGCCAGAGA GTTTCAACCG AAGATAGTTG CGGTAAAGGA CGAAGAGAGA
GCGAGAATTT TAAGAGACAG GCTTTCTGAT ACCGACTGCA AAGTGGTGGG CGGTGTTGAA
GGCCTTAAAA TGGTGGCTTC TATTGAAACT GTTGAAACCG TTGTTACTTC TATTGTCGGA
ATTGCCGGCC TTATTCCCAC CATGGAGGCC ATAAAGCATA AAAAAAATAT AGCACTGGCA
AACAAGGAAA CCCTTGTAAC AGCGGGGCAT ATTGTCATGT CCGAGGCTGC CAGAATGGGT
GTTAAGATTC TTCCGGTGGA CAGTGAACAT TCTGCTGTTT TTCAGAGTTT AATGGGTAAT
AATAAAAAAG ATGTGGCAAA AATAATTTTG ACCGCGTCGG GAGGCCCCTT TAGAGGAAGA
AAAAAGGAAG AACTTCGAAA TGTCACGCTC AGGGAAGCAT TAAATCATCC TAACTGGAGC
ATGGGCAGCA AAATAACAAT TGATTCTGCA ACCATGATGA ATAAAGGTTT GGAGGTTATT
GAGGCTCACT GGCTTTTTGA AATACCGCAG GATGATATTG AGGTTTTGGT GCATCCGCAG
AGTATCATTC ATTCAATGGT TGAATACAAA GACGGTTCGA TAATTGCCCA GCTGGGCTCT
CCGGATATGA GGCTTCCGAT ACAGTTTGCC CTGACATATC CGGACCGAAA GCAAAACAAC
TTTTCAAAGC TTGACATTGT CAAGATTGGT AGTCTAACCT TTGAAGCTCC CGACCTTGAG
GCGTTTCCGT GCCTTGGGCT TGCTTTTGAG GCGTTACGGG CCGGTGGTAC CATGCCTGCG
GTGCTGAATG CGGCGAATGA AAAAGCCGTT GGATTGTTTT TGCAGGAGAA AATAAGGTTT
TTGGATATCC CCGAAATTAT AGAAAAAGTA ATGGGAAGAC ATTCAGTAAA ACCGGATCCG
GACATTGACG ATATAATTGA TGTCGATTTG TGGGCAAGGA AAATAGTTGA AGAAATTGTT
AAATAG
 
Protein sequence
MVNRISILGS TGSIGVQTLD VARNLNIKVD GLAANKNIDL LEKQAREFQP KIVAVKDEER 
ARILRDRLSD TDCKVVGGVE GLKMVASIET VETVVTSIVG IAGLIPTMEA IKHKKNIALA
NKETLVTAGH IVMSEAARMG VKILPVDSEH SAVFQSLMGN NKKDVAKIIL TASGGPFRGR
KKEELRNVTL REALNHPNWS MGSKITIDSA TMMNKGLEVI EAHWLFEIPQ DDIEVLVHPQ
SIIHSMVEYK DGSIIAQLGS PDMRLPIQFA LTYPDRKQNN FSKLDIVKIG SLTFEAPDLE
AFPCLGLAFE ALRAGGTMPA VLNAANEKAV GLFLQEKIRF LDIPEIIEKV MGRHSVKPDP
DIDDIIDVDL WARKIVEEIV K