Gene Cthe_2561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2561 
Symbol 
ID4809168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3031827 
End bp3032966 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content36% 
IMG OID640107976 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_001038955 
Protein GI125975045 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATT TTTATAAAAA CAAAAGAGTG CTTATAACAG GTCATACCGG TTTTAAAGGT 
TCATGGTTAT CCGAGATATT GCTGCAATTT GGTGCCGAGG TTTGCGGATA TGCACTGGAA
TCAAAAGAAA GCTCGGATTT ATATTTAAAT CTGAAGCTGC ACAAAAACAT GAACTCATAT
ATTGGCGACA TTAGAAACTA TGACAAATTA AAAAAAGTAT TCGATACATT CAAACCTGAG
ATTGTTTTTC ATTTGGCTGC GCAGCCTATT GTGAGGGAAT CGTATAAAAA TCCTTTATAT
ACCTATGAAA CCAATGTCAT GGGAACAGTC AATCTGCTCG AAGCAGTAAG ACACTGCAGT
TCAGTCAGGT CCGTGGTTAA TGTAACAACC GATAAAGTAT ATAAAAATAT AAATGTTAAC
AAAGGATACA CTGAAACAGA CTATTTGTGT GGACAAGAAC CCTATTCAAA TTCCAAGTCA
TGTTCGGAAT TGGTAACCTA CAGCTATAAA AAATCTTTCT TTGATACTGA TGATTCTCCG
GCTGTTTCCA CTGCCAGAGC AGGAAATGTC ATTGGCGCGG GAGATTTTTC AAAAAACAGA
ATTATTCCGG ATTGTGTAAG AGCAGCATTC AGCAGGAACA AAATAGAGAT CAGAAATCCC
TATTCAATAA GGCCGTATCA GTATGTAATG GATTGCTTGT ACGGGTATCT GCTTATTGGA
ATGAAGCAAT ACTGCGACAG AAGTCTGGCG GGAGCATACA ATTTCGGACC TAAAGAAGAT
GATTGCAAAA CCACAATAGA AATCGTGGAT AAATTCTGTC ATGTCTGGGG TGACGGGTTG
GACTATTATA CAAAACCGGA TGATTCAGTA TATGAAAGTC AGATATTGAT GCTGGACAGT
AGTAAGTCCA ACAAGTTATT AAATTGGAAT CCTCAATATG ATATAGATCA TGCCATGCAT
AAGACCGTAG AATTGTATAA ACTGATTTAT GAAAAGAATT TTGACAAATA TGCTTGTTCA
CATATAGAGG ATTTTTTCAG CGGAGTATCA GCTTTTAAAA ACAACAGCCC ATTATCCATC
TCTGCAAAAA AATCCGCAAA AAACAATTCT CTCCATGAAA ATCAAATTTG TGCAATGTAA
 
Protein sequence
MKNFYKNKRV LITGHTGFKG SWLSEILLQF GAEVCGYALE SKESSDLYLN LKLHKNMNSY 
IGDIRNYDKL KKVFDTFKPE IVFHLAAQPI VRESYKNPLY TYETNVMGTV NLLEAVRHCS
SVRSVVNVTT DKVYKNINVN KGYTETDYLC GQEPYSNSKS CSELVTYSYK KSFFDTDDSP
AVSTARAGNV IGAGDFSKNR IIPDCVRAAF SRNKIEIRNP YSIRPYQYVM DCLYGYLLIG
MKQYCDRSLA GAYNFGPKED DCKTTIEIVD KFCHVWGDGL DYYTKPDDSV YESQILMLDS
SKSNKLLNWN PQYDIDHAMH KTVELYKLIY EKNFDKYACS HIEDFFSGVS AFKNNSPLSI
SAKKSAKNNS LHENQICAM