Gene Cthe_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1086 
Symbol 
ID4811384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1293343 
End bp1294329 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content40% 
IMG OID640106508 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_001037511 
Protein GI125973601 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.889547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAA TGCTTGTTAC AGGCGGAGCA GGTTTTGTAG GAAGCAACTT TATACGCTTT 
TTTCTCAGAA GAAACAAAAA TTTTATTATC ATAAACATGG ATAATTTAAG CTCCACATCA
AACCTGGAAA ACGTAAAAGA TTTGGAAAAG TCGCCCCGAT ACCATTTTGT AAAGGGCAGT
ATAACAAACC ATGAACTTGT AAACTATGTT ATTAAAAGGC ATAGACCCGA CTGTATAATT
AATTTTGCAT CAGAATCAAG CCTGGATAAT TGCGCAAACA ATCCGCTAAA TTTCACACAG
ACCAACGTCC TCGGTACGCA GACGCTGCTT GAAAGCGCCC GTTATTTCTG GGGAAAAAAC
AAATTTCAGG GCAACCTCTT TATTCAAGTG TCAACCGGTG AGGTATATGG GAGCACACCG
GCAAATGATG TATTTTTCAG TGAGGAAGCA CCGCTTTTGT CTGACAATCC GTTTTCAGCT
TCCAAAGCCG GAGCAGATAT GCTGGTAAAA TCCTATACGA TTACCTATGG TTTTCCGGCA
ATAATAACCC GGTGCTGCCC AACTTACGGA CCTTGTCAGC ATATTGGAAA TTTTATTCCG
AAATGCATAA TAAATGCGCT TTCGGATAAA CCCATTACGG TCTGTGAAAA CAAAGTGCGG
GAGTGGATAT ATGTACTGGA CCACTGCATA GCTCTTACAA AGATTTTGTT TTACGGCCGG
ACAGGTGAAA TCTACAACAT CTCCTCCGGC AACGAAATAT CGGACTTTGA CGTGGCAAAA
AAGATTCTCG GACTTGTCGG CAAGCCCGAC AGCGCAATTG AAAAGGCAGA TGACAGTTCT
CTTCCAACCA AAAGATGTAT TCTTAACAGC TACAAACTGA AAAGCAATTT GAATTGGAGT
ATCAAGTTCA AACTAGAAGA AGGATTAAGG GAAACCATCT TATGGTACAA GCAAAATCCG
GATAGGTGGA AAAATGTAGA ATTATAA
 
Protein sequence
MKVMLVTGGA GFVGSNFIRF FLRRNKNFII INMDNLSSTS NLENVKDLEK SPRYHFVKGS 
ITNHELVNYV IKRHRPDCII NFASESSLDN CANNPLNFTQ TNVLGTQTLL ESARYFWGKN
KFQGNLFIQV STGEVYGSTP ANDVFFSEEA PLLSDNPFSA SKAGADMLVK SYTITYGFPA
IITRCCPTYG PCQHIGNFIP KCIINALSDK PITVCENKVR EWIYVLDHCI ALTKILFYGR
TGEIYNISSG NEISDFDVAK KILGLVGKPD SAIEKADDSS LPTKRCILNS YKLKSNLNWS
IKFKLEEGLR ETILWYKQNP DRWKNVEL