Gene Cthe_0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0229 
Symbol 
ID4808577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp276040 
End bp277059 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content40% 
IMG OID640105641 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001036661 
Protein GI125972751 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0115301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGAG TAATATTAGT CACCGGTGCG GCAGGCTTTA TCGGTTTCCA CTTGGTACAG 
CGCTTGTTAA AAGAGGGTTG TAACGTCGTA GGTATAGATA ATTTAAATGA GTATTACGAT
GTTAAACTGA AAAAAGACCG CCTGAAATTG TTAAGTGAAA ATAAAAACTT TGTATTCCGC
AAAGTTGACA TAAAAAACAA AAAGGCAGTG GACCGTATCT TTGAAACCTA TCGGCCTTCC
TATGTAATCA ATCTTGCGGC ACAAGCGGGA GTGCGTTATT CCATTGAAAA TCCCTATGCC
TACGTGGATT CAAATTTGGT AGGATTTGTG AACATTCTTG AGGCTTGCCG AAAATACCCT
GTGAAGCACC TTATCTATGC TTCATCAAGT TCGGTATACG GGGGAAACAA AGTTTCGCCG
TTTTCCACCA GACATAATGT GGACCATCCT GTGTCTCTTT ATGCAGCCAC AAAAAAATCC
AATGAATTGC TGGCCCATAC CTACAGTCAT CTTTTCGGCA TTCCCACAAC AGGGCTGAGG
TTTTTTACCG TTTACGGCCC CTGGGGAAGA CCGGATATGG CGTATTTCTC ATTTACAAAA
GATATTTTAA GCGGAAACCC CATTAAAGTG TTCAATTATG GTAAAATGGA AAGAGACTTT
ACTTATATTG ATGATGTGGT GGAAGGAATT GTAAAATTAA TTGACAGAAT CCCGACGCCT
AATGAAAACT GGGATGAAAC TAAAGATGAC ATAAGTACCA GTTTTGCACC GTACAAAATC
TACAATATCG GCAACAACAA TCCTGTTCCG TTAATGAATT TCATAAGTGT TTTAGAGTCA
GCTCTTGGTA AGGTTGCAAA AAAAGTATAT TTGGATTTGC AACCCGGCGA TGTGCTCAGA
ACCTATGCGG ATATTTCCGA CCTTGAAAGG GATATAAATT TCAAGCCGTC CACAAGTATT
GAAGACGGGC TTCGAAAATT TGTACAGTGG TACAAGGAGT ATTATAAAGC TGAAATTTAG
 
Protein sequence
MEGVILVTGA AGFIGFHLVQ RLLKEGCNVV GIDNLNEYYD VKLKKDRLKL LSENKNFVFR 
KVDIKNKKAV DRIFETYRPS YVINLAAQAG VRYSIENPYA YVDSNLVGFV NILEACRKYP
VKHLIYASSS SVYGGNKVSP FSTRHNVDHP VSLYAATKKS NELLAHTYSH LFGIPTTGLR
FFTVYGPWGR PDMAYFSFTK DILSGNPIKV FNYGKMERDF TYIDDVVEGI VKLIDRIPTP
NENWDETKDD ISTSFAPYKI YNIGNNNPVP LMNFISVLES ALGKVAKKVY LDLQPGDVLR
TYADISDLER DINFKPSTSI EDGLRKFVQW YKEYYKAEI