Gene Cthe_2220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2220 
Symbol 
ID4811085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2649234 
End bp2650178 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content45% 
IMG OID640107626 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001038615 
Protein GI125974705 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACATAT TGGTAACTGG AGGAGCCGGA TTTATAGGGC GCTGGGTGGT TAAAAGACTT 
TTGGAGGACG GACACAAGGT ATGGGTTTTG GACGACCTTT CCAACGGTCA GCGCAAAAAT
ATCGAAGAAT TTCTGTCAAA TCCCAACTTT GCCGGATTTG TTGAAGGTGA CATTAAAAAC
ATTCCTGTTC TTGAAACCCT TTTTGAAAAC AAGTTTGACA TATGCTATCA CCTGGCGGCG
AGTATTAACG TGCAGGACAG CATTGACGAC CCGGGTACAA CATTTCAAAA TGATGTGGTG
GGTACTTTCA ATGTGCTGGA ACAATGCAGA AAGCACAACA CTAAAATTGT TTTCATGAGC
ACCTGCATGG TGTATGACAG GGCAAACGAT GAAAACGGAA TAACCGAGGC CCATCCCACA
AAACCGGCCT CTCCCTATGC GGGCAGCAAG ATTGCAGGAG AGAATATGGT TTTGTCTTAC
TGGTACGCCT ATAAGCTGCC GGCGGTGGTT ATACGCCCCT TCAATACTTA CGGTCCTATG
CAGAAATCCA GCGGTGAGGG CGGCGTTGTG GCGATTTTCA TAAGGAGGAA TTTGGAAGGA
CTTCCTCTCA ATATATATGG GGACGGATGC CAGACAAGAG ATCTTCTTTA TGTTGAAGAC
TGTGCGGAAT TTGTTGTACG GGCAGGATAT TCCGACAGAG TAAACGGAGA AATAATAAAT
GCCGGACTGG GAAGGGATAT AAGTATAAAT GATCTTGCCC TTTTGATAGC AAAGGACAAA
GAAAAAATAG TACATGTGCC CCATATTCAT CCCCAAAGCG AGATAGCGAA GCTTCTGTGC
AACTATCAAA AGGCCAAAGA GCTTTTGGGA TGGACACCGA AAGTTTCTTT GGAGGAAGGC
ATAAAAAGAA CCGAAGAGTG GATAAGAAGC ACGCTTGCAA CTTAA
 
Protein sequence
MNILVTGGAG FIGRWVVKRL LEDGHKVWVL DDLSNGQRKN IEEFLSNPNF AGFVEGDIKN 
IPVLETLFEN KFDICYHLAA SINVQDSIDD PGTTFQNDVV GTFNVLEQCR KHNTKIVFMS
TCMVYDRAND ENGITEAHPT KPASPYAGSK IAGENMVLSY WYAYKLPAVV IRPFNTYGPM
QKSSGEGGVV AIFIRRNLEG LPLNIYGDGC QTRDLLYVED CAEFVVRAGY SDRVNGEIIN
AGLGRDISIN DLALLIAKDK EKIVHVPHIH PQSEIAKLLC NYQKAKELLG WTPKVSLEEG
IKRTEEWIRS TLAT