Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0229 |
Symbol | |
ID | 4808577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 276040 |
End bp | 277059 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105641 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001036661 |
Protein GI | 125972751 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0115301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGAG TAATATTAGT CACCGGTGCG GCAGGCTTTA TCGGTTTCCA CTTGGTACAG CGCTTGTTAA AAGAGGGTTG TAACGTCGTA GGTATAGATA ATTTAAATGA GTATTACGAT GTTAAACTGA AAAAAGACCG CCTGAAATTG TTAAGTGAAA ATAAAAACTT TGTATTCCGC AAAGTTGACA TAAAAAACAA AAAGGCAGTG GACCGTATCT TTGAAACCTA TCGGCCTTCC TATGTAATCA ATCTTGCGGC ACAAGCGGGA GTGCGTTATT CCATTGAAAA TCCCTATGCC TACGTGGATT CAAATTTGGT AGGATTTGTG AACATTCTTG AGGCTTGCCG AAAATACCCT GTGAAGCACC TTATCTATGC TTCATCAAGT TCGGTATACG GGGGAAACAA AGTTTCGCCG TTTTCCACCA GACATAATGT GGACCATCCT GTGTCTCTTT ATGCAGCCAC AAAAAAATCC AATGAATTGC TGGCCCATAC CTACAGTCAT CTTTTCGGCA TTCCCACAAC AGGGCTGAGG TTTTTTACCG TTTACGGCCC CTGGGGAAGA CCGGATATGG CGTATTTCTC ATTTACAAAA GATATTTTAA GCGGAAACCC CATTAAAGTG TTCAATTATG GTAAAATGGA AAGAGACTTT ACTTATATTG ATGATGTGGT GGAAGGAATT GTAAAATTAA TTGACAGAAT CCCGACGCCT AATGAAAACT GGGATGAAAC TAAAGATGAC ATAAGTACCA GTTTTGCACC GTACAAAATC TACAATATCG GCAACAACAA TCCTGTTCCG TTAATGAATT TCATAAGTGT TTTAGAGTCA GCTCTTGGTA AGGTTGCAAA AAAAGTATAT TTGGATTTGC AACCCGGCGA TGTGCTCAGA ACCTATGCGG ATATTTCCGA CCTTGAAAGG GATATAAATT TCAAGCCGTC CACAAGTATT GAAGACGGGC TTCGAAAATT TGTACAGTGG TACAAGGAGT ATTATAAAGC TGAAATTTAG
|
Protein sequence | MEGVILVTGA AGFIGFHLVQ RLLKEGCNVV GIDNLNEYYD VKLKKDRLKL LSENKNFVFR KVDIKNKKAV DRIFETYRPS YVINLAAQAG VRYSIENPYA YVDSNLVGFV NILEACRKYP VKHLIYASSS SVYGGNKVSP FSTRHNVDHP VSLYAATKKS NELLAHTYSH LFGIPTTGLR FFTVYGPWGR PDMAYFSFTK DILSGNPIKV FNYGKMERDF TYIDDVVEGI VKLIDRIPTP NENWDETKDD ISTSFAPYKI YNIGNNNPVP LMNFISVLES ALGKVAKKVY LDLQPGDVLR TYADISDLER DINFKPSTSI EDGLRKFVQW YKEYYKAEI
|
| |