Gene Information |
Locus tag | Cthe_2561 |
Symbol | |
ID | 4809168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3031827 |
End bp | 3032966 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107976 |
Product | CDP-glucose 4,6-dehydratase |
Protein accession | YP_001038955 |
Protein GI | 125975045 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02622] CDP-glucose 4,6-dehydratase |
|
|
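The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3031827
end_bp = 3032966
reported_gene_length = 1140      # bp
reported_protein_length = 379    # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
expected_protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(expected_protein_length, expected_protein_length == reported_protein_length)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.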
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATT TTTATAAAAA CAAAAGAGTG CTTATAACAG GTCATACCGG TTTTAAAGGT TCATGGTTAT CCGAGATATT GCTGCAATTT GGTGCCGAGG TTTGCGGATA TGCACTGGAA TCAAAAGAAA GCTCGGATTT ATATTTAAAT CTGAAGCTGC ACAAAAACAT GAACTCATAT ATTGGCGACA TTAGAAACTA TGACAAATTA AAAAAAGTAT TCGATACATT CAAACCTGAG ATTGTTTTTC ATTTGGCTGC GCAGCCTATT GTGAGGGAAT CGTATAAAAA TCCTTTATAT ACCTATGAAA CCAATGTCAT GGGAACAGTC AATCTGCTCG AAGCAGTAAG ACACTGCAGT TCAGTCAGGT CCGTGGTTAA TGTAACAACC GATAAAGTAT ATAAAAATAT AAATGTTAAC AAAGGATACA CTGAAACAGA CTATTTGTGT GGACAAGAAC CCTATTCAAA TTCCAAGTCA TGTTCGGAAT TGGTAACCTA CAGCTATAAA AAATCTTTCT TTGATACTGA TGATTCTCCG GCTGTTTCCA CTGCCAGAGC AGGAAATGTC ATTGGCGCGG GAGATTTTTC AAAAAACAGA ATTATTCCGG ATTGTGTAAG AGCAGCATTC AGCAGGAACA AAATAGAGAT CAGAAATCCC TATTCAATAA GGCCGTATCA GTATGTAATG GATTGCTTGT ACGGGTATCT GCTTATTGGA ATGAAGCAAT ACTGCGACAG AAGTCTGGCG GGAGCATACA ATTTCGGACC TAAAGAAGAT GATTGCAAAA CCACAATAGA AATCGTGGAT AAATTCTGTC ATGTCTGGGG TGACGGGTTG GACTATTATA CAAAACCGGA TGATTCAGTA TATGAAAGTC AGATATTGAT GCTGGACAGT AGTAAGTCCA ACAAGTTATT AAATTGGAAT CCTCAATATG ATATAGATCA TGCCATGCAT AAGACCGTAG AATTGTATAA ACTGATTTAT GAAAAGAATT TTGACAAATA TGCTTGTTCA CATATAGAGG ATTTTTTCAG CGGAGTATCA GCTTTTAAAA ACAACAGCCC ATTATCCATC TCTGCAAAAA AATCCGCAAA AAACAATTCT CTCCATGAAA ATCAAATTTG TGCAATGTAA
Protein sequence | MKNFYKNKRV LITGHTGFKG SWLSEILLQF GAEVCGYALE SKESSDLYLN LKLHKNMNSY IGDIRNYDKL KKVFDTFKPE IVFHLAAQPI VRESYKNPLY TYETNVMGTV NLLEAVRHCS SVRSVVNVTT DKVYKNINVN KGYTETDYLC GQEPYSNSKS CSELVTYSYK KSFFDTDDSP AVSTARAGNV IGAGDFSKNR IIPDCVRAAF SRNKIEIRNP YSIRPYQYVM DCLYGYLLIG MKQYCDRSLA GAYNFGPKED DCKTTIEIVD KFCHVWGDGL DYYTKPDDSV YESQILMLDS SKSNKLLNWN PQYDIDHAMH KTVELYKLIY EKNFDKYACS HIEDFFSGVS AFKNNSPLSI SAKKSAKNNS LHENQICAM
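The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code, differing only in which start codons are permitted. The `translate` helper is hypothetical and written only for this illustration; only the first and last few codons of the record are used.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 15 bases of the gene sequence in this record.
gene_prefix = "ATGAAAAATT TTTATAAAAA CAAAAGAGTG"
gene_suffix = "ATTTGTGCAATGTAA"

print(translate(gene_prefix))  # MKNFYKNKRV
print(translate(gene_suffix))  # ICAM
```

The two outputs match the first ten and last four residues of the protein sequence. Passing the full 1140 bp gene sequence to the same function would be the natural way to check the complete 379-residue protein.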