Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0997 |
Symbol | |
ID | 4811291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1194233 |
End bp | 1195291 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106415 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001037422 |
Protein GI | 125973512 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATACA TCAAAAGAAA GAAAACAAGA AAGGTGCGAG TTGGAGATAT ATATATAGGC GGCGACGCAA AAATTACTGT TCAGTCCATG ACAAATACCG ACACAAGGGA CGTTGAGGCT ACCGTCAATC AAATAAAAAT GCTTGAGGAT ATCGGCTGTG ACATTATAAG GGTTGCTGTT GTGGACCAGG AGGCGGCAGA GGCGATAAAG GAGATAAAAA AGTCCATAAA GATTCCCCTG GTGGCTGATA TTCATTTTGA TTATCGGCTT GCCATATCCA GTATGGAAAA CGGAGCAGAC AAGATAAGAC TTAATCCGGG AAACATAGGA GACAGGGAAA GAGTAAGAAA GGTTGTTGAG GTTGCAAAGT CCAGGCAGAT ACCAATTCGC ATTGGGGTGA ATTCCGGTTC CCTTGAGAAG AATGTCATTG AAAAGTACGG CGGAATAACT CCTGAGGCGA TGGTGGAAAG TGCGCTGCAG CATGTTCGAA TATTGGAGGA ATTGGATTTT TACGATATCG TAATTTCCCT AAAGGCTTCA AGTGTTCCAA TGACCATAGC GGCTTACCGC CTGATGTCTG AAAAAACCGA TTATCCTTTG CATATCGGTG TTACCGAAGC GGGAACCGTG TTTAAAGGCA CCATTAAATC CTGTGCCGGA TTAGGCTGCC TTTTGGCTGA AGGCATAGGA GACACAATAA GAGTATCGCT TACCGGGGAT CCAAAGGAAG AGGTTTTGGT CGGACATGAG CTGTTAAGGG CTTTAGGTAT TGAAAAAGGC GGGATTGAGC TTGTTTCCTG CCCTACATGC GGAAGATGTC AGATTGACTT GATTGGAATA GCTGAAAAAG TGGAAGAAAG GCTTGAAGGT CTTGATAAAA ATATCAAAGT GGCAATTATG GGTTGTGCCG TAAACGGACC GGGTGAAGCT AAGGAAGCGG ATATTGGCAT TGCCGGGGGA AAAGGTGAAG TATTGTTGTT TAAAAAGGGA GTTATAGTCC GTAAGATCCC CCAGGAAAGG GCAGTGGAAG AACTTATGGA GGAAATACTG AGAATGTAA
|
Protein sequence | MEYIKRKKTR KVRVGDIYIG GDAKITVQSM TNTDTRDVEA TVNQIKMLED IGCDIIRVAV VDQEAAEAIK EIKKSIKIPL VADIHFDYRL AISSMENGAD KIRLNPGNIG DRERVRKVVE VAKSRQIPIR IGVNSGSLEK NVIEKYGGIT PEAMVESALQ HVRILEELDF YDIVISLKAS SVPMTIAAYR LMSEKTDYPL HIGVTEAGTV FKGTIKSCAG LGCLLAEGIG DTIRVSLTGD PKEEVLVGHE LLRALGIEKG GIELVSCPTC GRCQIDLIGI AEKVEERLEG LDKNIKVAIM GCAVNGPGEA KEADIGIAGG KGEVLLFKKG VIVRKIPQER AVEELMEEIL RM
|
| |