Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0828 |
Symbol | |
ID | 4810446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1008397 |
End bp | 1010280 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106245 |
Product | 1-deoxy-D-xylulose-5-phosphate synthase |
Protein accession | YP_001037256 |
Protein GI | 125973346 |
COG category | [H] Coenzyme transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1154] Deoxyxylulose-5-phosphate synthase |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACAGTA TTAATTTCCC TGATGATATA AAAAAACTGA ATTTGGAACA GTTAAAGCAA CTTGCAGGCG AGATTAGGTC TTTTCTTATT GAAAAGGTCT CAAAAACAGG CGGGCATCTT GCTTCCAACT TGGGAGTCGT AGAGTTGACC CTTGCGCTTC ACAGGGTGTT TAACACTCCG GAGGATAAAA TTATATGGGA TGTAGGTCAT CAATGCTATG TTCATAAGAT TATCACCGGA AGAAAAGACA GGTTTGACAC CATAAGAAAG CTTGGCGGAC TTTCCGGTTT TCCAAAATCG GCGGAAAGTG AGTATGATGC TTTCAATACC GGGCACAGCA GTACTTCCAT ATCTGCAGCC CTGGGTATTG CAAAAGCAAG GGATTTAAGA AAAGAAAAAT ATTCGGTTGT TGCCGTTATC GGAGACGGTG CCCTGACCGG AGGAATGGCT TTTGAAGCAT TGAATGATGC GGGAAGGTCA CCGAATAATC TTATTGTTGT ATTAAATGAT AATGAAATGT CAATTTCGAA AAATGTAGGA GGGCTTTCAG TTTATTTGAG CAAAATTCGA ACAGAACCCT TCTATTTTAA AGTTAAAGAA GATATAGACA TTATTTTAAA CAAAATACCG GCAATCGGAA AAAGCGCGGT CAAAGCACTT GGCAGGGTCA AAGGCACCAT AAAATACATG ATTATGCCGG GAATAGTGTT TGAAGAACTC GGTTTTAAAT ATTTAGGACC TATTGACGGA CATAATATTG CCGAACTGGA AAACGTTCTT ACAAGAGCCA AAAACACCAA AGGACCTGTA CTGGTGCATG TATGTACCCA AAAAGGAAGA GGTTACACTT ACGCGGAAAA AAATCCGGCT GTTTTTCACG GCATCTCGCC CTTTGAGGTT GAGACGGGGG AGGTTATTGC TAATAAAGTT CCGGGATATT CCGATGTATT TGGAAGTGAA ATTGTCAGGA TTGCTGAAAA AGAAGAAAGG GTTGTTGCTC TTACGGCTGC AATGCCTCAT GGAACAGGTC TTATCAAATT TTCAAAGAGA TTTCCGGAAA GGTTTTTTGA CGTTGGCATA GCCGAGCAAC ATGCGGTAAC TTTTGGTGCC GGGCTTGCAA AAAACGGGAT GATTCCGGTC ATAGCTCTTT ATTCGTCTTT TCTCCAGAGA GCCTATGACC AGGTAGTGCA TGATGTGGCT CTTCAAAATC TGCATGTGGT TTTTGCGATA GACAGGGCCG GAATAGTCGG GGAAGACGGG GAGACACATC AGGGAATTTA TGACATATCT TTTTTAAGAC ATATACCAAA TATGACCATT CTTGCTCCCT GTGATTATAA TGAGCTTGCC AAAATGCTTG AGTATGCCGT ACTGGAGCAT AGCGGTCCGA TAGCGATAAG GTACCCGAGA GGAGCAGGAC CTGAAAAGCT TTTTGACACC CCTGACATAA AGTTGGGACA ATCTCTGCTT ATAAGTGAAG GAAATGATGT TACCATTGCG GCTGTCGGCA ACAAGGTGGA AGTGGCCATG AAGGTTGCCG AAAAGCTTAA GGAGACAGGT TTGTCTGCGG ATGTGATTTA TTGCAGATTT ATAAAGCCCC TTGATTCAAA TACCATTATA AATTCCGTAC TTAAAACAAA AAGACTTGTA ACAATAGAGG ATAATACCGT TGAGGGTGGA TTTGGAAGCA GAGTTTTGGA AACAATAAAC CAGAAGGGGA TAAATGTCAC TACAAGAATG TTTGGATATC CGGATGCTTT TATTCCTCAT GGCTCTATCA AAGAACTGGT GCATATGTAC AGACTGGATC CGGATTCCAT TTTCAATGAT GTTTTAAAAC TGATAAATAA AAGCAAAGTG AAAGAATTCC GAGCCATAAG ATAA
|
Protein sequence | MDSINFPDDI KKLNLEQLKQ LAGEIRSFLI EKVSKTGGHL ASNLGVVELT LALHRVFNTP EDKIIWDVGH QCYVHKIITG RKDRFDTIRK LGGLSGFPKS AESEYDAFNT GHSSTSISAA LGIAKARDLR KEKYSVVAVI GDGALTGGMA FEALNDAGRS PNNLIVVLND NEMSISKNVG GLSVYLSKIR TEPFYFKVKE DIDIILNKIP AIGKSAVKAL GRVKGTIKYM IMPGIVFEEL GFKYLGPIDG HNIAELENVL TRAKNTKGPV LVHVCTQKGR GYTYAEKNPA VFHGISPFEV ETGEVIANKV PGYSDVFGSE IVRIAEKEER VVALTAAMPH GTGLIKFSKR FPERFFDVGI AEQHAVTFGA GLAKNGMIPV IALYSSFLQR AYDQVVHDVA LQNLHVVFAI DRAGIVGEDG ETHQGIYDIS FLRHIPNMTI LAPCDYNELA KMLEYAVLEH SGPIAIRYPR GAGPEKLFDT PDIKLGQSLL ISEGNDVTIA AVGNKVEVAM KVAEKLKETG LSADVIYCRF IKPLDSNTII NSVLKTKRLV TIEDNTVEGG FGSRVLETIN QKGINVTTRM FGYPDAFIPH GSIKELVHMY RLDPDSIFND VLKLINKSKV KEFRAIR
|
| |