Gene Information |
Locus tag | Cthe_0601 |
Symbol | |
ID | 4808203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 736904 |
End bp | 737974 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106015 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_001037029 |
Protein GI | 125973119 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
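
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the terminal stop codon. The record does not state either convention, but both reproduce its figures. The function names are illustrative and not part of any database API.

```python
# Values are copied from the record above; function names are illustrative.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(736904, 737974)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```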
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTGG ACAATCTTTA CAGAGTGCTG GATGCCAATG TCAACAGGAC ATCGGAAGGG CTTAGGGTTT TGGAGGATTT GGCAAGATTC TGTTATAATG ACAGACTGCT TTCAAAGAGG ATTAAGGAAC TAAGACACAG TGTAAGAAAG AATATTGCAG GATTGGTTCC AAACCTTATA AGCAGCAGGG ATTCGGTGAA TGATGTGGGT TTAAAAACGT CAATGGAGAT GGATATCGAC CGAAAAGCCT CACTTTTGGA TCTTGCCCGG GCAAATTTCA AGCGGGTTCA GGAAGCTTTG AGGACTGTGG AGGAAAGCCT TAAAGTTTTG AATGAAAATG ACCTTTCCAA GTTTTATGAA AGTTGCAGGT TTGAAACGTA CAGCATTGAG AAGGAATATT TTAAAGTTTT GACTTTCGAA AATAAGAAAG GCAGATTGAA TGAAATAATA ACCGGTCTTT ATTGTATTAC ATCGGAAGAA CACTCCAAAG GGCGCAGTAA TATTGAGGTT GTGGAGAAAA TGATAAAGGC CGGGGTAAAG ATAATTCAAT ACAGGGAAAA AAAGAAGAGT CTTTTGGAAA AATACAACGA ATGCAAAAAA ATAAGGGAAA TGACCTTAGA TTCGGGCGTT ACATTTATCG TAAACGACAA CATTGATATT GCAATGATGG TAAAGGCCGA CGGAGTACAT ATAGGTCAGG ATGATCTTCC CATAGAAAAA GTAAGAGAGC TTGTGGGGGA TGAGATGATT ATCGGGATAT CCACCCATTC TCCAACGCAG GCGGAAGACG CGGTAAGACG CGGAGCTGAT TATATAGGAG TGGGTCCTCT TTACAGAACA TATACAAAGG AGGATGTCTG CGAACCTGTA GGGCTTGAGT ACCTTGACTA TGTTGTGAAG AACATAAATA TTCCCTATGT TGCCATAGGT GGCATAAAGG AACACAACAT GGATGAGGTT TTGGCCCGGG GAGCCCGGTG TATAGCCATG GTTACAGAGA TTGTGGGCGC GGATGACATA GAAGAAAAAA TTTCCAAAGT AAAATCAAAA TTTTCGAGAG GGGTTTTATA A
|
Protein sequence | MGLDNLYRVL DANVNRTSEG LRVLEDLARF CYNDRLLSKR IKELRHSVRK NIAGLVPNLI SSRDSVNDVG LKTSMEMDID RKASLLDLAR ANFKRVQEAL RTVEESLKVL NENDLSKFYE SCRFETYSIE KEYFKVLTFE NKKGRLNEII TGLYCITSEE HSKGRSNIEV VEKMIKAGVK IIQYREKKKS LLEKYNECKK IREMTLDSGV TFIVNDNIDI AMMVKADGVH IGQDDLPIEK VRELVGDEMI IGISTHSPTQ AEDAVRRGAD YIGVGPLYRT YTKEDVCEPV GLEYLDYVVK NINIPYVAIG GIKEHNMDEV LARGARCIAM VTEIVGADDI EEKISKVKSK FSRGVL
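
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand in the NCBI `TCAG` ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed, so the sketch ignores start-codon handling; this gene begins with ATG, so that is harmless here. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Amino-acid assignments are
# identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 21 bases of the gene sequence above (both in frame).
head = "ATGGGATTGG ACAATCTTTA CAGAGTGCTG"
tail = "TTTTCGAGAG GGGTTTTATA A"
print(translate(head))  # MGLDNLYRVL
print(translate(tail))  # FSRGVL
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared with the GC content reported in the record.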