Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0600 |
Symbol | |
ID | 4808202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 736260 |
End bp | 736904 |
Gene Length | 645 bp |
Protein Length | 214 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106014 |
Product | thiamine biosynthesis protein ThiF |
Protein accession | YP_001037028 |
Protein GI | 125973118 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR02354] thiamine biosynthesis protein ThiF, family 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAT TTGAAATTGG GCTTATGTCT TATCTAGGCA GGGAAAAGCT TGAAAAGCTC GGTAAAGTAA AGGTGGGAAT CGCCGGGGCC GGAGGGCTTG GTTCCAATTG TGCATTGCAC CTGGTGAGAA GCGGTTTTAA AAATTTAAAA ATAGTGGATT TTGACGTGGT GGAACCATCT AATCTGAACA GGCAGTTTTT CTTCCTGGAT CAGTTGGGAC GTCCCAAGGT TTTGGCTCTA AAGGAGAATT TGGCAAGAAT CAACCCTGAT GTAATGATAG AGGCTATACA GCAAAAAGTA GACAGAAATA ATGCAAAGCA GCTGTTTGAT GACTGTGACG CTGTGGTGGA GGCTTTTGAT AAAGTTGAGT GCAAAACCAT GATGGCGGAG GCTTTTGCCT CGTCGGGAAA ATTTTTTGTA TGTGCATCGG GACTTGCCGG TTGGGGAAAC AGTGATGAAA TAAGAATACG AAAAGTACAT GACAGGTTCT ATATAGTCGG GGATTTTGTT ACCGGAGTCA AAGAAGGCAT ACCTCCCATT TCTCCCAGGG TTAATATTGC AGCAGCAAAA CAGGCGGACT TGATTTTAAG CTTCTTTTTG GATGATGCGG ATAATACCGT AAATGTGAAT TGTCAGGAAG GATGA
|
Protein sequence | MNEFEIGLMS YLGREKLEKL GKVKVGIAGA GGLGSNCALH LVRSGFKNLK IVDFDVVEPS NLNRQFFFLD QLGRPKVLAL KENLARINPD VMIEAIQQKV DRNNAKQLFD DCDAVVEAFD KVECKTMMAE AFASSGKFFV CASGLAGWGN SDEIRIRKVH DRFYIVGDFV TGVKEGIPPI SPRVNIAAAK QADLILSFFL DDADNTVNVN CQEG
|
| |