Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0599 |
Symbol | |
ID | 4808201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 735074 |
End bp | 736183 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106013 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_001037027 |
Protein GI | 125973117 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTT ATGAAAGATA TCTGGAGTAC AAAAATTTTG ATTTTGAAAA TTTCTTCGAC CAAGTAACGG ACAGGGATAT TATTAATATA ATAAACAAAG ACCGGCGGCT TTCGGAACTG GAATTCCTCA TGCTTCTTTC AAAAAAAGCT GTAAAATACC TTGAACCCTT GGCTCAAAAG GCAAACAGGA TTACGGTGCA GAATTTCGGA AAGGTCATAT TCCTGTACAC GCCGATGTAC CTTGCAAATT ACTGCGTAAA TCAATGTATT TACTGTGGTT TTAACATAAC CAACAATATA AAGCGAAGAA AACTTACTTT GGATGAGGTT GAAAAAGAAG CTTATGCAAT TTCATCCACG GGTCTTAGGC ATATTCTGAT TTTAACGGGA GAGTCCCGAA AGGAAAGTCC TGTTCAATAC ATAAAGGACT GCGTTAAAAT TCTTCAAAAG TATTTCAGGT CCATATCAAT AGAGGTTTAC CCTCTTGAGG AGAACGAGTA CGCCGAACTG ATAGAGGCGG GGGTTGACGG TCTTACCATC TATCAGGAGG TATATGATGA AGAAAAATAC AAGGCTCTTC ACCTGAAAGG TCCCAAAAGA AACTACTTAT ACAGGCTTGA TGCTCCTGAA AGGGCATGCA AGGCATCAAT GAGGAATGTA AACATAGGTG CCCTGCTGGG ACTTCATGAC TGGCGGACGG AGGCTTTTTA CACGGGACTT CACGCTGATT ACCTGCAAAA CAAGTATCCG GATGTGGAAA TTGGTTTGTC CCTTCCAAGA ATAAGGCCCC ATCCCTGTGG AAGTTTTGTA CCTGATTGCA AAGTGGAAGA CAGGGATCTG GTACAGATAA TGATAGCCTA CAGATTGTTT ATGCCAAGAG CCGGGATAGC AATTTCTACA AGAGAAAGAG AGAGCCTTAG AAATAATCTT ATTGGTCTGG GAGTTACCAA AATGTCTGCC GCGTCAAGTA CAGAGGTCGG AGGTCACACC CTTGGCGATA AAAGTGACGG ACAGTTTGAT GTAAATGACA GGCGCGGTGT TGAAGAGATG AGACAAATGA TATACAGCAA AGGTTATCAG CCGGTGTTTA AAGACTGGCA GGCAATATAA
|
Protein sequence | MSFYERYLEY KNFDFENFFD QVTDRDIINI INKDRRLSEL EFLMLLSKKA VKYLEPLAQK ANRITVQNFG KVIFLYTPMY LANYCVNQCI YCGFNITNNI KRRKLTLDEV EKEAYAISST GLRHILILTG ESRKESPVQY IKDCVKILQK YFRSISIEVY PLEENEYAEL IEAGVDGLTI YQEVYDEEKY KALHLKGPKR NYLYRLDAPE RACKASMRNV NIGALLGLHD WRTEAFYTGL HADYLQNKYP DVEIGLSLPR IRPHPCGSFV PDCKVEDRDL VQIMIAYRLF MPRAGIAIST RERESLRNNL IGLGVTKMSA ASSTEVGGHT LGDKSDGQFD VNDRRGVEEM RQMIYSKGYQ PVFKDWQAI
|
| |