Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1504 |
Symbol | thiH |
ID | 5170901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 1491720 |
End bp | 1493141 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640564030 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001245088 |
Protein GI | 148270628 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0434134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTATGT ATGTGTTTGT GAAAGAGCGT GTAGAGAGCA GATCTTTCAT ACCGGAAGAA AAGATATTTG AACTTCTGGA GAAAACGAAA AACCCGGATC CTGCAAGGGT GAGAGAGATC ATCCAGAAGT CGCTGGACAA GAACAGGCTC GAGCCGGAAG AGACGGCCAC CCTTTTGAAT GTGGAAGATC CAGAGCTTCT GGAGGAGATC TTCGAAGCAG CCCGCACTCT GAAAGAACGA ATCTACGGAA ACAGAATAGT TCTCTTCGCA CCGCTGTACA TAGGAAACGA TTGTGTCAAC GACTGTGTTT ACTGTGGTTT CAGAGTTTCC AACAAAGTGG TGGAAAGAAG AACGCTCACG GAAGAGCAAT TGAAAGAAGA AGTCAGGGCA CTCGTTTCCC AAGGGCACAA AAGACTCATC GTCGTCTATG GAGAGCACCC CAAGTATTCA CCAGAGTTCA TCGCAAGGAC GATCGACATC GTTTACAACA CAAAGTACGG AAACGGCGAG ATCAGGCGTG TGAACGTCAA CGCTGCTCCT CAAACGATTG AAGGCTACAA GATCATAAAG TCCGTGGGAA TCGGAACCTT CCAGATCTTT CAGGAAACGT ATCACAGGGA AACGTATTTG AAACTCCATC CAAGAGGTCC GAAATCGAAC TACAACTGGA GGCTCTATGG ACTGGACAGA GCAATGATGG CGGGAATCGA CGACGTTGGA ATAGGAGCGC TCTTCGGCCT CTACGACTGG AAGTTCGAAG TGATGGGTCT CCTTTACCAC ACGATACACC TCGAAGAGAG ATTCGGTGTG GGACCACACA CCATCTCTTT CCCGAGGATA AAACCCGCGA TAAACACACC ATATTCGCAG AAACCGGAAC ACATCGTGAG CGACGAAGAC TTCAAGAAGC TCGTCGCCAT CATAAGGCTT TCTGTTCCGT ACACGGGCAT GATTCTCACC GCAAGAGAAC CCGCAAAACT CAGGGACGAG GTGATAAAGC TCGGTGTTTC ACAGATAGAC GCCGGCTCCA GAATAGGAAT CGGAGCGTAC TCTCACAGAG AAGACGACGA GGACAGAAAG AGACAGTTCA CGCTCGAAGA CCCAAGACCA CTCGATCAGG TGATGAGAAG CCTACTGAAA GAGGGCTTTG TACCTTCATT CTGCACCGCG TGTTACAGGG CAGGAAGAAC TGGGGAACAC TTCATGGAAT TTGCAATTCC CGGTTTCGTG AAGAACTTCT GCACACCGAA CGCCCTGTTC ACACTTCAGG AATACCTCTG TGACTACGCA ACAGAAGAAA CAAGGAAGGT AGGAGAGGAG GTTATTGAAA GAGAACTTCA GAAGATGAAT CCAAAGATCA GAGAGAGAGT TAGAGAGGGA CTCGAAAAGA TAAAGCGCGG TGAGAGGGAT GTTAGATTTT AA
|
Protein sequence | MCMYVFVKER VESRSFIPEE KIFELLEKTK NPDPARVREI IQKSLDKNRL EPEETATLLN VEDPELLEEI FEAARTLKER IYGNRIVLFA PLYIGNDCVN DCVYCGFRVS NKVVERRTLT EEQLKEEVRA LVSQGHKRLI VVYGEHPKYS PEFIARTIDI VYNTKYGNGE IRRVNVNAAP QTIEGYKIIK SVGIGTFQIF QETYHRETYL KLHPRGPKSN YNWRLYGLDR AMMAGIDDVG IGALFGLYDW KFEVMGLLYH TIHLEERFGV GPHTISFPRI KPAINTPYSQ KPEHIVSDED FKKLVAIIRL SVPYTGMILT AREPAKLRDE VIKLGVSQID AGSRIGIGAY SHREDDEDRK RQFTLEDPRP LDQVMRSLLK EGFVPSFCTA CYRAGRTGEH FMEFAIPGFV KNFCTPNALF TLQEYLCDYA TEETRKVGEE VIERELQKMN PKIRERVREG LEKIKRGERD VRF
|
| |