Gene Tpet_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_1504 
SymbolthiH 
ID5170901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp1491720 
End bp1493141 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content48% 
IMG OID640564030 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001245088 
Protein GI148270628 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0434134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTATGT ATGTGTTTGT GAAAGAGCGT GTAGAGAGCA GATCTTTCAT ACCGGAAGAA 
AAGATATTTG AACTTCTGGA GAAAACGAAA AACCCGGATC CTGCAAGGGT GAGAGAGATC
ATCCAGAAGT CGCTGGACAA GAACAGGCTC GAGCCGGAAG AGACGGCCAC CCTTTTGAAT
GTGGAAGATC CAGAGCTTCT GGAGGAGATC TTCGAAGCAG CCCGCACTCT GAAAGAACGA
ATCTACGGAA ACAGAATAGT TCTCTTCGCA CCGCTGTACA TAGGAAACGA TTGTGTCAAC
GACTGTGTTT ACTGTGGTTT CAGAGTTTCC AACAAAGTGG TGGAAAGAAG AACGCTCACG
GAAGAGCAAT TGAAAGAAGA AGTCAGGGCA CTCGTTTCCC AAGGGCACAA AAGACTCATC
GTCGTCTATG GAGAGCACCC CAAGTATTCA CCAGAGTTCA TCGCAAGGAC GATCGACATC
GTTTACAACA CAAAGTACGG AAACGGCGAG ATCAGGCGTG TGAACGTCAA CGCTGCTCCT
CAAACGATTG AAGGCTACAA GATCATAAAG TCCGTGGGAA TCGGAACCTT CCAGATCTTT
CAGGAAACGT ATCACAGGGA AACGTATTTG AAACTCCATC CAAGAGGTCC GAAATCGAAC
TACAACTGGA GGCTCTATGG ACTGGACAGA GCAATGATGG CGGGAATCGA CGACGTTGGA
ATAGGAGCGC TCTTCGGCCT CTACGACTGG AAGTTCGAAG TGATGGGTCT CCTTTACCAC
ACGATACACC TCGAAGAGAG ATTCGGTGTG GGACCACACA CCATCTCTTT CCCGAGGATA
AAACCCGCGA TAAACACACC ATATTCGCAG AAACCGGAAC ACATCGTGAG CGACGAAGAC
TTCAAGAAGC TCGTCGCCAT CATAAGGCTT TCTGTTCCGT ACACGGGCAT GATTCTCACC
GCAAGAGAAC CCGCAAAACT CAGGGACGAG GTGATAAAGC TCGGTGTTTC ACAGATAGAC
GCCGGCTCCA GAATAGGAAT CGGAGCGTAC TCTCACAGAG AAGACGACGA GGACAGAAAG
AGACAGTTCA CGCTCGAAGA CCCAAGACCA CTCGATCAGG TGATGAGAAG CCTACTGAAA
GAGGGCTTTG TACCTTCATT CTGCACCGCG TGTTACAGGG CAGGAAGAAC TGGGGAACAC
TTCATGGAAT TTGCAATTCC CGGTTTCGTG AAGAACTTCT GCACACCGAA CGCCCTGTTC
ACACTTCAGG AATACCTCTG TGACTACGCA ACAGAAGAAA CAAGGAAGGT AGGAGAGGAG
GTTATTGAAA GAGAACTTCA GAAGATGAAT CCAAAGATCA GAGAGAGAGT TAGAGAGGGA
CTCGAAAAGA TAAAGCGCGG TGAGAGGGAT GTTAGATTTT AA
 
Protein sequence
MCMYVFVKER VESRSFIPEE KIFELLEKTK NPDPARVREI IQKSLDKNRL EPEETATLLN 
VEDPELLEEI FEAARTLKER IYGNRIVLFA PLYIGNDCVN DCVYCGFRVS NKVVERRTLT
EEQLKEEVRA LVSQGHKRLI VVYGEHPKYS PEFIARTIDI VYNTKYGNGE IRRVNVNAAP
QTIEGYKIIK SVGIGTFQIF QETYHRETYL KLHPRGPKSN YNWRLYGLDR AMMAGIDDVG
IGALFGLYDW KFEVMGLLYH TIHLEERFGV GPHTISFPRI KPAINTPYSQ KPEHIVSDED
FKKLVAIIRL SVPYTGMILT AREPAKLRDE VIKLGVSQID AGSRIGIGAY SHREDDEDRK
RQFTLEDPRP LDQVMRSLLK EGFVPSFCTA CYRAGRTGEH FMEFAIPGFV KNFCTPNALF
TLQEYLCDYA TEETRKVGEE VIERELQKMN PKIRERVREG LEKIKRGERD VRF