Gene TRQ2_1553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1553 
SymbolthiH 
ID6093001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1563223 
End bp1564638 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content49% 
IMG OID642488753 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001739572 
Protein GI170289334 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGTGT TTGTGAAAGA GCGTGTAGAG AGCAGATCTT TCATACCGGA AGAAAAGATA 
TTTGAACTTC TGGAGAAAAC GAAAAACCCG GATCCTGCAA GGGTGAGAGA GATCATCCAG
AAGTCGCTGG ACAAGAACAG GCTCGAGCCG GAAGAGACGG CCACCCTTTT GAATGTGGAA
GATCCAGAGC TTCTGGAGGA GATATTCGAG GCGGCCCGCA CTCTGAAGGA GAGAATATAC
GGAAACAGGA TCGTTCTCTT TGCACCGCTC TACATAGGAA ACGACTGCGT CAACGACTGT
GTCTACTGCG GTTTCAGAGT CTCCAACAAA GTGGTGGAAA GAAAAACGCT CACGGAAGAA
CAGTTGAAAG AAGAAGTCAA AGCCCTCGTC TCCCAGGGCC ACAAAAGGCT CATAGTGGTC
TACGGAGAGC ATCCAAAGTA CTCTCCGGAG TTCATCGCAA GAACGATCGA CATCGTGTAC
AACACGAAGT ACGGCAACGG TGAGATCAGA AGGGTGAACG TCAACGCTGC ACCCCAGACG
ATAGAGGGCT ACAGGATCAT AAAGTCCGTG GGAATCGGTA CTTTCCAGAT CTTTCAGGAA
ACGTACCACA AAAAGACGTA CCTGAAACTC CATCCCAGGG GTCCCAAATC GAACTACAAC
TGGAGACTTT ACGGTCTGGA CAGAGCGATG ATGGCCGGTA TCGACGACGT AGGAATAGGC
GCCCTCTTTG GCCTTTACGA CTGGAAATTC GAGGTGATGG GACTTCTCTA CCACACGATC
CACCTCGAGG AGAGGTTCGG AGTGGGACCA CACACCATCT CCTTCCCAAG GATAAAACCT
GCCATAAACA CCCCATATTC ACAGAGGCCG GAACACATCG TGAGCGATGA GGACTTCAAA
AAACTCGTTG CCATCATACG ACTTTCTGTT CCATACACAG GAATGATCCT CACGGCAAGA
GAGCCCGCAA AACTCAGGGA TGAGGTCATA AAACTCGGTG TCTCACAGAT AGACGCCGGC
TCAAGAATAG GGATCGGAGC GTACTCTCAC AGAGAAGACG ACGAGGACAG GAAAAGGCAG
TTCACACTCG AAGATCCAAG ACCTCTCGAC CAGGTGATGA GAAGTCTTCT GAAAGAAGGT
TTTGTCCCAT CCTTCTGCAC CGCATGTTAC AGGGCAGGAA GAACGGGAGA ACACTTCATG
GAGTTTGCAA TCCCCGGTTT TGTGAAGAAC TTCTGTACAC CGAACGCTCT CTTCACGCTC
CAGGAGTACC TCTGTGACTA CGCAACGGAG GAAACAAGAA GAATAGGAGA AGAGGTCATA
GAAAAAGAAC TCCAGAAGAT GAATCCAAAG ATAAGAGAGA GAGTGAAAGA AGGCCTTGAA
AGAATAAAGC GCGGTGAGAG GGATGTCAGA TTTTAA
 
Protein sequence
MYVFVKERVE SRSFIPEEKI FELLEKTKNP DPARVREIIQ KSLDKNRLEP EETATLLNVE 
DPELLEEIFE AARTLKERIY GNRIVLFAPL YIGNDCVNDC VYCGFRVSNK VVERKTLTEE
QLKEEVKALV SQGHKRLIVV YGEHPKYSPE FIARTIDIVY NTKYGNGEIR RVNVNAAPQT
IEGYRIIKSV GIGTFQIFQE TYHKKTYLKL HPRGPKSNYN WRLYGLDRAM MAGIDDVGIG
ALFGLYDWKF EVMGLLYHTI HLEERFGVGP HTISFPRIKP AINTPYSQRP EHIVSDEDFK
KLVAIIRLSV PYTGMILTAR EPAKLRDEVI KLGVSQIDAG SRIGIGAYSH REDDEDRKRQ
FTLEDPRPLD QVMRSLLKEG FVPSFCTACY RAGRTGEHFM EFAIPGFVKN FCTPNALFTL
QEYLCDYATE ETRRIGEEVI EKELQKMNPK IRERVKEGLE RIKRGERDVR F