Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1553 |
Symbol | thiH |
ID | 6093001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1563223 |
End bp | 1564638 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642488753 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001739572 |
Protein GI | 170289334 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGTGT TTGTGAAAGA GCGTGTAGAG AGCAGATCTT TCATACCGGA AGAAAAGATA TTTGAACTTC TGGAGAAAAC GAAAAACCCG GATCCTGCAA GGGTGAGAGA GATCATCCAG AAGTCGCTGG ACAAGAACAG GCTCGAGCCG GAAGAGACGG CCACCCTTTT GAATGTGGAA GATCCAGAGC TTCTGGAGGA GATATTCGAG GCGGCCCGCA CTCTGAAGGA GAGAATATAC GGAAACAGGA TCGTTCTCTT TGCACCGCTC TACATAGGAA ACGACTGCGT CAACGACTGT GTCTACTGCG GTTTCAGAGT CTCCAACAAA GTGGTGGAAA GAAAAACGCT CACGGAAGAA CAGTTGAAAG AAGAAGTCAA AGCCCTCGTC TCCCAGGGCC ACAAAAGGCT CATAGTGGTC TACGGAGAGC ATCCAAAGTA CTCTCCGGAG TTCATCGCAA GAACGATCGA CATCGTGTAC AACACGAAGT ACGGCAACGG TGAGATCAGA AGGGTGAACG TCAACGCTGC ACCCCAGACG ATAGAGGGCT ACAGGATCAT AAAGTCCGTG GGAATCGGTA CTTTCCAGAT CTTTCAGGAA ACGTACCACA AAAAGACGTA CCTGAAACTC CATCCCAGGG GTCCCAAATC GAACTACAAC TGGAGACTTT ACGGTCTGGA CAGAGCGATG ATGGCCGGTA TCGACGACGT AGGAATAGGC GCCCTCTTTG GCCTTTACGA CTGGAAATTC GAGGTGATGG GACTTCTCTA CCACACGATC CACCTCGAGG AGAGGTTCGG AGTGGGACCA CACACCATCT CCTTCCCAAG GATAAAACCT GCCATAAACA CCCCATATTC ACAGAGGCCG GAACACATCG TGAGCGATGA GGACTTCAAA AAACTCGTTG CCATCATACG ACTTTCTGTT CCATACACAG GAATGATCCT CACGGCAAGA GAGCCCGCAA AACTCAGGGA TGAGGTCATA AAACTCGGTG TCTCACAGAT AGACGCCGGC TCAAGAATAG GGATCGGAGC GTACTCTCAC AGAGAAGACG ACGAGGACAG GAAAAGGCAG TTCACACTCG AAGATCCAAG ACCTCTCGAC CAGGTGATGA GAAGTCTTCT GAAAGAAGGT TTTGTCCCAT CCTTCTGCAC CGCATGTTAC AGGGCAGGAA GAACGGGAGA ACACTTCATG GAGTTTGCAA TCCCCGGTTT TGTGAAGAAC TTCTGTACAC CGAACGCTCT CTTCACGCTC CAGGAGTACC TCTGTGACTA CGCAACGGAG GAAACAAGAA GAATAGGAGA AGAGGTCATA GAAAAAGAAC TCCAGAAGAT GAATCCAAAG ATAAGAGAGA GAGTGAAAGA AGGCCTTGAA AGAATAAAGC GCGGTGAGAG GGATGTCAGA TTTTAA
|
Protein sequence | MYVFVKERVE SRSFIPEEKI FELLEKTKNP DPARVREIIQ KSLDKNRLEP EETATLLNVE DPELLEEIFE AARTLKERIY GNRIVLFAPL YIGNDCVNDC VYCGFRVSNK VVERKTLTEE QLKEEVKALV SQGHKRLIVV YGEHPKYSPE FIARTIDIVY NTKYGNGEIR RVNVNAAPQT IEGYRIIKSV GIGTFQIFQE TYHKKTYLKL HPRGPKSNYN WRLYGLDRAM MAGIDDVGIG ALFGLYDWKF EVMGLLYHTI HLEERFGVGP HTISFPRIKP AINTPYSQRP EHIVSDEDFK KLVAIIRLSV PYTGMILTAR EPAKLRDEVI KLGVSQIDAG SRIGIGAYSH REDDEDRKRQ FTLEDPRPLD QVMRSLLKEG FVPSFCTACY RAGRTGEHFM EFAIPGFVKN FCTPNALFTL QEYLCDYATE ETRRIGEEVI EKELQKMNPK IRERVKEGLE RIKRGERDVR F
|
| |