Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmel_1599 |
Symbol | thiH |
ID | 5297879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermosipho melanesiensis BI429 |
Kingdom | Bacteria |
Replicon accession | NC_009616 |
Strand | - |
Start bp | 1585835 |
End bp | 1587250 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640769876 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001306828 |
Protein GI | 150021474 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACATAT TTACAAAGGA AAAATATTGT GAAAAAACAT TCATTAATAC TGAAGAAATA GAGGAATTAT TAGAAAATAC CAAGTTTCCA GATAGTGAAA AAATTCGCAA TATTCTCCGA AAATCCTTAA ATAAAGAAAA ACTCTCTCCA ATTGAAGTTG CCACATTACT AAACGCTAAT TCCAGAGAAC TATGGGAAGA AATTTTTGAT GCAGCAAGAA AGTTAAAAGA AAAAATTTAT GGAAACAGAA TTGTATTATT TGCACCGCTT TACATTGGTA ATGAATGCGT AAACGATTGC GAATACTGTG GTTTTAGAGT TTCAAACAAA AAGGTAGTTA GACAAACTCT TACTATTGAA AAATTAAAAG AAGAGATAAA AGCTTTAGTT AACAAAGGAC ACAAAAGATT AATCGTGGTA TATGGAGAAC ACCCCAAATA TTCACCGGAA TTTATCGCAA AAACAATAGA TGTAATATAT AACACAAAAT ATGGGAACGG TGAGATAAGA AGGGTAAACG TAAATGCCGC TCCTCAAACT ATAGAAGGGT ATAAGATTAT AAAAGAAGTT GGAATTGGAA CCTTTCAAAT ATTCCAAGAA ACATACCATA TACCTACATA CAAAAAAGTA CATCCCCGTG GACCAAAATC AAACTTTGCC TGGAGGTTAT ACGGCCTTGA TAGAGCAATG TTAGCCGGTA TTGATGATGT GGGAATCGGA GCACTTTTCG GATTATACGA TTGGAAATTC GAAGTTATGG GACTAATTTA TCACACTATC CACCTTGAAG AAAGATTTGG AGTCGGTCCA CATACGATTT CGTTTCCAAG AATTGAACCA GCAGTAGGTA CTCCTATTGC AGAAAGGCCA CCATATCAAG TTAATGATGA AGACTTTAAA AAAATAGTTG CGGTGTTAAG ACTTGCTGTA CCATATACAG GATTGATTTT AACTGCAAGA GAACCTGTGG AAATAAGGCG CGAAGTCTTA AAATTAGGCG TATCACAAAT AGACGCTGGC TCAAGTATAG GAGTAGGTTC ATACTCTGAT AAAGATCCTG AATTAATAAA GAAAAGTCAG TTTATCCTAG GAGACACACG TACCTTGGAT GAAGTAATCT ATGAATTACT CCAAGAAAAC TATATTCCAT CATTCTGTAC TGCATGTTAT AGAGCAGGAA GAACTGGTGA ACACTTTATG GAATTTGCAA TACCAGGTTT CGTAAAAAGA TTTTGTACTC CAAATGCTTT GTTTACTCTC AATGAATACT TAAACGATTA TGCATCAGAA AAAACTTATG AAATTGGGAA AAAAGTTATT CAGAAAGAAA TAGAAAAATT AAGTGGGAAA CAAAAAGAAG CTGTAATTAG TGGATTAGAT AAAATTAATA AAGGTGAGAG AGATGTCAGA TTCTAG
|
Protein sequence | MYIFTKEKYC EKTFINTEEI EELLENTKFP DSEKIRNILR KSLNKEKLSP IEVATLLNAN SRELWEEIFD AARKLKEKIY GNRIVLFAPL YIGNECVNDC EYCGFRVSNK KVVRQTLTIE KLKEEIKALV NKGHKRLIVV YGEHPKYSPE FIAKTIDVIY NTKYGNGEIR RVNVNAAPQT IEGYKIIKEV GIGTFQIFQE TYHIPTYKKV HPRGPKSNFA WRLYGLDRAM LAGIDDVGIG ALFGLYDWKF EVMGLIYHTI HLEERFGVGP HTISFPRIEP AVGTPIAERP PYQVNDEDFK KIVAVLRLAV PYTGLILTAR EPVEIRREVL KLGVSQIDAG SSIGVGSYSD KDPELIKKSQ FILGDTRTLD EVIYELLQEN YIPSFCTACY RAGRTGEHFM EFAIPGFVKR FCTPNALFTL NEYLNDYASE KTYEIGKKVI QKEIEKLSGK QKEAVISGLD KINKGERDVR F
|
| |