Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0169 |
Symbol | thiH |
ID | 7407160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 207920 |
End bp | 209353 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714571 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002572094 |
Protein GI | 222528212 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000784524 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAGAA AAGATGAATG GGAAAGAACT GAGTTTATAA ATGACCAGAT GGTCTATGAT ATTCTTGAAG AAGGTAGAAA AAATGTTGAC AGAGCAGAAG AAATAATTGA AAAAGCTTTG CAGTTAAATG GACTTGAGCC TCAGGAGGTT GCAACCCTTC TTTATATAGA GGACAAGGAC CTTTTAGAAA AGCTTTTTAA GGCGGCAAGG CAGGTAAAAG AAAGGATTTA TGGCAAGAGG ATTGTGCTTT TTGCACCTCT TTACATCAGC AACTTTTGTG TAAACAACTG CCGATACTGT GGTTATCACA GGTCAAATAC CAAAATGAAA AGAAGAAAAC TTACGATGGA TGAGATACGA AAAGAGGTTG AAATAATTGA ATCTCTTGGG CACAAAAGAA TTGCTCTTGA GCTTGGCGAA GATCCAAAAG AAGCGCCAAT TGAATATGTC ATAGATGCCA TAAAAACCAT ATATTCTGTT TACAAGGAAA AAGGAAATAT AAGAAGAGTA AATGTAAACA TTGCAGCAAC AACAATTGAA GAATATAGGA TGCTAAAAGA AGCAAAAATA GGTACTTATG TACTTTTCCA GGAAACATAC CACAGACCAA CCTATGAATA CATGCACCCC GAAGGCCCAA AGTCAGATTA CGACTGGCAT ACAATGGCAA TGGACAGAGC AATGCAAGGT GGAATTGATG ATGTTGGGCT TGGAGTGCTC TTTGGGCTTT ATGACTATAA ATTTGAGGTT GTTGGGCTAA TCTTGCATGC AAAGCATCTT GAGGAAAGAT TTGGAGTAGG ACCACACACA ATCTCTGTGC CAAGGATTAG ACCTGCCGAG GGTGTTGAGG TGACAAAAGA AAGGTATCCT TACCTTGTTT CTGATGATGA GTTTAAAAAG ATTGTTGCAA TAATAAGACT TGCTGTGCCC TACACTGGAA TGATTTTATC TACCCGTGAA AGACCAGGTT TTAGAGAAGA GGTAATTGAC CTTGGAATAT CGCAGATAAG CGCTGGGTCC TGCACGGGTG TTGGTGGCTA TACTCTTGAG TATGAAGAAA AATCCACGGG TAATTTAGAT GAAGACCTTG CACAGTTTGA GGTTGAAGAT AAAAGAAGTC CAGATGAGGT CATAAGAACA CTTTGCGAGG AGGGTTATAT TCCAAGCTAC TGTACAGCTT GTTACAGAAG AGGAAGAACT GGGGATTTAT TTATGCAGTA TGCAAAGACA GGTGACATTC AAGACTTTTG TACACCAAAT GCGCTTTTGA CTTTTATGGA GTATTTAGAG GACTATGGCT CTGAAAAGAC AAAAGAGGTT GGGCGAAAAA TTATATATGA GAGCTTGAAT CAAATAAAAG ATGAAAAAAT GCGCAAAGAA ACTGAAAAGA GGCTTGAGAT GATAAGAAAT GGTGTGAGAG ATTTATATTT CTAA
|
Protein sequence | MFRKDEWERT EFINDQMVYD ILEEGRKNVD RAEEIIEKAL QLNGLEPQEV ATLLYIEDKD LLEKLFKAAR QVKERIYGKR IVLFAPLYIS NFCVNNCRYC GYHRSNTKMK RRKLTMDEIR KEVEIIESLG HKRIALELGE DPKEAPIEYV IDAIKTIYSV YKEKGNIRRV NVNIAATTIE EYRMLKEAKI GTYVLFQETY HRPTYEYMHP EGPKSDYDWH TMAMDRAMQG GIDDVGLGVL FGLYDYKFEV VGLILHAKHL EERFGVGPHT ISVPRIRPAE GVEVTKERYP YLVSDDEFKK IVAIIRLAVP YTGMILSTRE RPGFREEVID LGISQISAGS CTGVGGYTLE YEEKSTGNLD EDLAQFEVED KRSPDEVIRT LCEEGYIPSY CTACYRRGRT GDLFMQYAKT GDIQDFCTPN ALLTFMEYLE DYGSEKTKEV GRKIIYESLN QIKDEKMRKE TEKRLEMIRN GVRDLYF
|
| |