Gene Athe_0169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0169 
SymbolthiH 
ID7407160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp207920 
End bp209353 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content38% 
IMG OID643714571 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002572094 
Protein GI222528212 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000784524 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAGAA AAGATGAATG GGAAAGAACT GAGTTTATAA ATGACCAGAT GGTCTATGAT 
ATTCTTGAAG AAGGTAGAAA AAATGTTGAC AGAGCAGAAG AAATAATTGA AAAAGCTTTG
CAGTTAAATG GACTTGAGCC TCAGGAGGTT GCAACCCTTC TTTATATAGA GGACAAGGAC
CTTTTAGAAA AGCTTTTTAA GGCGGCAAGG CAGGTAAAAG AAAGGATTTA TGGCAAGAGG
ATTGTGCTTT TTGCACCTCT TTACATCAGC AACTTTTGTG TAAACAACTG CCGATACTGT
GGTTATCACA GGTCAAATAC CAAAATGAAA AGAAGAAAAC TTACGATGGA TGAGATACGA
AAAGAGGTTG AAATAATTGA ATCTCTTGGG CACAAAAGAA TTGCTCTTGA GCTTGGCGAA
GATCCAAAAG AAGCGCCAAT TGAATATGTC ATAGATGCCA TAAAAACCAT ATATTCTGTT
TACAAGGAAA AAGGAAATAT AAGAAGAGTA AATGTAAACA TTGCAGCAAC AACAATTGAA
GAATATAGGA TGCTAAAAGA AGCAAAAATA GGTACTTATG TACTTTTCCA GGAAACATAC
CACAGACCAA CCTATGAATA CATGCACCCC GAAGGCCCAA AGTCAGATTA CGACTGGCAT
ACAATGGCAA TGGACAGAGC AATGCAAGGT GGAATTGATG ATGTTGGGCT TGGAGTGCTC
TTTGGGCTTT ATGACTATAA ATTTGAGGTT GTTGGGCTAA TCTTGCATGC AAAGCATCTT
GAGGAAAGAT TTGGAGTAGG ACCACACACA ATCTCTGTGC CAAGGATTAG ACCTGCCGAG
GGTGTTGAGG TGACAAAAGA AAGGTATCCT TACCTTGTTT CTGATGATGA GTTTAAAAAG
ATTGTTGCAA TAATAAGACT TGCTGTGCCC TACACTGGAA TGATTTTATC TACCCGTGAA
AGACCAGGTT TTAGAGAAGA GGTAATTGAC CTTGGAATAT CGCAGATAAG CGCTGGGTCC
TGCACGGGTG TTGGTGGCTA TACTCTTGAG TATGAAGAAA AATCCACGGG TAATTTAGAT
GAAGACCTTG CACAGTTTGA GGTTGAAGAT AAAAGAAGTC CAGATGAGGT CATAAGAACA
CTTTGCGAGG AGGGTTATAT TCCAAGCTAC TGTACAGCTT GTTACAGAAG AGGAAGAACT
GGGGATTTAT TTATGCAGTA TGCAAAGACA GGTGACATTC AAGACTTTTG TACACCAAAT
GCGCTTTTGA CTTTTATGGA GTATTTAGAG GACTATGGCT CTGAAAAGAC AAAAGAGGTT
GGGCGAAAAA TTATATATGA GAGCTTGAAT CAAATAAAAG ATGAAAAAAT GCGCAAAGAA
ACTGAAAAGA GGCTTGAGAT GATAAGAAAT GGTGTGAGAG ATTTATATTT CTAA
 
Protein sequence
MFRKDEWERT EFINDQMVYD ILEEGRKNVD RAEEIIEKAL QLNGLEPQEV ATLLYIEDKD 
LLEKLFKAAR QVKERIYGKR IVLFAPLYIS NFCVNNCRYC GYHRSNTKMK RRKLTMDEIR
KEVEIIESLG HKRIALELGE DPKEAPIEYV IDAIKTIYSV YKEKGNIRRV NVNIAATTIE
EYRMLKEAKI GTYVLFQETY HRPTYEYMHP EGPKSDYDWH TMAMDRAMQG GIDDVGLGVL
FGLYDYKFEV VGLILHAKHL EERFGVGPHT ISVPRIRPAE GVEVTKERYP YLVSDDEFKK
IVAIIRLAVP YTGMILSTRE RPGFREEVID LGISQISAGS CTGVGGYTLE YEEKSTGNLD
EDLAQFEVED KRSPDEVIRT LCEEGYIPSY CTACYRRGRT GDLFMQYAKT GDIQDFCTPN
ALLTFMEYLE DYGSEKTKEV GRKIIYESLN QIKDEKMRKE TEKRLEMIRN GVRDLYF