Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0370 |
Symbol | thiH |
ID | 7409300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 423117 |
End bp | 424226 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714756 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002572279 |
Protein GI | 222528397 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000304803 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAGT TTATAAGAAA AGCTGAAAAA GTTTGGGAAG AGTTTAAAGA TTATGTTCCC ACTTATGATG AGGTATGTGA AATCTTAGAA AAAGAAGTTG TAAATATTGA AGATGTAGCA AAACTTTTGA ATGTAGAAGA CAAAAACTCA ATCCTTCTCA TGGCAAGCAA AGCTAAAAAG CTCACAAGAG AAAACTTTGG CAAGGTCATC CTCTTGTATG CGCCGCTGTA TATCTCAAAC TACTGTCAAA ACGGATGTGT TTATTGCGGA TTTTCTTGCA GGAAAAATTA TAAAAGAGAA AAACTTGAGC TTGATGAAAT TGAAAATGAG CTAAGAAGTA TGAAAGAAGA GGGTATCGAC TCTGTTATAA TCCTCACAGG AGAGGATAGA ATATACTCTC CAGTTGACTA TATTAAACAG GCCTGCAAAA TAGCAACAGA ATATATGTCA GAAGTTTCGA TTGAGGTTTA TCCTCTTTTT GAAGAAGAAT ATAGAAGTCT TGCAAACGCT GGTGTTGTGG GAATAACCAT ATATCAGGAG ACATATCAAA AAGAAGATTA TGAGAAGCTA CATCCTTTTG GACCAAAAAG GGATTTTGAG TTTAGGCTCA CTGCTGTTGA AAGAGCCCTA TCTGCCGGGT TTCATGAAGC GTGCGTGGGA CCGCTTTTAG GGCTGTCTCA TCCCAAAAAA GATGTGCTTT GTACTTTGCT TTATGCAGAG TATCTTCTTG ACAGATTTCC CAAAGCAGAA ATTTCAGTTT CATTCCCGCG CGTAAGATCC GCAGGCTTAG ATTTTGTTCC AATATTTTCT GTTTCTGACA AGGAATTTAT AAAATTTTTG ATTGTTGCAA GGATTTATCT TCCAAGAGTT GGAATTGTGA TATCCACAAG AGAAGATGCG CGCCTTCGTG ATGCACTCAT TGATGTGTGC ATAACAAAGA TGTCGGCAGG TTCTAAAACA ACTGTCGGCG GATATGCAAC ACAGGAAGAA AAAGATGCCC AGTTTGAGGT TGAAGATAGA AGAACTGTTG CTGAGGTTGT AGAGAGTATA ATAAAAAAGG GACTGAGACC CGAGTTTACT AACTGGGTAA GGGGTGTTGG AAGTTTATGA
|
Protein sequence | MTEFIRKAEK VWEEFKDYVP TYDEVCEILE KEVVNIEDVA KLLNVEDKNS ILLMASKAKK LTRENFGKVI LLYAPLYISN YCQNGCVYCG FSCRKNYKRE KLELDEIENE LRSMKEEGID SVIILTGEDR IYSPVDYIKQ ACKIATEYMS EVSIEVYPLF EEEYRSLANA GVVGITIYQE TYQKEDYEKL HPFGPKRDFE FRLTAVERAL SAGFHEACVG PLLGLSHPKK DVLCTLLYAE YLLDRFPKAE ISVSFPRVRS AGLDFVPIFS VSDKEFIKFL IVARIYLPRV GIVISTREDA RLRDALIDVC ITKMSAGSKT TVGGYATQEE KDAQFEVEDR RTVAEVVESI IKKGLRPEFT NWVRGVGSL
|
| |