Gene Information |
Locus tag | Apre_1631 |
Symbol | thiH |
ID | 8398443 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1775534 |
End bp | 1776691 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644995995 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_003153373 |
Protein GI | 257067117 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTA AAAGCGTAAA TGATTATTTC CCAGAAATGG ATATAATCGA CTCAGATATA AAAGAAAGAA TTGAAAAAGC TTACGATAGA GTAAAAGATA CAGATGTAAG TGAGGCTGAT GTCTTGGCAA GTCTTAATAA GAAAAATCTC TCAGAAAGAG ACTTCTACAA TCTCATAAGC GACAAGGCAG AAGATCACTT AGAAGAGATG GCAGAGCTTG CCAAAGATGC TAGGATTAGA TATTTTGGAA ACAATGTATG CCTATTTTCT CCGATCTATA TAGCTAATTA CTGCGAAAAT TCCTGCAGAT ATTGTGGTTT TAGGGCAAAA AGCGATATCA AAAGAGCTAA GCTTAACCTA GAAGAGATCG AAGAAGAGAT GAAGGCTTTG GCAGAAACTG GGATTGAAGA TGTCCTAATC CTTACTGGTG AAAGCGAGAG ATTTTCTTCT GTAGATTATA TAGGAGAAGC TTGTAGAATT GCCAGCAAAT ATTTTAAGGT AGTAGGAATA GAAGTATATC CCGCAAATGT TTCTTCTTAC GAGAAATTAA GAGAGGCTGG GGCGGATTTC GTTACAGTCT TCCAGGAATC CTACAACAAG AAAGCTTTCG ACTACTATCA TCCCGCAGGG CATAAGAGAA GCTTTAACTA TAGAATCGAC ACCCAAGAGC GAGCTCTTAT GGCAGGCTTT AGGGGAGTGG GTTTTGGGGC CCTCTTTGGA CTTTCTGATC CTATAGAAGA TGCTTTTAAG CTTGCCATCC ACGCCAAGGA AGTTCAAAGG AAATATCCTC AGGCAGAAAT TGCAATCTCT CTTCCAAGGA TTAGGCCAAC CCACGGGGCG GATGATACTC TAGACTTTAA TATCGTAGAT GATAAGAAAT TCTTCCAAAT CATGCTTGCA ATTAGAATGT TTCTGCCTTT TGCCTCTATT ACCCTTTCAA CACGTGAGTC AAAGGACTTT AGGGACTTGG CTGTGAAATA TGCTGCGACT AAAATCTCTG CATCAGTAGA TACTGCTATA GGACACAGGT CAAAGAAAAG TGCTGATGAG GGAGATGAGC AGTTTGAGAT TGACGATTCT CGTTCTACCG AACAAGCCTT TGAGGACCTC AAGAAAATCG GCATGACACC AGTCTTTACT GATTATATAA ATTTATAG
|
Protein sequence | MNIKSVNDYF PEMDIIDSDI KERIEKAYDR VKDTDVSEAD VLASLNKKNL SERDFYNLIS DKAEDHLEEM AELAKDARIR YFGNNVCLFS PIYIANYCEN SCRYCGFRAK SDIKRAKLNL EEIEEEMKAL AETGIEDVLI LTGESERFSS VDYIGEACRI ASKYFKVVGI EVYPANVSSY EKLREAGADF VTVFQESYNK KAFDYYHPAG HKRSFNYRID TQERALMAGF RGVGFGALFG LSDPIEDAFK LAIHAKEVQR KYPQAEIAIS LPRIRPTHGA DDTLDFNIVD DKKFFQIMLA IRMFLPFASI TLSTRESKDF RDLAVKYAAT KISASVDTAI GHRSKKSADE GDEQFEIDDS RSTEQAFEDL KKIGMTPVFT DYINL
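The coordinates, lengths, and sequences in this record can be cross-checked with a short script. The Python below is only an illustrative sketch, not part of the database record. The helper names `gene_length` and `translate` are hypothetical. It assumes that the coordinates are 1-based and inclusive, and that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in permitted start codons). The gene sequence as listed begins with ATG, so it is treated as already being in coding orientation and no reverse complement is taken, even though the gene lies on the minus strand. Only the first 30 bases and first 10 residues are used, to keep the example short.

```python
from itertools import product

# Values copied from the record above.
START_BP = 1775534
END_BP = 1776691
GENE_LENGTH_BP = 1158
PROTEIN_LENGTH_AA = 385

# First 30 bases of the gene sequence and first 10 residues of the protein.
GENE_PREFIX = "ATGAATATTAAAAGCGTAAATGATTATTTC"
PROTEIN_PREFIX = "MNIKSVNDYF"

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate complete codons of a coding-orientation DNA string.

    Translation stops at the first stop codon; trailing bases that do not
    form a complete codon are ignored.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1158
    print(GENE_LENGTH_BP // 3 - 1)         # 385 (one codon is the stop)
    print(translate(GENE_PREFIX))          # MNIKSVNDYF
```

The protein length is one less than the codon count because the final codon, TAG, is a stop codon and is not translated.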