Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_0182 |
Symbol | thiH |
ID | 8427106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 202967 |
End bp | 204376 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645032571 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_003189760 |
Protein GI | 258513538 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTTG CTGCGGCAGA TTTTATTGAT GATCAAAAAA TATGGGGGCT TCTGGAGGAA GCTAAGAATG CCGATAATAA AAAAGTAAAG GAAATTATTG AAAAAGCAGT AAAGGCACGG GGGTTAACAC CCGGGGAGGC GGCAGTACTG CTTCACCTGG AAGATGCCGC GCTGCTGGAG GAAATGTATG CGGCGGCAAA TAAAATTAAA GAGAGTATCT ATGGGCGCAG GCTGGTGCTT TTTGCTCCTC TTTATATCAG CAATTACTGT GTTAACAGCT GTGTCTATTG CGGCTATCGC ACCCACAGCA AGATTTTTCG CCGCAAACTA ACCATGGATG AAATCAAAGA AGAGGTTAAG GTTCTGGAGG GACTGGGTCA TAAGCGTTTA GCTCTGGAAT TCGGTGAACA CCCAGTCGAG TGTCCCATTG ATTATGTTTT GGAGGCCATT AGGACGATAT ATTCCGTTAA GGAGAAAAAC GGCAGCATCA GGAGGGTTAA CGTAAACATT GCTGCCACTA CTGTTGAGGA ATACAGGCTT TTAAAGGAGG CAGGGATCGG AACCTACATA CTTTTTCAGG AAACATATCA TAGGCAAACC TACAGCCGAA TGCACCCGGC CGGTCCCAAG CGTGATTATG TCTGGCACAC CACGGCTATG GACCGGGCCA TGCAGGGCGG TATCGATGAT GTCGGTGTGG GAGTCCTCTT TGGATTATAT GATTATAAAT ATGAAGTTAT GGGGTTGTTA ATGCACGCGC TGCATTTGGA GGAGGCTTTT GGTGTTGGGC CGCACACTAT TTCAGTGCCT AGGTTAAAAC CGGCAGCCGG GATGGATCTG GAGCAATTTC CTCATCTGGT TTCCGACCGG GATTTTAAGA AATTAATCGC TGTCTTGCGA TTGGCTGTTC CCTATACAGG AATGATTCTT TCCACCAGGG AGGGAGCGGA CTTTAGAGAC GAACTGCTGT CTATAGGTAT CTCGCAGATT AGTGCCGGTT CTTGTACCGG TGTGGGCGGC TACCGGAGCC AGTACCGGCA AGGTGCCGGC AAGGAAGAAG ATACCCGCCA ATTCAATGTG GAGGATAACC GCAGTCCGGA CGAGGTCATC CGCAGTATAG CTGAATCGGG CTATATACCC AGCTTTTGCA CCGCTTGCTA CCGTCAGGGG CGCACCGGGG ACCGTTTTAT GGCACTGGCC AAGACAGGTG AGATTCAAAA TGTCTGTCAA CCCAATGCTA TTCTTACCTT CCAGGAGTTT TTGCTGGATT ATGCCGCACC TGAAACCAGA ATTGCCGGCG ATAATTTCAT TAAGGAGCAG ATCAACCAAA TACCGGATGG AATAATTCGC CGGAAAACAG AAGAAAAACT GGAGAAAATA AAACAAGGTT GGCGAGACCT ATATTTTTAA
|
Protein sequence | MTVAAADFID DQKIWGLLEE AKNADNKKVK EIIEKAVKAR GLTPGEAAVL LHLEDAALLE EMYAAANKIK ESIYGRRLVL FAPLYISNYC VNSCVYCGYR THSKIFRRKL TMDEIKEEVK VLEGLGHKRL ALEFGEHPVE CPIDYVLEAI RTIYSVKEKN GSIRRVNVNI AATTVEEYRL LKEAGIGTYI LFQETYHRQT YSRMHPAGPK RDYVWHTTAM DRAMQGGIDD VGVGVLFGLY DYKYEVMGLL MHALHLEEAF GVGPHTISVP RLKPAAGMDL EQFPHLVSDR DFKKLIAVLR LAVPYTGMIL STREGADFRD ELLSIGISQI SAGSCTGVGG YRSQYRQGAG KEEDTRQFNV EDNRSPDEVI RSIAESGYIP SFCTACYRQG RTGDRFMALA KTGEIQNVCQ PNAILTFQEF LLDYAAPETR IAGDNFIKEQ INQIPDGIIR RKTEEKLEKI KQGWRDLYF
|
| |