Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_0775 |
Symbol | thiH |
ID | 8376430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | - |
Start bp | 852610 |
End bp | 853722 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645000015 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_003157310 |
Protein GI | 256828582 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTCC TGCCCGAGGC CCTGCGCCTG AGCCAAACGC CCCTGCAGCC GCATTTCGAG GCCGTGCGCG GGAATGACGT GCGCCGCGTT GTCGGCCAGG AGCGCGTCGA CGCGTCCGGA TTCCTGGCCC TGCTCTCCCC GGCGGCGGCC CCGCATCTGG AGGCCATGGG CAGACGCGCC CATGCGCTGA CCCTGCGCAA TTTCGGCCGC ACCATCAGCC TCTTCACTCC CCTTTACGTC TCCAACCACT GCGCCAACCA CTGCCGCTAC TGCGGGTTCG CGGCCCCGAA CACGATCCCG CGCACCCAGC TCAGCCTCGA TGAAGTCCGG GCGGAAGGAC AGGCCATCGC CGCCACGGGC CTGAAGCATC TGCTCCTTTT GACCGGCGAG GCCCCGCGCA AGGCGGGCGT TGAGTATCTG GAAGCGTGCG TGCGGGTCCT GCGCCCCTTG TTTCCGTCCA TTTCCTTGGA AGTCTATCCC ATGGAAACGG CTGATTACGC GAGGCTGGTG CAGGCGGGCG TGGACGGCCT GACCGTGTTT CAGGAAACCT ACGACCCGGT CCTTTATGCC CAGCTGCACC CGGCCGGACC CAAGCGGGAC TATGCCTTCC GCCTGAACAC CCCGCAACGC GGAGCCGAGG CGGGCATGCG CGTGGTCAAC ATCGGCGCGC TGCTGGGCCT GACCGACTGG CGGCAGGAAA TTTACGCCAC CGGCCTGCAC GCGGCCTGGC TGCAAAAGCG CTACCCCGGC GTGGATGTGG CCGTGTCCCT GCCGCGCATG CGCCCCCATG CCGGAGCGTT TCAACCGGCG TGCATTGTTT CCGACCGGGA ACTGGTGCAG GCCATGACCG CCCTGCGCAT CTTCCTGCCG CGCCTGTCCA TCACCATCTC CACCCGTGAA GCGCCGGATT TTCGCGACAA CATCCTGCCG CTGGGCGTGA CGCGCATGTC GGCGGGAGTC AGCACCGCCG TGGGCGGGCA CGCCAAACCC GCCGAAACCG GGCAGTTCGA GATCTCCGAT GCGCGCAGCG TGGACGAGAT GAAGGAGTCG TTGCGCGCTC GCGGATACCA GGCCGTCTTC AAAGACTGGG AGCCGCTGGA GGGGAGCGCG TGA
|
Protein sequence | MSFLPEALRL SQTPLQPHFE AVRGNDVRRV VGQERVDASG FLALLSPAAA PHLEAMGRRA HALTLRNFGR TISLFTPLYV SNHCANHCRY CGFAAPNTIP RTQLSLDEVR AEGQAIAATG LKHLLLLTGE APRKAGVEYL EACVRVLRPL FPSISLEVYP METADYARLV QAGVDGLTVF QETYDPVLYA QLHPAGPKRD YAFRLNTPQR GAEAGMRVVN IGALLGLTDW RQEIYATGLH AAWLQKRYPG VDVAVSLPRM RPHAGAFQPA CIVSDRELVQ AMTALRIFLP RLSITISTRE APDFRDNILP LGVTRMSAGV STAVGGHAKP AETGQFEISD ARSVDEMKES LRARGYQAVF KDWEPLEGSA
|
| |