Gene Dbac_0775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_0775 
SymbolthiH 
ID8376430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp852610 
End bp853722 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content67% 
IMG OID645000015 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_003157310 
Protein GI256828582 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTCC TGCCCGAGGC CCTGCGCCTG AGCCAAACGC CCCTGCAGCC GCATTTCGAG 
GCCGTGCGCG GGAATGACGT GCGCCGCGTT GTCGGCCAGG AGCGCGTCGA CGCGTCCGGA
TTCCTGGCCC TGCTCTCCCC GGCGGCGGCC CCGCATCTGG AGGCCATGGG CAGACGCGCC
CATGCGCTGA CCCTGCGCAA TTTCGGCCGC ACCATCAGCC TCTTCACTCC CCTTTACGTC
TCCAACCACT GCGCCAACCA CTGCCGCTAC TGCGGGTTCG CGGCCCCGAA CACGATCCCG
CGCACCCAGC TCAGCCTCGA TGAAGTCCGG GCGGAAGGAC AGGCCATCGC CGCCACGGGC
CTGAAGCATC TGCTCCTTTT GACCGGCGAG GCCCCGCGCA AGGCGGGCGT TGAGTATCTG
GAAGCGTGCG TGCGGGTCCT GCGCCCCTTG TTTCCGTCCA TTTCCTTGGA AGTCTATCCC
ATGGAAACGG CTGATTACGC GAGGCTGGTG CAGGCGGGCG TGGACGGCCT GACCGTGTTT
CAGGAAACCT ACGACCCGGT CCTTTATGCC CAGCTGCACC CGGCCGGACC CAAGCGGGAC
TATGCCTTCC GCCTGAACAC CCCGCAACGC GGAGCCGAGG CGGGCATGCG CGTGGTCAAC
ATCGGCGCGC TGCTGGGCCT GACCGACTGG CGGCAGGAAA TTTACGCCAC CGGCCTGCAC
GCGGCCTGGC TGCAAAAGCG CTACCCCGGC GTGGATGTGG CCGTGTCCCT GCCGCGCATG
CGCCCCCATG CCGGAGCGTT TCAACCGGCG TGCATTGTTT CCGACCGGGA ACTGGTGCAG
GCCATGACCG CCCTGCGCAT CTTCCTGCCG CGCCTGTCCA TCACCATCTC CACCCGTGAA
GCGCCGGATT TTCGCGACAA CATCCTGCCG CTGGGCGTGA CGCGCATGTC GGCGGGAGTC
AGCACCGCCG TGGGCGGGCA CGCCAAACCC GCCGAAACCG GGCAGTTCGA GATCTCCGAT
GCGCGCAGCG TGGACGAGAT GAAGGAGTCG TTGCGCGCTC GCGGATACCA GGCCGTCTTC
AAAGACTGGG AGCCGCTGGA GGGGAGCGCG TGA
 
Protein sequence
MSFLPEALRL SQTPLQPHFE AVRGNDVRRV VGQERVDASG FLALLSPAAA PHLEAMGRRA 
HALTLRNFGR TISLFTPLYV SNHCANHCRY CGFAAPNTIP RTQLSLDEVR AEGQAIAATG
LKHLLLLTGE APRKAGVEYL EACVRVLRPL FPSISLEVYP METADYARLV QAGVDGLTVF
QETYDPVLYA QLHPAGPKRD YAFRLNTPQR GAEAGMRVVN IGALLGLTDW RQEIYATGLH
AAWLQKRYPG VDVAVSLPRM RPHAGAFQPA CIVSDRELVQ AMTALRIFLP RLSITISTRE
APDFRDNILP LGVTRMSAGV STAVGGHAKP AETGQFEISD ARSVDEMKES LRARGYQAVF
KDWEPLEGSA