Gene Dtox_0182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0182 
SymbolthiH 
ID8427106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp202967 
End bp204376 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content48% 
IMG OID645032571 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_003189760 
Protein GI258513538 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTG CTGCGGCAGA TTTTATTGAT GATCAAAAAA TATGGGGGCT TCTGGAGGAA 
GCTAAGAATG CCGATAATAA AAAAGTAAAG GAAATTATTG AAAAAGCAGT AAAGGCACGG
GGGTTAACAC CCGGGGAGGC GGCAGTACTG CTTCACCTGG AAGATGCCGC GCTGCTGGAG
GAAATGTATG CGGCGGCAAA TAAAATTAAA GAGAGTATCT ATGGGCGCAG GCTGGTGCTT
TTTGCTCCTC TTTATATCAG CAATTACTGT GTTAACAGCT GTGTCTATTG CGGCTATCGC
ACCCACAGCA AGATTTTTCG CCGCAAACTA ACCATGGATG AAATCAAAGA AGAGGTTAAG
GTTCTGGAGG GACTGGGTCA TAAGCGTTTA GCTCTGGAAT TCGGTGAACA CCCAGTCGAG
TGTCCCATTG ATTATGTTTT GGAGGCCATT AGGACGATAT ATTCCGTTAA GGAGAAAAAC
GGCAGCATCA GGAGGGTTAA CGTAAACATT GCTGCCACTA CTGTTGAGGA ATACAGGCTT
TTAAAGGAGG CAGGGATCGG AACCTACATA CTTTTTCAGG AAACATATCA TAGGCAAACC
TACAGCCGAA TGCACCCGGC CGGTCCCAAG CGTGATTATG TCTGGCACAC CACGGCTATG
GACCGGGCCA TGCAGGGCGG TATCGATGAT GTCGGTGTGG GAGTCCTCTT TGGATTATAT
GATTATAAAT ATGAAGTTAT GGGGTTGTTA ATGCACGCGC TGCATTTGGA GGAGGCTTTT
GGTGTTGGGC CGCACACTAT TTCAGTGCCT AGGTTAAAAC CGGCAGCCGG GATGGATCTG
GAGCAATTTC CTCATCTGGT TTCCGACCGG GATTTTAAGA AATTAATCGC TGTCTTGCGA
TTGGCTGTTC CCTATACAGG AATGATTCTT TCCACCAGGG AGGGAGCGGA CTTTAGAGAC
GAACTGCTGT CTATAGGTAT CTCGCAGATT AGTGCCGGTT CTTGTACCGG TGTGGGCGGC
TACCGGAGCC AGTACCGGCA AGGTGCCGGC AAGGAAGAAG ATACCCGCCA ATTCAATGTG
GAGGATAACC GCAGTCCGGA CGAGGTCATC CGCAGTATAG CTGAATCGGG CTATATACCC
AGCTTTTGCA CCGCTTGCTA CCGTCAGGGG CGCACCGGGG ACCGTTTTAT GGCACTGGCC
AAGACAGGTG AGATTCAAAA TGTCTGTCAA CCCAATGCTA TTCTTACCTT CCAGGAGTTT
TTGCTGGATT ATGCCGCACC TGAAACCAGA ATTGCCGGCG ATAATTTCAT TAAGGAGCAG
ATCAACCAAA TACCGGATGG AATAATTCGC CGGAAAACAG AAGAAAAACT GGAGAAAATA
AAACAAGGTT GGCGAGACCT ATATTTTTAA
 
Protein sequence
MTVAAADFID DQKIWGLLEE AKNADNKKVK EIIEKAVKAR GLTPGEAAVL LHLEDAALLE 
EMYAAANKIK ESIYGRRLVL FAPLYISNYC VNSCVYCGYR THSKIFRRKL TMDEIKEEVK
VLEGLGHKRL ALEFGEHPVE CPIDYVLEAI RTIYSVKEKN GSIRRVNVNI AATTVEEYRL
LKEAGIGTYI LFQETYHRQT YSRMHPAGPK RDYVWHTTAM DRAMQGGIDD VGVGVLFGLY
DYKYEVMGLL MHALHLEEAF GVGPHTISVP RLKPAAGMDL EQFPHLVSDR DFKKLIAVLR
LAVPYTGMIL STREGADFRD ELLSIGISQI SAGSCTGVGG YRSQYRQGAG KEEDTRQFNV
EDNRSPDEVI RSIAESGYIP SFCTACYRQG RTGDRFMALA KTGEIQNVCQ PNAILTFQEF
LLDYAAPETR IAGDNFIKEQ INQIPDGIIR RKTEEKLEKI KQGWRDLYF