Gene Dtox_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1681 
Symbol 
ID8428647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1769228 
End bp1770328 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content50% 
IMG OID645034014 
Productthiazole biosynthesis protein ThiH 
Protein accessionYP_003191161 
Protein GI258514939 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000164161 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTTTTT ACGGAGAACT CCAAAGGTAT GAGAACTTTG ATTTTGAACT CTTTTTTAAA 
CAAGTTACTG ACGCCGGGAT TAAAAGAATT ATAGCCCAGC ACCGCCTTAA TGAAAGAGAT
TACCTGGCGC TGCTGTCTCC CCGGGCGGAA AATTTTCTGG AGGAAATGGC TCAAAAAGCT
CACCGCCTTA CTGTTCAGCA CTTTGGCAGG GTAATTTTCC TTTTTACTCC CATGTACCTG
GCCAACTATT GTGTCAATCA GTGTGTGTAC TGCGGGTTTC AGGTTCATAA TAGGCTGGAA
AGAAAGAAAC TTTCTCCCGC CGAAGTGGAA AAGGAAGCTA AAATTATTGC AGCTACCGGT
CTAAAGCATA TACTCATTCT TACCGGAGAG TCCAGGCAGG AATCCCCGGT TTCCTATATC
AGAGATTGTG TCGAGGTGCT GAAAAAATAT TTTACTTCTG TCAGCATAGA AATTTATCCA
CTGGAAGAAG ACGAGTATGC CGAGCTTATT GCTGCCGGGG TGGACGGTTT GACTATGTAC
CAGGAGGTAT ATAACGAGGA GGTTTATGCC GAACTGCATC CGGGTGGGCC GAAACGAAAT
TACCGCTTCC GGTTGGATGC TCCGGAGCGG GCCTGCCGGG CAGGAGTGAG GACAGTTAAT
GTGGGCGCCT TACTGGGACT GCATGACTGG CGAAGCGAGG CTTTTTTCAC AGGTCTGCAT
GCTGATTATC TCCAGAAAAA TTTTACGGAT GTTGAGGTCA GCATATCGCC GCCGCGGATG
CGCCCTCACC TGGGGGGCTT TCAACCCAGA GTTGAAGTGA GCGATCAAAA CCTGGTGCAG
TACCTACTGG CCTTCCGGCT CTTTATGCCG CGCGGCGGTA TTACTGTTTC CACCAGAGAG
AGGGCAGAAT TGCGGGATCA TCTTGTGCGG CTGGGCGCGA CCAAAATGTC GGCCGGTTCT
TGTACTGCTG TGGGTGGGCG GTCTGATCAG GAATCCACCG GCCAGTTTGA GATATCTGAT
GAGCGCAATG TGGTGGAGAT GGCGGACATG CTTTACTCTG TTGGTTACCA GCCGGTCTAT
AAAGATTGGC AGTCGTTTTG A
 
Protein sequence
MSFYGELQRY ENFDFELFFK QVTDAGIKRI IAQHRLNERD YLALLSPRAE NFLEEMAQKA 
HRLTVQHFGR VIFLFTPMYL ANYCVNQCVY CGFQVHNRLE RKKLSPAEVE KEAKIIAATG
LKHILILTGE SRQESPVSYI RDCVEVLKKY FTSVSIEIYP LEEDEYAELI AAGVDGLTMY
QEVYNEEVYA ELHPGGPKRN YRFRLDAPER ACRAGVRTVN VGALLGLHDW RSEAFFTGLH
ADYLQKNFTD VEVSISPPRM RPHLGGFQPR VEVSDQNLVQ YLLAFRLFMP RGGITVSTRE
RAELRDHLVR LGATKMSAGS CTAVGGRSDQ ESTGQFEISD ERNVVEMADM LYSVGYQPVY
KDWQSF