Gene Dtox_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1645 
Symbol 
ID8428611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1724476 
End bp1725552 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content48% 
IMG OID645033978 
ProductRadical SAM domain protein 
Protein accessionYP_003191125 
Protein GI258514903 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000004914 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000624327 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGATCT TGAGTAATAT TTTTCCGCTG CTAAAAAAAG CAATAGCCGG TGGCAGGCTT 
AGCTTAGAGG AGGGTACTGC CCTCATGGAA ACAAATGACC TGCTTGCTTT GGGTCAGGCT
GCGGATATTA TTCGGCAGCG CTTGCACCCG GAACGGCAGG TAACTTTTGT AATAGACAGA
AATATTAATT ATACCAATGT TTGCCTGTCA CGCTGCAGGT TTTGTGCTTT TTATCGAGAT
CAAAATGCAC CGGATGCTTA CATAATCGGC TGGCAAGAAT TATATGAAAA AATTGCGGAG
ACAGTGGACG CAGGTGGCAC CGAACTTTTA ATACAAGGTG GTTTGCACCC TGATTTAACA
CTTGATTATT ATCTGGATAT GCTGCGTTAT ATCAAGAGCA ATTTTGATAT TCATATACAT
TCCTTTTCTC CTCCGGAAGT TATGCACATG GTAAAAGGCT CCGGACTGTC AATTAAGGAA
GTGCTGGAAA AACTCCGGGC CGCAGGTTTG GACTCTCTGC CCGGCGGCGG AGCCGAAATA
CTGGTTGACC GAGTGCGCGG CCAAATCAGC CCGGAAAAGA TTGGCTGGCA GGACTGGATG
AAGGTTATGT TGACCGCTCA CAGTATCGGC ATGAAAACTA CCGCTACGAT GATGTTCGGC
CATGTGGAAA CGAACGAAGA AAGAGTACTG CACATGATCA GGGTGAGAGA AGCTCAGGAT
CAGACTGGGG GCTTTACTGC TTTCATACCC TGGAGTTTTC AACCCAAAAA TACACAGTTA
AGCATGCCGG CAGCCAGTGG GGTGGAGTAT CTGAAAACGC TGGCTGTAGC CCGCCTGGTT
ATTGATAACG TTCCTAATAT CCAGGCTTCC TGGGTCACTC AGGGCGCCAA AATAGCTCAG
GTTGCTTTAA GCTTTGGAGC TAATGATTTC GGCAGCACCA TGCTGGAGGA AAATGTTGTG
CGGGCGGCCG GTGTAACTTA CCGTATAGCC TTGCGGGAGA TTATTCACTG CATAAAAAAT
GCCGGCTTCA CGCCTGCGCA GCGCACAACT GACTATAGGG TGCTTAGGAT GTTTTAG
 
Protein sequence
MKILSNIFPL LKKAIAGGRL SLEEGTALME TNDLLALGQA ADIIRQRLHP ERQVTFVIDR 
NINYTNVCLS RCRFCAFYRD QNAPDAYIIG WQELYEKIAE TVDAGGTELL IQGGLHPDLT
LDYYLDMLRY IKSNFDIHIH SFSPPEVMHM VKGSGLSIKE VLEKLRAAGL DSLPGGGAEI
LVDRVRGQIS PEKIGWQDWM KVMLTAHSIG MKTTATMMFG HVETNEERVL HMIRVREAQD
QTGGFTAFIP WSFQPKNTQL SMPAASGVEY LKTLAVARLV IDNVPNIQAS WVTQGAKIAQ
VALSFGANDF GSTMLEENVV RAAGVTYRIA LREIIHCIKN AGFTPAQRTT DYRVLRMF