Gene Dtox_0218 details


Gene Information

Locus tag: Dtox_0218
Symbol:
ID: 8427142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 236578
End bp: 238053
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 42%
IMG OID: 645032605
Product: MazG family protein
Protein accession: YP_003189794
Protein GI: 258513572
COG category: [R] General function prediction only
COG ID: [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain
TIGRFAM ID: [TIGR00444] MazG family protein
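
The coordinates and lengths listed above agree with each other: read as 1-based inclusive positions, 236578 to 238053 spans 1476 bp, which is 492 codons, or 491 amino acids plus one stop codon. The check can be sketched in Python as below. The function name `check_cds_lengths` is hypothetical, and the inclusive 1-based coordinate convention is an assumption that happens to match the numbers in this record.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if CDS coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and exactly one terminal stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if gene_length_bp % 3 != 0:           # a CDS should be a whole number of codons
        return False
    # One codon is the stop codon and does not appear in the protein.
    return gene_length_bp // 3 - 1 == protein_length_aa


# Values copied from the Gene Information fields of this record.
print(check_cds_lengths(236578, 238053, 1476, 491))
```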


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.548157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATGTG ATATTTATGT TATTGGGCTG GGACCTGGGA CAAAAGAATA TCTGACTTTA 
GATGCTTGGA ACAAGCTGCG TCAGGCGGCT AGAGTTTTTC TTCGTACGGG GATACATCCG
ATTGTTCCCT GGCTTAGGGA GAAGGGTATT TTATTTAAGA CTTTTGACCA TTATTATGAT
GAAGCAGCGG ATTTTTCTCA AGTTTACTTA AGAATTGTTG AGACCTTACT GGAAGAAGCC
GAAAGAGGTC CTCTGGTGTA TGCAGTGCCG GGTCACCCGC TGGTGGCCGA GACATCAGTG
GAACTGCTTT TGTCTAAAGC TGTTCCGGTG GGTAAAACAG TTCATATTGT GCCCTGCGTC
AGCTTTTTGG ATGTATTGTC GGTAGCCTTA AATATAGACC CGGCTAACGG TCTTCATATT
TTGGACGGCT TACAACTTGA CACGCAAAGA CCTGTGCCTG ACGTGGCTAA TATTATTACT
CAGGTCTACA GTCGCATAAT TGCTTCAGAC GTTAAACTTT CGTTATTTCA GTACTATCCT
GATGAGCATT TGATTAAGGT GGTTAAGGCA GCGGGTGTTC CCGGTGAGGA GCGTATTGAG
GAGATACCTC TCTATGAGCT GGACAGGTTG GATTGGATAG ATCACCTGAC TAGTGTATAT
GTTCCCCCCT GTTCTAAAGC AGATATAATT AGTAGTTGTC TTTATCCGCT GGATCCAATA
ACCTCCGTAA TGGCGGCACT TAGGGCAGAA AACGGCTGTC CCTGGGATAG GGAACAAAAC
CATCATTCTC TGGGTACATA CATGTTGGAA GAGGTTTACG AGGTGTTGGA AGCTGTTAAT
GAAGGGGATA TGAATAAACT TTGTGAAGAG TTGGGAGACT TATTATTACA GATAGTTTTT
CATGCGCAGA TATCTCTTGA ACATAATGGT TTTGATATGA ATGACATAAT CGCGGTAATA
ACAGAAAAGA TGATCCGTCG TCATCCCCAT GTATTTTCAA CTGTTCTGGT TAAGGACAGT
GCTGAAGTTT TGGTTAATTG GGAGAAGATT AAAAAAGAAG AGCGCAAGGG GAAAGAGTCT
AAATCTAAAT TGGATGGGAT TCCAAAAGGG TTACCTGCTT TGGCGAGGGC AGCCAAGGTG
CAGTCAAAGG CTGCTTTGGT TGGTTTTGAT TGGCCTGATT GCAGTGGCGC TTTATTGAAA
GTCGATGAGG AACTTATTGA ATTAAAAGAA GCAATTAGTC TGAGTAATTC TGTACAGATT
CAAGGTGAAC TGGGTGATCT TTTCTTTGCG GTGGTTAATG TGGCCAGATT GTTAAAAGTG
GACAGTGAAG CTGCATTAAT TGCCACAGTA GAAAAATTTT GCAAGAGGTT TAAGTACATA
GAGGAAATGG TAAAAACCTC CGACAAGGAA TGGCGGCAAT TTACTCTTAA GGAATTGGAT
AACTGGTGGA ATGAGGCTAA AAAATTAGGT ATGTAA
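
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before any statistic such as GC content can be recomputed from it. A minimal Python sketch follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first sequence line is used here for illustration; pasting the full gene sequence in its place would allow a comparison with the reported GC content.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in seq; raises ValueError on empty input."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGAATGTG ATATTTATGT TATTGGGCTG GGACCTGGGA CAAAAGAATA TCTGACTTTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```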
 
Protein sequence
MECDIYVIGL GPGTKEYLTL DAWNKLRQAA RVFLRTGIHP IVPWLREKGI LFKTFDHYYD 
EAADFSQVYL RIVETLLEEA ERGPLVYAVP GHPLVAETSV ELLLSKAVPV GKTVHIVPCV
SFLDVLSVAL NIDPANGLHI LDGLQLDTQR PVPDVANIIT QVYSRIIASD VKLSLFQYYP
DEHLIKVVKA AGVPGEERIE EIPLYELDRL DWIDHLTSVY VPPCSKADII SSCLYPLDPI
TSVMAALRAE NGCPWDREQN HHSLGTYMLE EVYEVLEAVN EGDMNKLCEE LGDLLLQIVF
HAQISLEHNG FDMNDIIAVI TEKMIRRHPH VFSTVLVKDS AEVLVNWEKI KKEERKGKES
KSKLDGIPKG LPALARAAKV QSKAALVGFD WPDCSGALLK VDEELIELKE AISLSNSVQI
QGELGDLFFA VVNVARLLKV DSEAALIATV EKFCKRFKYI EEMVKTSDKE WRQFTLKELD
NWWNEAKKLG M
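
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the pipeline that generated this record. It builds the codon table from the usual TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":                      # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
print(translate("ATGGAATGTG ATATTTATGT TATTGGGCTG GGACCTGGGA CAAAAGAATA TCTGACTTTA"))
# MECDIYVIGLGPGTKEYLTL
print(translate("AACTGGTGGA ATGAGGCTAA AAAATTAGGT ATGTAA"))
# NWWNEAKKLGM
```

Both outputs match the corresponding start and end of the protein sequence listed above.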