Gene Dtox_4262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4262 
Symbol 
ID8431276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4426311 
End bp4427894 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content43% 
IMG OID645036454 
Producttype I restriction-modification system, M subunit 
Protein accessionYP_003193552 
Protein GI258517330 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACTA CGAGGAAAGA CATCGAGGCA GCTATTTGGA GAGGTGCTAA TACTTTCAGA 
GGAGCTATAG ATGCGGCCAA TTATAAAGAC TATATTTTGC CTATGCTTTT TGTAAAGTAT
CTGAGTGATT CTTATCTGGA AAAAGTAGAA AAGCTAGAGC AAAAATATAA TGACCCGGTG
AGGGCAGAAA GAGCGATAAA CAGACTTCCC TTTGCCATTA AGGAAAAACA CAGGTTTTCC
TGGCTTTATC AAAACAGGTA TAACGACAAC CTGGGGGAAC TCATCAATAT TGCTTTACGT
GGGATAGAAG ACGATAATCC CTCACTGTTT ACCGGGGTAT TTAGAAATAT TGATTTTAAC
AGCGAAGCCA TGCTGGGCAA TCATAATCAA AAAAACACCA GGTTAAGGGA ATTGCTTGAG
GATTTTGAAC CTCTTGATTT GCGTCCTTCC GCTATTGAGC CCGAGGAAGG CAAGGTGGCG
GCTGATACCA TAGGCGATGC ATATGAATAT ATGATCGGAG AGTTTGCCAG ACAGGCAGGT
AAAAAAGCCG GATCTTTTTT TACCCCGTCT GAGGTATCCG AGCTGATAGC ACGCATCGTC
AATCCCAAAA TAAGCGATAC CATGTACGAT CCGACGTGCG GTTCCGGATC TCTGCTCATC
AGAACCGGAA AAAAAGCCAT TGAAAAGGAA AATGGCAATA TAAAGACCCT GGCCCTCTAT
GGGCAGGAAA TGAACGGGTC TTCCTGGTCT ATGGCCAAAA TGAATATGTT TCTTCATGAA
ATAATGGATG CCAGGATTGC CTGGGGTGAC TCACTGGCTA ACCCCATGCA TTTGGATCCC
GATGGAAATC TTATGCAGTT TGATGTTATC GTAGCCAATA TGCCTTTTTC TCAGGATAAA
TGGGCGGCGG GATTTAATAC CGGTGGCGAA ATGACCGGCA AAGGCAAACA ATTTAAAATG
GAAGCATCCC TTGATAAATT TCACCGCTTT GACTGGGGAG TTCCCCCTGC CAGCAAAGGT
GATTGGGCAT TTTTGCTGCA TATGATAGCC AGTCTTAAAA GCGGTGGCAG GATTGCCGCC
GTTGCTCCCC ATGGTGTTTT GTTTAGAGGT GCGTCCGAAG GAAGGATAAG GCAGGCTGTA
ATTGAGAGAA ATCTATTGGA CGCAGTTATT GGTCTTCCTG CCAACCTGTT TTACGGTACC
GGTATTCCGG CCTGTATCCT GGTTTTTAAG AAAAACCGCA ACCGTAATGA TGTGCTTTTT
ATTGACGCGT CCGGCAAAGA TGAAAAGGGC CATCTTCGCT ACAGGAAGGA TAAAAATCAG
AACAGGTTGG AAACAAAACA CATAGAGGAT ATTGTAAAAG CTTATGAGAC TCGTTCGGAT
ATTATAAAAC TAGCTCATGT GGCAACCTTG GATGAGATAA AAGCCAACGA ATATAATCTT
AATATCCCTC GGTATGTTGA TACTTTTGAG GAAGAAGACC TGGTGAATAT AGAAGAGGTA
AAAAACAATA TAGCCAATAT CCAAAAGGAG CTTGCCGAAG TGGAAGCACA GATGGCCGAA
TATCTTAAGG AGTTGGGACT ATGA
 
Protein sequence
MATTRKDIEA AIWRGANTFR GAIDAANYKD YILPMLFVKY LSDSYLEKVE KLEQKYNDPV 
RAERAINRLP FAIKEKHRFS WLYQNRYNDN LGELINIALR GIEDDNPSLF TGVFRNIDFN
SEAMLGNHNQ KNTRLRELLE DFEPLDLRPS AIEPEEGKVA ADTIGDAYEY MIGEFARQAG
KKAGSFFTPS EVSELIARIV NPKISDTMYD PTCGSGSLLI RTGKKAIEKE NGNIKTLALY
GQEMNGSSWS MAKMNMFLHE IMDARIAWGD SLANPMHLDP DGNLMQFDVI VANMPFSQDK
WAAGFNTGGE MTGKGKQFKM EASLDKFHRF DWGVPPASKG DWAFLLHMIA SLKSGGRIAA
VAPHGVLFRG ASEGRIRQAV IERNLLDAVI GLPANLFYGT GIPACILVFK KNRNRNDVLF
IDASGKDEKG HLRYRKDKNQ NRLETKHIED IVKAYETRSD IIKLAHVATL DEIKANEYNL
NIPRYVDTFE EEDLVNIEEV KNNIANIQKE LAEVEAQMAE YLKELGL