Gene Dtox_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4202 
Symbol 
ID8431216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4380162 
End bp4383266 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content43% 
IMG OID645036395 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_003193493 
Protein GI258517271 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACC ATACCATATT TGACGAACGC CCTGAAAGTC AGGAGAGGGC AATTAAAGTA 
TTGGAAAAGT TAGGCTATCA GTATGTACCC CGGTCACGGG CAGAAATTTT TCGAGGACGG
CTTTCTCATG TTTTATTTCC TGATGTTTTG CGAGAGTTTA TGCACCGGCA GTCTTTCGTT
TATAGAGGAA AGCAAACGCC TTTTTCCGAA CGTTCTATCG GCAAGGCCAT TAAGGATATC
GACGTGCCGC TTATTTCCGG GTTGATGTCG GCCGGCAAAA ACATCTATGA TATGCTTATT
TCCGGCAACA GCTATGAGGA GGAATTATTT GATGGAGGCC GTCAGTCTTT TGATCTAAAG
TTTATTGACT GGGAACATCC GGAAAACAAT CTATGGCAGG TGACAGATGA ATTTTCAGTG
GAGCGGGCGG ATGGCAAATA TGTGTGGCCG GATATAGTGC TGCTTGTCAA TGGTCTTCCT
CTGGTGGTGA TCGAGTGCAA AAAATCCGGT ATTGATGTGG AAAAAGGAGT TGCTCAAAAT
ACACGCAACT GGCAACCCGA CTATATTCCC CATCTGTTTA AATATGCGCA AATCATTATG
GCCATGAATC TAAACACTGT AAAATATGGA ACTTGTGGTA CGACTGCTGA GCATTTTAGC
GTCTGGCGGG AAAAAAATTA TCAATGGCAG CAGGAAAAAT GCAGCAATGT CAGCCCTGAC
GGCAAAGTCA CTGCACAGGA CAGAACTATT GTTTCCATAC TCTCCAAAGA GAGATTGCTG
GAATTGATGC GTTATTTTAT CCTATATGAC AATAATGTCA AGAAAATTGC CCGCTATCAA
CAGTTTTTTG CTGTACAGGC AACCATGAAA CGTATAAAGG GGGAAGATAA CAAAGGTGCA
AGGGGTGGTG TGATCTGGCA CAGCCAGGGC AGCGGTAAAT CTCTCACCAT GGTGATGCTG
GTTAAAAAAA TAATAGCTGA CCCGGATATC GGAAATCCTC GTTTTGTGCT TGTCAATGAT
CGGGTCAATC TGGATAAGCA GCTGAGGGAC AACTTTGCCA GAACTCAGCT CAGACCGGTC
CGGGCCAGTA CAGGCAAGGG TCTGATTGAT TTGTTAAGGG ACAAAAGTGA AACTATAATT
ACTACTCTTG TACATAAATT TTATACAGCA GCAAAAAAGA GAGTCAAGAT TGAAGATGAC
AATATTTTTT TGCTGGTGGA CGAAAGCCAT CGCTCACACT CAAGTGAACT TCATGACTTT
ATGATTGATG TTCTGCCTAA TGCTATAAAA ATAGGTTTTA CCGGCACGCC GCTGCTTAAG
AAACACAAAT TAAATACCTA TGCGCAGTTT GGACCGCTTA TTGACAGTTA CCCTATTACC
AGGGCTGTTG AAGACGGTGT TATTGTTCCT CTTGTCTATG AGGGCAGGAT TATTCCACAA
GATGTGACGA GTGAGAAGAT TGATGATTAT CTCAAATATA TTATTGCGCC GCTGAATCAC
GAGCGGCAGG AAGATATGAA GCGCAAATGG AGCCGTCTTT TACCTCTGGC TCAAACTCGT
CAACGCATAG ACATGGTGGC TTTTGACATT CATGAGCACT TTATAAGTTA TGCCGGGCCT
AAGGGATTCA AGGCTATGAT TGCAGCATCA TCTCGTCCGG CAGCTATTGA CCTGCATAGA
TCGATAAAGA AACTTGGCGG GATAAAAACA GCGGTGGTGA TATCGCCCGA AAATGTTAAA
GAAGGTGATG AGCTTACCGG TGAAAATAAA GAAAAAATAA GATCCTTTTT CAAGGAAGAG
GTGGAGCCTC TCTTTGGACA TAACTATGAG GAATATGGTG AATGGGCTAA AAACAGCTTC
ATTGGTGGTG AAGATGTGGA TATGCTTATT GTCAAGGATA TGTTGCTAAC CGGCTTTGAC
GCGCCGGTTG CTGCTGTGCT GTATGTGGAT AAACCCATGA GGGAGCATGC TTTGTTGCAG
GCCCTTGCCC GCGTTAACCG CGTTTATCCG GGCAAAGACT TCGGGCTCAT CGTGGATTAT
TGGGGTATTT TTAGCAAGCT TAATACTGCC ATGGATATGT ATGCGGATGA GAAATCCGGT
ATGGACGGAT ATGATCAGGC GGATATTGAA AATGCTATTC TTGGTGCCGT TGATCAAAAA
CGCAAGCTGG AAAGGGCGCA CAGTGAATTG CGGCTTGTTT TTGAGGGAAA GGATTTTGAT
CGAAACAGTT CCGATGGTTG GCAAAGTGCA TTGGCAGATA GTGACTTGAG AAAAGTTTTT
TATGAAAAGC TTTCAGTATT TTCCCGCCTG CTGGATTTGG CCATGGGAAG CTATGCTTTC
TATAGCGCCA TTGGTTATGA CCGTATTCAG CAATACAAGC GGGATTTGCT TCATTTCCAA
AAGCTTCGCG GGGCGGTCTT GCTTCGTTAT AATGAAAAAG TAGATTTTAG TAAATATGAA
GACGGTATTC GCAGCCTGTT GAATAATTTT GTGCTTTCCG AACCAAGCCG GATTATTGTT
GAACCTGTTT CTATTCATGA TACCGAAGGG ATGAAAGCGC AGCTTGAGAA GCTGGATAGC
AAAGCCGCCA AAGGGGATGC CATTCGTACG CGTATGGACA AAGAGCTGGA AACCTGTCGT
TATGATGACC CTTTACTATA TAAGAGGTTT TTTGCGCAGG TCCAAGAAAC CCTGGAAGTA
TACAAAGCCT CAAGAAATGA TGATGTTTAT TTCTTTGAAA TGGAGAAAAT GGCGGATGAT
TTTAAAAAAG GCTATACCGG CCATCACTAC CCTGCGTGTA TTGATAATGA CAGTGATGCG
AAAGCTTTTT ATGGCATAAT TCAGAGTATC ATTGCTGAAG TCATATATGA TATAACGCCG
GAAATAGATG AGGGCATGGG GCAACTGGCC ATTGAGGTCA AAAATGCCAT TTGCAGTCGT
GCAAAGGTGG ATTGGCGTCA TAATGTTGCT GTGCACAAAG ACATGGAACA AGCTCTTGAT
GATCTGATTT GGGATTTTGC CGAGCAATAT CAAATAAAAT TATCCGTTGA AAAAATCGAT
CTCATGCTGG AAGAACTAAG AAAAACAGCA ATTAGCAGGT ATTAG
 
Protein sequence
MPDHTIFDER PESQERAIKV LEKLGYQYVP RSRAEIFRGR LSHVLFPDVL REFMHRQSFV 
YRGKQTPFSE RSIGKAIKDI DVPLISGLMS AGKNIYDMLI SGNSYEEELF DGGRQSFDLK
FIDWEHPENN LWQVTDEFSV ERADGKYVWP DIVLLVNGLP LVVIECKKSG IDVEKGVAQN
TRNWQPDYIP HLFKYAQIIM AMNLNTVKYG TCGTTAEHFS VWREKNYQWQ QEKCSNVSPD
GKVTAQDRTI VSILSKERLL ELMRYFILYD NNVKKIARYQ QFFAVQATMK RIKGEDNKGA
RGGVIWHSQG SGKSLTMVML VKKIIADPDI GNPRFVLVND RVNLDKQLRD NFARTQLRPV
RASTGKGLID LLRDKSETII TTLVHKFYTA AKKRVKIEDD NIFLLVDESH RSHSSELHDF
MIDVLPNAIK IGFTGTPLLK KHKLNTYAQF GPLIDSYPIT RAVEDGVIVP LVYEGRIIPQ
DVTSEKIDDY LKYIIAPLNH ERQEDMKRKW SRLLPLAQTR QRIDMVAFDI HEHFISYAGP
KGFKAMIAAS SRPAAIDLHR SIKKLGGIKT AVVISPENVK EGDELTGENK EKIRSFFKEE
VEPLFGHNYE EYGEWAKNSF IGGEDVDMLI VKDMLLTGFD APVAAVLYVD KPMREHALLQ
ALARVNRVYP GKDFGLIVDY WGIFSKLNTA MDMYADEKSG MDGYDQADIE NAILGAVDQK
RKLERAHSEL RLVFEGKDFD RNSSDGWQSA LADSDLRKVF YEKLSVFSRL LDLAMGSYAF
YSAIGYDRIQ QYKRDLLHFQ KLRGAVLLRY NEKVDFSKYE DGIRSLLNNF VLSEPSRIIV
EPVSIHDTEG MKAQLEKLDS KAAKGDAIRT RMDKELETCR YDDPLLYKRF FAQVQETLEV
YKASRNDDVY FFEMEKMADD FKKGYTGHHY PACIDNDSDA KAFYGIIQSI IAEVIYDITP
EIDEGMGQLA IEVKNAICSR AKVDWRHNVA VHKDMEQALD DLIWDFAEQY QIKLSVEKID
LMLEELRKTA ISRY