Gene Dtox_2390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2390 
Symbol 
ID8429374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2563416 
End bp2565776 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content47% 
IMG OID645034695 
ProductMutS2 family protein 
Protein accessionYP_003191824 
Protein GI258515602 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0786061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.328409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGAAA GGAAAACATT AAAGCGCCTG GAATTTGACA AAATCCTGAA GAAATTAGCC 
GGCTGCACCG GTTCAGTGCT GGGCAGGGAA AGGGCTTTAA GCCTTATGCC TTCTGTTGAT
TATCAAACGG TAAAAATCTG GCTTTCAGAA ACTACGGAAG CCAGAGAGAT GTTTCGGCTG
GAGCCTGCTG CGGACATCGG AGGCTGGCAT GACATAAGAA AAAGCGTAAT CAGGGCACAC
CAGGGGGCGG TATTGGAAGC CAAAGATCTT TCGGAGATAG GAGAAACCTT GGCGGCAGCT
CGCTTAGCCA GGAATTATTT ACTGGAAAGA GATGAGTTTT ATCCCTTGCT GGCCGGGATA
GGCTCGCGCA TCACCAGTTT TGCCGATCTT GAACACAGGT TAAAAAATGC CATCTTGCCG
GGGGGGGAAA TTGCTGACAG GGCATCTGAT GCACTGTCAC AAATTCGTCG GCGTATTACC
AACAACCGTG CATCTGTCAA AGAGCGTTTG GAGCACATTA TTCGTTCCCC GAATTATCAA
AAATATCTGC AGGATCCTAT AGTAACAATA AGAGAAGGGC GTTATGTGGT CCCGGTAAAG
CTGGAATACC GGGGTCAGGT ACAGGGTATT GTACATGACA CCTCAGCCAG CGGGGCGACT
CTTTTTGTTG AGCCAATGGC TGTAGTTGAG GCTAATAACG AATTAAGGCG CTTAATTGCA
GCCGAAAAGC AGGAGATTGT GAAAATACTA ACTGACCTTT CCTGCCGGGT AGCTCAGGAA
AGTGAGCCGC TGGGGGTTAC TCTGGAGGCT TTGGGGCATC TTGATTTTGT TCTGGCTAAA
GCCAGGCTGA GCAGTCAAAT GGATGCCTGG GCCCCTGTTT TAGTTGACGG TCCGGTGGTA
AATATACAAA AAGGCCGTCA TCCTTTGTTA GCCGGAGATG TTGTGCCGGT CAGCGTGCAT
TTAGGCAAAG AGTTCGACTC TTTAATCATA ACCGGCCCCA ATACCGGTGG AAAAACCGTA
ACTTTAAAAA CCATCGGGCT GCTGGTATTG ATGGCCCAGT CAGGGTTGCA CATTCCTGCT
GAAAGCAGTT CGGAAACAGG AATATTTGAA CAGGTATTCG CTGATATAGG TGATGAACAG
AGTATTGAGC AATCCCTCAG TACCTTTTCC TCGCATATGA GCAACATTGT CAGTATCCTC
AACCAATCAG GGGCGGGTAG TTTGGTATTA ATGGATGAAT TGGGAGCCGG TACAGACCCC
ACTGAGGGAG CGGCACTGGC CCAGGCAATT TTGGAGAAGC TGCATGAGCA AAAGGCTAAA
ATTGTAGCTA CAACTCACTA CAGCGAGCTG AAAAATTTTG CCTATGCCCA CAGACGGGTT
GAAAATGCCA GTGTGGAATT TGATCCGATT AGTTTAAAGC CTACTTACCG TCTTTTGATA
GGCAAACCGG GACGCAGCAA TGCTTTTGAA ATCGCTCTAC GCCTGGGGCT CGAGCCTGGG
GTTGTCAGCA GAGCCAGGGA TTTTCTGACA ACCGAACAAA TAGAGATCTC TGAGCTGATG
CTGCGGCTGG AAAAAGAACG CCAGGCAGCC GAGGAGGAAA AGCGAATCGC TGAATTATTG
CGGCAGGATG CTGAAAAACT GAAAGCCAGG TATACTGAGC TGGAACAAAT GCTGAGGGAA
AAGAGAGAGG ATATTCTGGC CAAAGCTCAT GAAGAAGCCA GTAAAACGGT TAAAAATACA
CGGCAGGAAG CTGAAGAAGC AATCAAGGAA TTTCGCGGCA TGCTGCAGGA AAATGATAAC
CGCCTGAAAG AAATGGCTGT GCAGGAGGTT CGCAACAAAA TTAAGGGGAT GCAGGGGCGC
CTGCGTAAAG CACCCGAGAA AAGCCACGGG GGTGTGGTAC CCCGGGAGTT ATTAATCGGT
GAAGAGGTTT TTATTCCTAA TTTAAACCAA CAAGGTTATG TTTTGAATGT TTCCACTGAC
GGGAAGGAAG CCCTGGTCCA GGTAGGAATA ATGAAATTAA ATATGCCCGT TAAAGATCTG
CGCAAAGTTG ATGAAGCTAA AAAAGAAAAC AGCGGTAAAG TGCAATTTGC CGGTTTACTT
AAAAATAAAT CTCAGGAGAT TTCAACAAAA CTTGATTTAA GAGGCATGAG AGCTGAGGAA
GCCTGGTTGG AAGTGGAGAA ATACTTGGAT GATGCATTTT TAGCCGGTTT AAACAAGATA
TATGTAGTGC ACGGCAAGGG AACCGGTGCT TTACGGGCCA TGATTCAAAG GGAACTGCAG
AACAGCAGAC GGGTTAAATC TTTTCGCCTG GGTGAACATG GTGAAGGTGG AGCAGGTGTA
ACTGTGGTAG ATTTAAAATA A
 
Protein sequence
MLERKTLKRL EFDKILKKLA GCTGSVLGRE RALSLMPSVD YQTVKIWLSE TTEAREMFRL 
EPAADIGGWH DIRKSVIRAH QGAVLEAKDL SEIGETLAAA RLARNYLLER DEFYPLLAGI
GSRITSFADL EHRLKNAILP GGEIADRASD ALSQIRRRIT NNRASVKERL EHIIRSPNYQ
KYLQDPIVTI REGRYVVPVK LEYRGQVQGI VHDTSASGAT LFVEPMAVVE ANNELRRLIA
AEKQEIVKIL TDLSCRVAQE SEPLGVTLEA LGHLDFVLAK ARLSSQMDAW APVLVDGPVV
NIQKGRHPLL AGDVVPVSVH LGKEFDSLII TGPNTGGKTV TLKTIGLLVL MAQSGLHIPA
ESSSETGIFE QVFADIGDEQ SIEQSLSTFS SHMSNIVSIL NQSGAGSLVL MDELGAGTDP
TEGAALAQAI LEKLHEQKAK IVATTHYSEL KNFAYAHRRV ENASVEFDPI SLKPTYRLLI
GKPGRSNAFE IALRLGLEPG VVSRARDFLT TEQIEISELM LRLEKERQAA EEEKRIAELL
RQDAEKLKAR YTELEQMLRE KREDILAKAH EEASKTVKNT RQEAEEAIKE FRGMLQENDN
RLKEMAVQEV RNKIKGMQGR LRKAPEKSHG GVVPRELLIG EEVFIPNLNQ QGYVLNVSTD
GKEALVQVGI MKLNMPVKDL RKVDEAKKEN SGKVQFAGLL KNKSQEISTK LDLRGMRAEE
AWLEVEKYLD DAFLAGLNKI YVVHGKGTGA LRAMIQRELQ NSRRVKSFRL GEHGEGGAGV
TVVDLK