Gene Dtox_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2001 
Symbol 
ID8428983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2164444 
End bp2166609 
Gene Length2166 bp 
Protein Length721 aa 
Translation table11 
GC content46% 
IMG OID645034328 
ProductDNA topoisomerase III 
Protein accessionYP_003191459 
Protein GI258515237 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0468853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.227685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAC TAGTCTTAGC GGAAAAACCT AGTGTCGCCA GGGAAATTGC CAGGATACTG 
AACTGTACTA TAAAGGGTAA AGGATACCTT GAGGGCAACA AATATATTGT CACCTGGGCT
CTGGGGCACC TGGTAACTCT GGCAGAACCG GAGGATTATG ACGAGAAATA CAAGACCTGG
CGAACAGAAG ACCTGCCCAT GCTGCCTGCT GAAATGAAAA CCGTGGTAAT CAAGAAAACA
GGTCCACAGT TTCATATAGT AAGCAAATTG ATGAAACGTC CGGATGTCCG GGAATTGATT
ATTGCCACCG ACGCCGGGCG GGAGGGAGAA CTTGTGGCCA GATGGATCAT GAAACAGGCC
CGCTGCCAAA AACCTTTTAA ACGCCTATGG ATATCTTCGC TGACTGACGA AGCTATCAGA
GAAGGCTTTA CCAGGTTAAG ACCAGGTACT GAGTATAATA ACCTGTATGA TTCAGCAATC
TGCCGGGCCG AGGCTGACTG GTTAATCGGA CTCAATGTAA CCCGGGCCCT TACTGTTAAG
TACAATGCCC AGCTGTCTGC CGGACGGGTA CAGACCCCCA CTCTGGCCAT GCTGGTTAGC
CGGGAAAAAG AGATAAAAAG GTTTGTGCCG ACTGATTTCT GGACAGTCAA GGCTAATTTG
GGCAAATTCC AGGCTGAATG GCGAGATAAG TCAGGAAAAC ATAACCGATT TTCTGATTGC
CTGCAAGCAG AAACAATTGT TAATAAGGTA AAAGAAAAGT CCGGAGAAGT AATCGATCTT
ATAACTCAGG AAAAAAGTGA GCAACCTCCT CTGGCTTATG ACTTGACCGA ACTGCAAAGG
GATGCCAATA AAAAGTTCGG CTACACAGCA CAGAAAACTC TTGCAGGCCT GCAAATGCTG
TATGAAAGAC ATAAGTTAGT AACTTATCCC AGAACCGACT CGCGCTATTT AACCTCTGAT
ATGGTACCAA CACTGATGCC CAGATTGAAA GCCGTATCGG GCGGTTATTT CGCAGAACTG
GTTAAGCCAT TGACCGCAAA ACCCTTGCCG ATAAACAAAA GACTGGTGGA TAACAGCAAG
GTTACCGACC ACCACGCAAT TATACCTACC ACAGAAAGAG CCAACCTGGC GGCATTGACT
GCCGAGGAAA AAAATATCTA TGATTTAATA GTACGGCGTT TCATAGCTGT ATTCTACCCG
GCCTACAGCT ATAAGCAAAC AACTGTTATA GTGCTCATAG CAGGTGAACA TTTTCATGCA
TCAGGTAAGG TAGTTACCAA TCCAGGCTGG AAGGCTGTTT ATAATAATGA ATCAAGGGAG
AAAGACACAG GCTATGACGA CACTCCGGAA CAGAATTTGC CGCCGCTTAA AAAAGGTGAC
AAAGTAAAGG TACTGTCCTG CAAGTCTGTT AAAGGACAAA CCAAACCGCC GGCCCGCTAT
ACAGAAGCCA GCTTACTTTC GGCTATGGAA CACCCGGGTA AATTTATTGA GGACGAGATA
CTGAAAGAAT CTGTGAAGGA AAGTGGACTG GGCACCCCCG CGACAAGGGC GGAAATAATA
GAAAAGATTA TCTCCTCTAA TTATGTCGAA CGGCGTGGAA AAGAGTTAAT CCCCACAGCC
AAAGGCATTC AATTAGTAGA ACTCGTACCG CCTGAGCTAA AGCAGGCAGA CTTAACTGCT
AAATGGGAGC AGCAGCTGCA TGATATAGCT ATGGGTAAAT CCAAACGCAC AAAGTTCATT
TCAGGCATCA GAGAGCACGC AGAGAAAATT GTAACTACCG TATTGAACAG TTCCAGCACT
TTTAAACCGG ATAATGTTAC CGGCAACAAG TGTCCCAAAT GTAGAAAGTT TCTGCTTTCA
GTAAAAACTA AAAGAGGCAG CAACCTGGTC TGTCCTGACC GTAGCTGCGG ACACAGGCAG
GTAGAAAAAG AAACCAGCAA CAGGCCCTGC CCCCAGTGTA AAAAGAGAAG GATGGAAATC
AGAGAAGGCA AGGGCGGTAA GATTATTGTC TGCACTCACT GCCGGTACCA GGAAAAATAT
GTATTACAGC CTAAACGTGA AGCAAAGGTC AATATAAAGG CTTACAGTGA CAGCGCGCCT
CTGGTAACAA ACCTGTCAAT TCTGGCCAAT TTTAAATTTA ACGGGGCTAA AGAAAAAAGT
GATTAG
 
Protein sequence
MKSLVLAEKP SVAREIARIL NCTIKGKGYL EGNKYIVTWA LGHLVTLAEP EDYDEKYKTW 
RTEDLPMLPA EMKTVVIKKT GPQFHIVSKL MKRPDVRELI IATDAGREGE LVARWIMKQA
RCQKPFKRLW ISSLTDEAIR EGFTRLRPGT EYNNLYDSAI CRAEADWLIG LNVTRALTVK
YNAQLSAGRV QTPTLAMLVS REKEIKRFVP TDFWTVKANL GKFQAEWRDK SGKHNRFSDC
LQAETIVNKV KEKSGEVIDL ITQEKSEQPP LAYDLTELQR DANKKFGYTA QKTLAGLQML
YERHKLVTYP RTDSRYLTSD MVPTLMPRLK AVSGGYFAEL VKPLTAKPLP INKRLVDNSK
VTDHHAIIPT TERANLAALT AEEKNIYDLI VRRFIAVFYP AYSYKQTTVI VLIAGEHFHA
SGKVVTNPGW KAVYNNESRE KDTGYDDTPE QNLPPLKKGD KVKVLSCKSV KGQTKPPARY
TEASLLSAME HPGKFIEDEI LKESVKESGL GTPATRAEII EKIISSNYVE RRGKELIPTA
KGIQLVELVP PELKQADLTA KWEQQLHDIA MGKSKRTKFI SGIREHAEKI VTTVLNSSST
FKPDNVTGNK CPKCRKFLLS VKTKRGSNLV CPDRSCGHRQ VEKETSNRPC PQCKKRRMEI
REGKGGKIIV CTHCRYQEKY VLQPKREAKV NIKAYSDSAP LVTNLSILAN FKFNGAKEKS
D