Gene Dtox_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3957 
Symbol 
ID8430972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4140430 
End bp4142454 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content49% 
IMG OID645036175 
Productexcinuclease ABC subunit B 
Protein accessionYP_003193273 
Protein GI258517051 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000186158 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000163265 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATTTAA AAATAAGATC CGAATTTACA CCCAGGGGAG ACCAACCTCA GGCTATTGCT 
CAGTTGGTAG ACGGGTTGGG TAAAAAATAT GATAAACAGA TACTTTTAGG AGTTACCGGC
AGCGGCAAGA CTTATACCAT GGCTAAAGTA ATAGAAGAAG TGCAAAGGCC TGCCCTGATT
CTGGCACCCA ATAAGACACT GGCAGCCCAG CTTTGCTCCG AGTTCAAAGA GTTCTTTCCC
AACAATGCAG TTGAATATTT CGTCAGTTAT TATGACTACT ACCAGCCGGA GGCTTATATA
GCCCATACCG ATACCTATAT AGAAAAGGAT TCTTCCCTCA ATGATGAGAT AGACAAACTG
CGCCACTCTG CCACCTGTGC ACTGTTTGAG CGGCGGGATG TAGTCATTGT AGCCAGTGTT
TCCTGTATTT ACGGCCTGGG TGACCCGGAG GAATACAGCA CGCTGGTGCT TTCTCTGCGG
CAGGGAGTGG AATATGACAG GGATGCCATA CTGCGCAAGC TGGTGGATAT CCAGTATGAG
CGCAATGACA TAAACTTTAC CCGCGGCACT TTTCGCGTGC GGGGCGATGT CATAGAAATT
TTTCCGGTAG CTGCCACTGA GCAGGCTATC AGAGTGGAGA TGTTTGGGGA TGAGGTGGAA
AAACTTTTGC AGTTTGACGT TCTGACCGGT GAGATAACCG GGCAGCGCCA GCATATTTCT
GTCTTTCCGG CCAGCCACTA CGCCACCTCC AAGGAGAAAA TGGAGGAGGC CATCAGCCGT
ATTGAGTCAG AGCTGGAGCA AAGGTTGGCC GAGCTGCGAA AACAGGATAA GCTGCTGGAA
GCGCAGCGCC TTGAGCAGAG AACTAACTAT GATATTGAGA TGATGAGGGA GATGGGCTTT
TGCAACGGGA TTGAGAACTA TTCCCGACAC CTGACCGGCA GGGAGGCGGG CCAGCCCCCC
TATACTTTGT TGGATTATTT TCCTGATGAT TTCATTATGT TTATAGATGA GTCGCATGTG
GCTGTGCCAC AAATTGGCGG CATGTACGAG GGGGACAGGT CGCGGAAAGC CTCGTTGATT
GAACACGGTT TTCGCCTGCC TTCCGCACTG GATAACCGTC CCCTGCAGTT TAATGAGTTT
GAGGAGAGAG TTAAACAGGT TATATATGTT TCAGCAACTC CCGGTGCTTA TGAATTAAAA
CATCACAGGC AAATTGTGGA ACAGGTAATT CGCCCCACCG GTTTGGTTGA CCCGGAAATT
ATCATTCGCC CGACCAGAGG ACAGATTGAT GATCTGCTGA TGGAAATCCG ACTGAGAGAA
AAACGTGATG AGAGAGTTCT TATAACTACC CTGACCAAGA AAATGGCCGA GGATTTAACT
GATTACTTCA AAGAAAACGG GGTAAAAGTG CGTTACCTGC ATTCCGACAT CAATACGCTG
GAGCGGATGG AGATACTGCG TGATTTGCGT TTGGGTGTTT TTGACGTGCT GGTGGGAATT
AACCTCTTGC GCGAAGGCCT GGATTTGCCT GAAGTAAGCC TGGTGGCTAT ACTGGACGCG
GATAAGGAAG GCTTCCTGCG CTCCGAGCGT TCACTGATCC AGACTACAGG CCGGGCGGCC
CGTAATGTGG AGGGTAAGGT GATTATGTAT GCGGACAGGA TTACTGAATC TATGTCCAAA
GCGATCAATG AGACTGAACG CCGCCGCAAA AAGCAGCTGG ACTTCAACGA AAAATACAAT
ATTACGCCGC AAACAGTGCG CAAGGCGGTG CGCGATGTAT TGGAAGCTAC TAAGGTGGCT
GAAAGCAAGG TGCCTTATGC TGTCTCCGGT AAGGCGAAGA TGTCCAAAAA GGATTTAATA
AAAATGATTG CCGGGATGGA AAAAGAGATG AAAGAGGCAG CCAGGCAGTT GGAATTCGAG
CAGGCAGCCA GATTAAGGGA TACTATTATC GAACTGCGGC TAAAATTACG CGGTGAGAAA
AACATTAAAG CAGCCATACC CGACGACGGG GAGATAGCTT ATTAA
 
Protein sequence
MDLKIRSEFT PRGDQPQAIA QLVDGLGKKY DKQILLGVTG SGKTYTMAKV IEEVQRPALI 
LAPNKTLAAQ LCSEFKEFFP NNAVEYFVSY YDYYQPEAYI AHTDTYIEKD SSLNDEIDKL
RHSATCALFE RRDVVIVASV SCIYGLGDPE EYSTLVLSLR QGVEYDRDAI LRKLVDIQYE
RNDINFTRGT FRVRGDVIEI FPVAATEQAI RVEMFGDEVE KLLQFDVLTG EITGQRQHIS
VFPASHYATS KEKMEEAISR IESELEQRLA ELRKQDKLLE AQRLEQRTNY DIEMMREMGF
CNGIENYSRH LTGREAGQPP YTLLDYFPDD FIMFIDESHV AVPQIGGMYE GDRSRKASLI
EHGFRLPSAL DNRPLQFNEF EERVKQVIYV SATPGAYELK HHRQIVEQVI RPTGLVDPEI
IIRPTRGQID DLLMEIRLRE KRDERVLITT LTKKMAEDLT DYFKENGVKV RYLHSDINTL
ERMEILRDLR LGVFDVLVGI NLLREGLDLP EVSLVAILDA DKEGFLRSER SLIQTTGRAA
RNVEGKVIMY ADRITESMSK AINETERRRK KQLDFNEKYN ITPQTVRKAV RDVLEATKVA
ESKVPYAVSG KAKMSKKDLI KMIAGMEKEM KEAARQLEFE QAARLRDTII ELRLKLRGEK
NIKAAIPDDG EIAY