Gene Dtox_3955 details

Gene Information

Locus tag: Dtox_3955
Symbol:
ID: 8430970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4135413
End bp: 4137341
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 47%
IMG OID: 645036173
Product: AAA ATPase
Protein accession: YP_003193271
Protein GI: 258517049
COG category: [L] Replication, recombination and repair
COG ID: [COG3593] Predicted ATP-dependent endonuclease of the OLD family
TIGRFAM ID:
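
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4135413
end_bp = 4137341

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which is not counted as a residue.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1929 642
```

Under these assumptions the result agrees with the reported gene length of 1929 bp and protein length of 642 aa.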


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.262295
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00100841
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGAAAA TAGAAAAAAT AGATTTGCAT TTAATTGACG GCACCACTTA TCAGATAGAT 
AACAAACCGG GCCTGACTTA TATTATAGGG CCTAACGGAA CAGGCAAATC CTATGCTTTC
AGAAAATTTG CCGAGGAAAA TATTGGCCGG GCTCTTTATA TTTTGCCTTC CAGAGATGAT
ATGAGGCGCA ATCTATATAA TGTGGGGCAT GGAGAAGAAA ATGATTTTTT CCGCGAATGG
GTCGTCTATG ATCAGGTATG CCGTTACTGG GGCAAGGCTT TGAAGCAGCC GGGTCTTCTG
GCGATGGCTT TTTCCTTTCT GGAGCGCCTG GGCGTACCCC ACAGAATGTC GATAAGTGTG
GAAGAGGGTG AAATTAAGTT TCAAATCTGT GATGAGTCGG TATGTTATGG CAGCAATCAG
GCAATGTCAG GTTTACACAA ACTGCCGGTG CTGGGCATAG CGGTTTATGA TCCCGAGCAT
AAACTGGTTA TTATAGATGA ACCGGAGCAG TCTCTTCACC CCCAGGCACA GCATATTTTT
GTGCAGGTTC TGCGTGATGT GGCCCGCCTG CAGGGCAAGC ACTTCGTGCT GATTACCCAC
TCGCCGACTA TGGTTGACCT GCGTGACGCC GGGGATCTGA CCAGAGTGGT TTTTTTCCGG
AGGCCCAAAA AGTACCTGAC GGAACGCAAG ATTTTTCAGT TAAGTATGGA TGACGCGCAG
CGTTTCAGTG AGCTGCTGCC GGGCTTGACT TCTTATAAAC GTGAAATATT CTTTGCTGAT
AAAGTGATTT TGGTGGAAGG ACAGCATGAC AGGGATGTCT TTATGGCTTT GATTGAGTCC
GGGGGTTTTG GACTATCCCT GGCCCGTACC AGTGTACTGC CTTTGGGCGG GGTAGGCTTT
ATGGCTAAAT ATACGGCTTT TTTTAAAGAA ATAGGTGTTA AGCCTTTCAT TATTTGTGAT
CGTGATGTAT TGTATCCTTC TGCCGGTATT CGTTGGTTTT GCGGTCGTAT GGAGGGTGAG
AGGCTGGACT GGCGGGGCTG GATCAGTCCC TCGCAGGTTA AGGAATATCT TCAGGGCAAA
GAGATGAAGG TACCTGCGGA TCGTTATGAT TTGCAGCCTG CGGAATTTGA TAACATGGAA
CAAATAGCGG CCCGGCTGAG AAATTTGATG AAAAAAGCGT TGGAGAAAAG CGAACAGCTG
ATGGATAGGC TGCAAGGGGT TGCTTCAGCG GCAAATTTGC TGGCCGAACT GGCTAAAAAG
AAAAATCTGA CAGATGACTA CAGTGAATAC CGGGCCTTTC ATTTACTGAT GACGGTTGTG
CTCAATCATT CCGACTGGTA TAATACGCCT GCGGCAGAAA TCTTTAAGCG CTTGAGGGAG
GTATACGAGC AATTGGAGCA CCAGTCTGAC CGCCTGGAAA TACTTATTTT GCGCATAGGG
CGCCTGGAGG ATTTGTACCG GCACAGTGGC CGCCTGGATG CCTCCAAAAC AGAGAAATCC
AGGCGCGAGG CTTTTGATAT ACGCAGATTA TATCAAAATA ATAAGGAACA AGTGGATGTT
GATTATCAAG AGGTGATTGA TCCTCTGATT AGAAAGCGCT ATTTGCTCCG TATGCAAAAC
GGTATTCCGC ACGAGGCGGC GGCTGTATTA TCGGGAAAAA TCCATGAATT ATATGATTTC
CTTTTTAGTG CCGGAGAACT GGCGGAGCGT GTGCAAAAAT TGAGAAGTAC GGGTAAGCTG
GAAGAACTTT CCGCAGGTGC GGAAATTGTA GATTTTTTAC CCGGCGCAGA CCCCCCGCAG
CTAACAATGG AGATACCCCT GCTGAAAGGT TATCTGTCAG TTGACGGTAG AATAACTTTA
ACAGCGGGTA AAAGGCCTGA AGTGTTTGAC ATGTTGTTCA AAAGAAGCTG GCAGGGTGGG
AAAAGCTAA
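
The gene sequence is listed in space-separated groups of ten bases, so it has to be cleaned before any analysis. The sketch below shows one way to do that in Python and to compute a GC fraction that can be compared with the reported GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not state how its GC content was calculated or rounded.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, used here only as sample input.
first_line = "ATGGAGAAAA TAGAAAAAAT AGATTTGCAT TTAATTGACG GCACCACTTA TCAGATAGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

To check the full gene, paste the complete listing into `clean_sequence` and pass the result to `gc_fraction`.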
 
Protein sequence
MEKIEKIDLH LIDGTTYQID NKPGLTYIIG PNGTGKSYAF RKFAEENIGR ALYILPSRDD 
MRRNLYNVGH GEENDFFREW VVYDQVCRYW GKALKQPGLL AMAFSFLERL GVPHRMSISV
EEGEIKFQIC DESVCYGSNQ AMSGLHKLPV LGIAVYDPEH KLVIIDEPEQ SLHPQAQHIF
VQVLRDVARL QGKHFVLITH SPTMVDLRDA GDLTRVVFFR RPKKYLTERK IFQLSMDDAQ
RFSELLPGLT SYKREIFFAD KVILVEGQHD RDVFMALIES GGFGLSLART SVLPLGGVGF
MAKYTAFFKE IGVKPFIICD RDVLYPSAGI RWFCGRMEGE RLDWRGWISP SQVKEYLQGK
EMKVPADRYD LQPAEFDNME QIAARLRNLM KKALEKSEQL MDRLQGVASA ANLLAELAKK
KNLTDDYSEY RAFHLLMTVV LNHSDWYNTP AAEIFKRLRE VYEQLEHQSD RLEILILRIG
RLEDLYRHSG RLDASKTEKS RREAFDIRRL YQNNKEQVDV DYQEVIDPLI RKRYLLRMQN
GIPHEAAAVL SGKIHELYDF LFSAGELAER VQKLRSTGKL EELSAGAEIV DFLPGADPPQ
LTMEIPLLKG YLSVDGRITL TAGKRPEVFD MLFKRSWQGG KS
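
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this in plain Python, assuming the standard amino acid assignments, which translation table 11 shares with the standard code. It does not handle the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listing.
print(translate("ATGGAGAAAA TAGAAAAAAT AGATTTGCAT"))  # MEKIEKIDLH
print(translate("AAAAGCTAA"))  # KS
```

The two outputs match the first ten and last two residues of the protein sequence listed above.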