Gene Dtox_3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3002 
Symbol 
ID8429992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3201621 
End bp3204005 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content43% 
IMG OID645035255 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_003192378 
Protein GI258516156 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.201013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGA TCAATATGAT TGCGCCGGAG ATAATGCGGG ATCTTCGTTA CTTGGCTCAT 
ACCAGAAATA CTGGCGGCAG CAGAGAAACG GAAACTCTGC TGGCTCATTC CGAACTGACG
ATGCACTATT ACGAAGTGTA TTGTGAAAAA AAAGGAATGG AGGAAATTAT CAAAGGTTTA
ATTGCCGCAT GTGGTTTTCA GGGAGAAGAG GAGCAAACAG TTTATCTTTT GTTCCTCTAT
GCTATTTATT TGCATGATGC GGGAAAAATT AATCCCCGCT TTCAATACGA GGTGCTGGAT
AACGGATTGT TTAAGGAAAT GTGCAGGCAG GCAAGAAATT CTCATCATGC CCTGGCTTCC
GCTTATTTGT ACATAGATCA TATGTACAAA ATTCTGCTCA ATAATCCAAC GTTGGCTGTG
CAAAAATGCC TCGCGTCCTT TGCCTATTGT ATCTGTAAAC ACCATGGAAA ATTAGACAAC
GGGCATGATT TTTCGCAGCT GGAAACTTGG AGTGAGAGAT ATTACAAAAG CGATCTAGAC
AGCAGTATTT TTGAAAATAT CCGTTATTAT ATGCAGGAGT CTGAAGTATT TGTAAGCCCA
ATAAATTTTT ATATCCTGTG TCGGTTATTA TACGCCTTGA TTACAGGCTG CGACTACTGT
GCTGCTCAGG AGTTCATGAC AGGCCACGCC ATGCGGCCTG CAGTGATTGA AAACAGTGGG
GAAAATTTGA AACGGTGCTA TGATGACAGT ACTATCTGCC GGAGCATTCG TAATTACCAG
AAGGATCCGG CAACTTTTAG CGGAGATCCG ATCAACGCCT TGCGAAATCA ATTGTTTTTA
GAGGCGGAGC GGAACTTGTT GGCTCAACCG GAGGCCAATA TCTACTACCT GGAGGCACCC
ACCGGTGCCG GAAAAACCAA TATGGCTGTC AACCTTACCT TACGTGTTTT GGAGATGGAT
CCCAATATAA ATAGTGTATT TTATGTTTTC CCTTTTAATA CCCTTGTAGA GCAGACTAAG
AAAGCTCTGC TTCCCTTTTT CAACGATCAA CTGGCGGTAA TCAATTCCAT CACGCCGGTT
GTGATGGGCA GGGAAGAGCG GAAAGATAAG TATGAGGCAG CCTGGCTGGA TTATATCTTC
AATAATTATT CAATTGTCTT GACTTCTCAT GTTAACTTCT TCAATGCTTT GTTCGGTTGT
GGGCGTGAGC AGTGCTTCCC CTTGCTCAAA CTCTGCAACA GTGTGGTCAT ACTTGATGAA
ATCCAAAGCT ACAAAAACAG TATTTGGCGG GAAATAATTG GTTTTCTGCA AAGTTATGCT
CAACAGCTTC ATATTAAGAT CATTATGATG TCTGCCACTC TGCCGCAGCT GAATCAACTG
CTTAGGACGG TAGACGCAAA ATTTGCCGCA CTTGTGGAAG TTCCGCAAAG GTATTACCGG
CACCCGCTTT TTCAGGGAAG GGTGCAGTTG GATTTCTCTC TGCTGGAAAA AGGCGAGATT
GCTCTGGAGG AACTGAAAGA AGAAGTATTG CGTTTCCGGG ACAAACGTGT GTTGGTGGAA
TTTATCAAGA AAAAGACGGC CAGGCAATTT TTCGAGCTTT GCAAGGAAGA GTGTAATGCT
GTTTTGCTTA CGGGTGACGA CAGTGCGGCC AGACGGGATA AGGTAATCCG GCGGATCGAC
AGAGGTGAGA AGCTAATTCT CATTGCCACT CAAGTCATTG AAGCAGGTGT GAACATTGAC
ATGGAAATTG GATTTAAAGA TATCTCTCTT CTTGATTCCG AAGAGCAATT TTTGGGACGA
ATCAACAGGT CCTGTCTAAA TGCCGGTGGT GCTGTGGCCT ACTTTTTCGA TTTTGACGAT
GCGTCGTTGA TCTATAGAGG GGATGCCAGG CTGAGATATT CTATCCAGGA TCCTGCTATA
CGGCAAAATC TTCAGGAAAA ACGATTTGAC GAAATTTATG CGCGGGTTTT TAGGGATTTA
ATAGACAAGA CAAGTAAAGC TAACCGGAAA AACCTGGCTA ATCTATATGA GAATTGTGCG
CAGTTTAATT GCAGGAACAT AGAAGAGAAA ATGCGCTTAA TTGAACGTAG TTGGCAGCTT
TTTATTCCTT ATGTCTGGGA AGAATTAGAC GGGTATCAGG TGTGGGAGGA ATTCAAGGCT
TTGGGCAGCT ATAAGGAAAT GGGCTATGCC CAGCGTATGG TCGAGTATTC CAGACTAGCT
CTAGAAATGT CCTACTTTAC CTTTACGGTA TTCAGGGATA AGATTCCTGC CGGTGCCGAG
GAATACGGCG GCTATTATTT TATTGAAGGC GGGGAGCGTT TTATTGATGA TGGAGTTTTG
AACAGAGATG AGTTGGAAAA ATATTATGGG GGGATATTTT TGTGA
 
Protein sequence
MSKINMIAPE IMRDLRYLAH TRNTGGSRET ETLLAHSELT MHYYEVYCEK KGMEEIIKGL 
IAACGFQGEE EQTVYLLFLY AIYLHDAGKI NPRFQYEVLD NGLFKEMCRQ ARNSHHALAS
AYLYIDHMYK ILLNNPTLAV QKCLASFAYC ICKHHGKLDN GHDFSQLETW SERYYKSDLD
SSIFENIRYY MQESEVFVSP INFYILCRLL YALITGCDYC AAQEFMTGHA MRPAVIENSG
ENLKRCYDDS TICRSIRNYQ KDPATFSGDP INALRNQLFL EAERNLLAQP EANIYYLEAP
TGAGKTNMAV NLTLRVLEMD PNINSVFYVF PFNTLVEQTK KALLPFFNDQ LAVINSITPV
VMGREERKDK YEAAWLDYIF NNYSIVLTSH VNFFNALFGC GREQCFPLLK LCNSVVILDE
IQSYKNSIWR EIIGFLQSYA QQLHIKIIMM SATLPQLNQL LRTVDAKFAA LVEVPQRYYR
HPLFQGRVQL DFSLLEKGEI ALEELKEEVL RFRDKRVLVE FIKKKTARQF FELCKEECNA
VLLTGDDSAA RRDKVIRRID RGEKLILIAT QVIEAGVNID MEIGFKDISL LDSEEQFLGR
INRSCLNAGG AVAYFFDFDD ASLIYRGDAR LRYSIQDPAI RQNLQEKRFD EIYARVFRDL
IDKTSKANRK NLANLYENCA QFNCRNIEEK MRLIERSWQL FIPYVWEELD GYQVWEEFKA
LGSYKEMGYA QRMVEYSRLA LEMSYFTFTV FRDKIPAGAE EYGGYYFIEG GERFIDDGVL
NRDELEKYYG GIFL