Gene Dtox_3646 details


Gene Information

Locus tag: Dtox_3646
Symbol:
ID: 8430654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3836931
End bp: 3837962
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 46%
IMG OID: 645035874
Product: CRISPR-associated protein Cas1
Protein accession: YP_003192979
Protein GI: 258516757
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype
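
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are both inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 3836931
END_BP = 3837962


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1032 343
```

Both results agree with the Gene Length and Protein Length fields listed above.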


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGAAAAT TACTGAATAC TCTCTATGTG ACTTCGCCCA ATACTTATTT ATCCCTTGAT 
GGAGAAAATA TTGTGATTTT AAAGAACGAT GCTGAGGTGT TGAGAGTTCC GCTGCATAAT
CTGGAGGGTA TTATTGCGTT TGGTTATACC GGTGCCAGTC CGGCTTTGAT GGGTTCTTGT
GCCAAACGTA ATATAGCTTT GAGTTTTATG AAGCCCAGCG GAAAATTCCT GGGCAGAGTG
GTTGGCGAAG TTAAAGGCAA TGTTACCTTG CGAAAGACTC AGTACAGGTT ATCAGATGAT
GCAGCGAAGA GCCATAAAAT TGCCCAATCC TTTATCTTAG GAAAAATATA TAACACCCGC
TGGGTGGTGG AGCGGGCTAC CAGGGATCAT AGGGCCAGGC TGGATGTGGA TAAATTGAAA
GGAGTCTGCC AAACCCTGGC CAAGGCTTTG AAGCTGGTTG AAAATAGTCA GAATTTAGAA
CAACTGCGGG GATTTGAAGG GGAATCAGCG GCCCAGTACT TTCGAGTATT GGATGATTTG
ATTCTTCAGC AAAAAGATGA TTTTTATTTT CATGCCCGGA ATAAACGTCC TCCTCTGGAT
AATGTCAATG CCCTGCTCTC CTTTATCTAT ACACTTTTGG CCCATGATGC TGCGGCGGCT
TTGGAAACTG TTGGGTTGGA CCCCTATGTG GGATTTCTGC ACCGGGATAG ACCAGGAAGA
ATATCCCTGG CTCTGGATCT AATGGAAGAG TTGCGGGCCG TGTATGCGGA CAGGTTTGTT
ATTTCTTTAA TTAATAAGAG GGAAATTAAT GGCGGCGGCT TTACCCGGAT GGAAAACGGA
GCGGTCATTA TGGATGAAGA TACGAGAAAA GCTGTTCTTA AGGCTTGGCA AAGCAGGAAG
CAGGAGGAGA TAAGGCATCC GTTTCTGCAG GAAAAAATGG AATGGGGACT TGTACCCTAT
GCCCAGGCTA TGCTGCTGGC TAGGTTTATC CGGGGGGATT TGGACGGATA CCCGGCGTTT
ATGTGGAAGT AG
 
Protein sequence
MRKLLNTLYV TSPNTYLSLD GENIVILKND AEVLRVPLHN LEGIIAFGYT GASPALMGSC 
AKRNIALSFM KPSGKFLGRV VGEVKGNVTL RKTQYRLSDD AAKSHKIAQS FILGKIYNTR
WVVERATRDH RARLDVDKLK GVCQTLAKAL KLVENSQNLE QLRGFEGESA AQYFRVLDDL
ILQQKDDFYF HARNKRPPLD NVNALLSFIY TLLAHDAAAA LETVGLDPYV GFLHRDRPGR
ISLALDLMEE LRAVYADRFV ISLINKREIN GGGFTRMENG AVIMDEDTRK AVLKAWQSRK
QEEIRHPFLQ EKMEWGLVPY AQAMLLARFI RGDLDGYPAF MWK
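
The relationship between the gene sequence and the protein sequence can be checked by translating codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the tables differ in which start codons are permitted), so alternative start codons are not handled. The helper names are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
# Codon table built from the NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAGAAAAT TACTGAATAC TCTCTATGTG ACTTCGCCCA ATACTTATTT ATCCCTTGAT"
last_line = "ATGTGGAAGT AG"

print(translate(first_line))  # MRKLLNTLYVTSPNTYLSLD
print(translate(last_line))   # MWK
```

The two outputs match the start and end of the protein sequence above. The last line is in frame because the 17 preceding lines contribute 1020 bases, a multiple of 3.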