Gene Information |
Locus tag | Dtox_3646 |
Symbol | |
ID | 8430654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 3836931 |
End bp | 3837962 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645035874 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003192979 |
Protein GI | 258516757 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
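The coordinate and length fields in this record are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon.

```python
# Consistency check for the coordinate and length fields of the record.
# Assumptions (not stated in the record itself): coordinates are 1-based
# and inclusive, and the gene length includes the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table.
length_bp = gene_length(3836931, 3837962)
print(length_bp)                  # 1032
print(protein_length(length_bp))  # 343
```

Under these assumptions the results match the listed Gene Length (1032 bp) and Protein Length (343 aa).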
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAT TACTGAATAC TCTCTATGTG ACTTCGCCCA ATACTTATTT ATCCCTTGAT GGAGAAAATA TTGTGATTTT AAAGAACGAT GCTGAGGTGT TGAGAGTTCC GCTGCATAAT CTGGAGGGTA TTATTGCGTT TGGTTATACC GGTGCCAGTC CGGCTTTGAT GGGTTCTTGT GCCAAACGTA ATATAGCTTT GAGTTTTATG AAGCCCAGCG GAAAATTCCT GGGCAGAGTG GTTGGCGAAG TTAAAGGCAA TGTTACCTTG CGAAAGACTC AGTACAGGTT ATCAGATGAT GCAGCGAAGA GCCATAAAAT TGCCCAATCC TTTATCTTAG GAAAAATATA TAACACCCGC TGGGTGGTGG AGCGGGCTAC CAGGGATCAT AGGGCCAGGC TGGATGTGGA TAAATTGAAA GGAGTCTGCC AAACCCTGGC CAAGGCTTTG AAGCTGGTTG AAAATAGTCA GAATTTAGAA CAACTGCGGG GATTTGAAGG GGAATCAGCG GCCCAGTACT TTCGAGTATT GGATGATTTG ATTCTTCAGC AAAAAGATGA TTTTTATTTT CATGCCCGGA ATAAACGTCC TCCTCTGGAT AATGTCAATG CCCTGCTCTC CTTTATCTAT ACACTTTTGG CCCATGATGC TGCGGCGGCT TTGGAAACTG TTGGGTTGGA CCCCTATGTG GGATTTCTGC ACCGGGATAG ACCAGGAAGA ATATCCCTGG CTCTGGATCT AATGGAAGAG TTGCGGGCCG TGTATGCGGA CAGGTTTGTT ATTTCTTTAA TTAATAAGAG GGAAATTAAT GGCGGCGGCT TTACCCGGAT GGAAAACGGA GCGGTCATTA TGGATGAAGA TACGAGAAAA GCTGTTCTTA AGGCTTGGCA AAGCAGGAAG CAGGAGGAGA TAAGGCATCC GTTTCTGCAG GAAAAAATGG AATGGGGACT TGTACCCTAT GCCCAGGCTA TGCTGCTGGC TAGGTTTATC CGGGGGGATT TGGACGGATA CCCGGCGTTT ATGTGGAAGT AG
|
Protein sequence | MRKLLNTLYV TSPNTYLSLD GENIVILKND AEVLRVPLHN LEGIIAFGYT GASPALMGSC AKRNIALSFM KPSGKFLGRV VGEVKGNVTL RKTQYRLSDD AAKSHKIAQS FILGKIYNTR WVVERATRDH RARLDVDKLK GVCQTLAKAL KLVENSQNLE QLRGFEGESA AQYFRVLDDL ILQQKDDFYF HARNKRPPLD NVNALLSFIY TLLAHDAAAA LETVGLDPYV GFLHRDRPGR ISLALDLMEE LRAVYADRFV ISLINKREIN GGGFTRMENG AVIMDEDTRK AVLKAWQSRK QEEIRHPFLQ EKMEWGLVPY AQAMLLARFI RGDLDGYPAF MWK
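The gene sequence above begins with ATG and ends with TAG, so it appears to be listed in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch, not part of the source record, of how the space-grouped sequence could be cleaned, translated, and checked for GC content. All function names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits.

```python
# Codon table built from the NCBI-style ordering of bases (T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments as
# the standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the blocks-of-ten spacing used in the record and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed in the record.
print(translate("ATGAGAAAAT TACTGAATAC TCTCTATGTG"))  # MRKLLNTLYV
```

Passing the full Gene sequence field to `translate` and `gc_content` allows the result to be compared with the Protein sequence and GC content fields of the record.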
|
| |