Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_0919 |
Symbol | |
ID | 8427858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 930756 |
End bp | 931748 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 645033262 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003190436 |
Protein GI | 258514214 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000247044 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGAAA GTTTCTACTT ATTTTCAAAT GGAGAGTTGC AAAGAAAGGA TAACGTGCTT AGGATTACTG CTTCGGATGG GCGATACAAG GATATTAAGG TGGAGATGAC ACAAGATATT TATCTGTTTG GGGAGGTTGA TTTAAATACA AAGTGTTTGA ATTATGCGGG TCAATTATCC ATCCCAATTC ACATATTCAA TTATTATGGC TCATATACAG GCAGCTTTTA TCCTAAGGAA AAAAATGTTT CGGGTAAGCT GCTTATCGAA CAGGTCAACC ATTATACGGA TAAGTATAAG CGGCTGGAAA TAGCCAAGGC ATTTATTGAA GCGGCTAGTT ATAATATTTT GCGGAACTTA CGTTATTACA GTGAGAGAGG CAGAGATCTG CAGGCATGCA TGACAGAAAT CAAAGGATTA AGGAAGGTTA TTCCCCGGAC AGAGGACATT AATGAATTAA TGGGAATAGA GGGATGTATT AGGCAAGCCT ATTATAATAG CTGGAAAGAT ATTATCAATC AAGAGGTTGA TTTTGAAAAG AGGGTGAAAC GCCCGCCCGA TAACATGATT AATGCGCTCA TATCCTTTGT AAATTCACTG ATATATGCAA CCTGTTTGAC GGAAATATAT AAGACTCAGT TACATCCAAC TGTAAGCTAT CTCCATAGTG CTGGGGAAAG AAGATTTTCT TTATGTTTGG ATATTTCAGA AATCTTCAAA CCGCTAATAG GTGATCGGTT AATTTTTTCT ATGCTGAACA AGAATATGCT CACTGAAAAG GACTTTGAAA GCCAATCTAA TTTCTGCTAT TTGAAGGATA ATGGACGTAA GCGGTTGCTT CAGAAGTATG ACGAAGATTT GAAGCGTACT ATTCGGCATA AGGTTCTAAA TAAAAACGTA TCTTATCGAT ATTTAATCCG GCTTGAGTGT TATAAACTAA TTAAGCATCT AATGAGTGAT AAAAAATATG AAGGATTTAC AATTTGGTGG TGA
|
Protein sequence | MAESFYLFSN GELQRKDNVL RITASDGRYK DIKVEMTQDI YLFGEVDLNT KCLNYAGQLS IPIHIFNYYG SYTGSFYPKE KNVSGKLLIE QVNHYTDKYK RLEIAKAFIE AASYNILRNL RYYSERGRDL QACMTEIKGL RKVIPRTEDI NELMGIEGCI RQAYYNSWKD IINQEVDFEK RVKRPPDNMI NALISFVNSL IYATCLTEIY KTQLHPTVSY LHSAGERRFS LCLDISEIFK PLIGDRLIFS MLNKNMLTEK DFESQSNFCY LKDNGRKRLL QKYDEDLKRT IRHKVLNKNV SYRYLIRLEC YKLIKHLMSD KKYEGFTIWW
|
| |