Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0257 |
Symbol | |
ID | 4116088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 263671 |
End bp | 264666 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638035047 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_643046 |
Protein GI | 108803109 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGGC CCGTCTACAT CTTCAACGCC GGCGAGCTGC AGAGGCAACA AAACACGCTG CGCTTCACGC TCGCCGACGG CAAGCGCCGC TTCGTCCCGG TGGAGACCAC GGGCGAGATC CACGTCTTCG GCGAGGTCTC GGTCAACACG AAGCTCCTCG TCTTTCTGGC CCAGAACGCC ATCCCGCTCC ACGTCTACAA CTACTATGGC TACTGGTCGG GCTCGTACAT GCCGCGCGAG CAGTACGTCT CGGGCTACCT CACCCTGAAG CAGGCCGAGC ACTACCTCGA CCACGAGATG CGTCTCGTTC TCGCCCGCGC CTTCGTGCGC GGGGCGATGG AGAACATGGA GCGGGTGCTC GGCTACTACG CCCGGCGCGG CGTGGAGCTG GATGGGCAAC TGGCGGAGAT CGCCGGCAAG AAGGAGAGCC TGCCGCTCGC CCTGACCACG GAGGAGCTTA TGGCCGTCGA GGGCGGGTGC CGGGACCTCT ACTACGGCTG CTGGGACGGG ATCGTAAAGA GCGAGGAGTT CCGCTTCGAG AAGCGCACCC GCAGGCCACC GGCAAACAGG ATCAACGCGC TCGTCTCCTT CGGCAACAGC CTCCTCTACG TGACCGTCCT CTCGGAGATC CACCGCACCC ACCTCGACCC CCGCATCGGT TTTCTGCACA CCACCAACCA GCGCCGCTAC ACCCTCAACC TGGACGTGGC CGAGGTCTTC AAGCCGATCA TCGTGGACCG CGTGATCTTC TCGCTCCTGA ACCGGGGCGC GATCCAGGCG AAACACTTCC ACAAGGGCAC CGAGGGCGTC TTCCTGAACG AGAGCGGGCG GAAAACGTTC ATCGAAGCCT ACGAGACCCG CCTGAAGGAG ACCATCAAGC ACCCGAAGCT CGGAAGGCCT GTTTCCTACC GGCGGCTCAT TCGCATGGAG CTCTACAAGC TGGAGAAGCA CCTTATGGGA GACGAGCCCT ACGAGCCCTT CGTGAGCCGG TGGTAA
|
Protein sequence | MKRPVYIFNA GELQRQQNTL RFTLADGKRR FVPVETTGEI HVFGEVSVNT KLLVFLAQNA IPLHVYNYYG YWSGSYMPRE QYVSGYLTLK QAEHYLDHEM RLVLARAFVR GAMENMERVL GYYARRGVEL DGQLAEIAGK KESLPLALTT EELMAVEGGC RDLYYGCWDG IVKSEEFRFE KRTRRPPANR INALVSFGNS LLYVTVLSEI HRTHLDPRIG FLHTTNQRRY TLNLDVAEVF KPIIVDRVIF SLLNRGAIQA KHFHKGTEGV FLNESGRKTF IEAYETRLKE TIKHPKLGRP VSYRRLIRME LYKLEKHLMG DEPYEPFVSR W
|
| |