Gene Rxyl_0257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0257 
Symbol 
ID4116088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp263671 
End bp264666 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content64% 
IMG OID638035047 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_643046 
Protein GI108803109 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGGC CCGTCTACAT CTTCAACGCC GGCGAGCTGC AGAGGCAACA AAACACGCTG 
CGCTTCACGC TCGCCGACGG CAAGCGCCGC TTCGTCCCGG TGGAGACCAC GGGCGAGATC
CACGTCTTCG GCGAGGTCTC GGTCAACACG AAGCTCCTCG TCTTTCTGGC CCAGAACGCC
ATCCCGCTCC ACGTCTACAA CTACTATGGC TACTGGTCGG GCTCGTACAT GCCGCGCGAG
CAGTACGTCT CGGGCTACCT CACCCTGAAG CAGGCCGAGC ACTACCTCGA CCACGAGATG
CGTCTCGTTC TCGCCCGCGC CTTCGTGCGC GGGGCGATGG AGAACATGGA GCGGGTGCTC
GGCTACTACG CCCGGCGCGG CGTGGAGCTG GATGGGCAAC TGGCGGAGAT CGCCGGCAAG
AAGGAGAGCC TGCCGCTCGC CCTGACCACG GAGGAGCTTA TGGCCGTCGA GGGCGGGTGC
CGGGACCTCT ACTACGGCTG CTGGGACGGG ATCGTAAAGA GCGAGGAGTT CCGCTTCGAG
AAGCGCACCC GCAGGCCACC GGCAAACAGG ATCAACGCGC TCGTCTCCTT CGGCAACAGC
CTCCTCTACG TGACCGTCCT CTCGGAGATC CACCGCACCC ACCTCGACCC CCGCATCGGT
TTTCTGCACA CCACCAACCA GCGCCGCTAC ACCCTCAACC TGGACGTGGC CGAGGTCTTC
AAGCCGATCA TCGTGGACCG CGTGATCTTC TCGCTCCTGA ACCGGGGCGC GATCCAGGCG
AAACACTTCC ACAAGGGCAC CGAGGGCGTC TTCCTGAACG AGAGCGGGCG GAAAACGTTC
ATCGAAGCCT ACGAGACCCG CCTGAAGGAG ACCATCAAGC ACCCGAAGCT CGGAAGGCCT
GTTTCCTACC GGCGGCTCAT TCGCATGGAG CTCTACAAGC TGGAGAAGCA CCTTATGGGA
GACGAGCCCT ACGAGCCCTT CGTGAGCCGG TGGTAA
 
Protein sequence
MKRPVYIFNA GELQRQQNTL RFTLADGKRR FVPVETTGEI HVFGEVSVNT KLLVFLAQNA 
IPLHVYNYYG YWSGSYMPRE QYVSGYLTLK QAEHYLDHEM RLVLARAFVR GAMENMERVL
GYYARRGVEL DGQLAEIAGK KESLPLALTT EELMAVEGGC RDLYYGCWDG IVKSEEFRFE
KRTRRPPANR INALVSFGNS LLYVTVLSEI HRTHLDPRIG FLHTTNQRRY TLNLDVAEVF
KPIIVDRVIF SLLNRGAIQA KHFHKGTEGV FLNESGRKTF IEAYETRLKE TIKHPKLGRP
VSYRRLIRME LYKLEKHLMG DEPYEPFVSR W