Gene Rxyl_0263 details


Gene Information

Locus tag: Rxyl_0263
Symbol:
ID: 4117753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 269314
End bp: 271203
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 67%
IMG OID: 638035053
Product: CRISPR-associated Cmr2 family protein
Protein accession: YP_643052
Protein GI: 108803115
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02577] CRISPR-associated protein, Cmr2 family
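
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(269314, 271203)
print(length, protein_length(length))  # prints "1890 629"
```

Both values agree with the Gene Length and Protein Length fields of this record.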


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGC AATCACTCCA CTTCACCCTG GGTCCGGTGC AGGGCTTCGT CGGACAGGCG 
CGGCGCACCC GCGACCTGTG GGCGGGCTCC TTCCTGCTCT CCTACCTCGC CGGGCAGGCG
ATGAAGGCGG TACTCGAAGG TGGAGGCGAA ATCGTCTTTC CCGAGATCGG CACCCGGGAG
AGGCCCACCG ACCCGCTTCT CGCCGCGATC CTGAAGAGAC CCATCTCTGA AAACCCGCGC
CCCGAGATCG GCTCGCTCCC CAACCGCTTC AAGGCCGGGG TGCAGGATGG CTTCGATCCG
GAGCGCTGCG AGGAGGCGGT GAGGGAAGCC TGGAAGAGGA TCGCCAGCAG CGTCTGGGAG
CGGTACGTCG AGCCGGTAGC GGCTTACGGC AAGGGCACGA AGGAGATCTG GGAGCGGCAG
GTCGAGGGCT TCTGGGAGAT GAGCTGGGTC ATAGGCGAGG ATCCCGGAGA CCGCAGCGAC
CAGCGCTGGC TCGATCTGCG CAAGAACTGG CGTACCCACC ATCCGCCCTC CGAGCCGGGG
GACAAGTGCA CCCTGATGGG GAGCTGGCAG GAGCTCTCGG GGTACGTGAG AGCGAGGGAG
CGCGAGGCGC AGGACAGGTT CTGGGATGAG CTCCGCAAGA AAGCCGGCAC GCTCAACCTG
GGCGAGCACG AGCGGCTGTG CGCGATCGCC CTCATCAAGC GCCTCTTCCC GGAGGTCGCG
AAAGAGACCA TCGGCTGGGA GCTCAACGCG AGAACCTGGC CCTCGACGCC GTACATGGCG
GCCGTGCCCT GGATCGAGGA GGCGCGCAAT AAACCAGAGG CCAAGAAGCA TCTCGAGCTG
GTACGCTCCA GCGGGGCGCG AAGCTCGGCC TTCGGCGAGT ACAACACCAA CCTCGCGTGC
CCGAAGAATG AGAAGGACTT CGCCCGGCTC GACGGCAACT TCTTCCACAG AGCGGCCCTC
GAGAACGAGC GGGCAACCCC GGACCTCTCT CCCCAGGAAC GAAAAGGCCT TCTCGAAAAC
CTGAAGGCTC TCAATGAGGC CGTGGGGCAC CCGGCCTCGA CCTTCTACGC CCTCTTGCTC
ATGGACGGCG ACAGGCTCGG CAGCCTGCTG CAGAACAAGG ACATCGAGCC CGAGCTCATC
TCCCGGGCCC TCGCGGAGTT CACTGCGGAG GTCGAGGGGA TCATCGGAGA TCACTGCGGC
AGGACGGTCT ACGCCGGAGG CGACGACGTG CTCGCGCTCC TGCCGGTCGA CCGGGCGCTA
CAGGCCGCGG CGGAGTTGCG CTGCAGGTTC CGCCGCGCGT TCGGCTCCGT GTTCGGCGAC
CGGAGGCCGG TAGACAAGGA TGGCAAAACC CTCAAGACGA CCATCTCCGC CGGGCTCGTC
TACGCCACCT ACAACACGCC GCTGCGGGCG GTGATGCAGG AGGCGCACCG GCTGCTCGAC
GAGGTCGCCA AGGACGAGAA CGGCCGCGAC AGCATCGCGG CGAGCGTTCT CGCCGGCAGC
GGGCGCACCG TCCAGTGGGT CTCGGCCTGG GACGAGGGGC CGGGTGACGA GCAAATGATC
ACGAGCACCC TGACGGGCCT CGCAGAAGAC CTGGAGGAGG AGTTCGCGGG CCGCTTCTTC
TACAACGTCC GCGAGCGTTT CGATGTTCTC ACCGGCGACG GCGATAGGCT CATCGAGGAT
CTCGACGCGC AGGCCCTCCT CGTCGCCGAG TACCTGAAGA GCCGGGAGCG CGACGGAGAC
AGGAGGGAGG CCGAGAAAAC CATAGAGCGG CTCCTCAAAG TATGCCGCCG CCGGAAGGGA
GGAGAGGCGC CCGACGAGGG CACGCTCGAC GTCTCGGGGG CGATGCTGGT CCGGTTTCTC
GCGACGAAGG GACGGGGGGT GGAGAGATGA
 
Protein sequence
MKKQSLHFTL GPVQGFVGQA RRTRDLWAGS FLLSYLAGQA MKAVLEGGGE IVFPEIGTRE 
RPTDPLLAAI LKRPISENPR PEIGSLPNRF KAGVQDGFDP ERCEEAVREA WKRIASSVWE
RYVEPVAAYG KGTKEIWERQ VEGFWEMSWV IGEDPGDRSD QRWLDLRKNW RTHHPPSEPG
DKCTLMGSWQ ELSGYVRARE REAQDRFWDE LRKKAGTLNL GEHERLCAIA LIKRLFPEVA
KETIGWELNA RTWPSTPYMA AVPWIEEARN KPEAKKHLEL VRSSGARSSA FGEYNTNLAC
PKNEKDFARL DGNFFHRAAL ENERATPDLS PQERKGLLEN LKALNEAVGH PASTFYALLL
MDGDRLGSLL QNKDIEPELI SRALAEFTAE VEGIIGDHCG RTVYAGGDDV LALLPVDRAL
QAAAELRCRF RRAFGSVFGD RRPVDKDGKT LKTTISAGLV YATYNTPLRA VMQEAHRLLD
EVAKDENGRD SIAASVLAGS GRTVQWVSAW DEGPGDEQMI TSTLTGLAED LEEEFAGRFF
YNVRERFDVL TGDGDRLIED LDAQALLVAE YLKSRERDGD RREAEKTIER LLKVCRRRKG
GEAPDEGTLD VSGAMLVRFL ATKGRGVER
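
The protein sequence follows from the gene sequence by codon-wise translation, and the GC content is the fraction of G and C bases. The sketch below illustrates both with the Python standard library only. It assumes the standard codon assignments, which translation table 11 shares for sense and stop codons; it does not handle alternative start codons, which does not matter here because the gene starts with ATG. The names translate and gc_fraction are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
print(translate("ATGAAAAAGC AATCACTCCA CTTCACCCTG"))  # prints "MKKQSLHFTL"
print(translate("GCGACGAAGG GACGGGGGGT GGAGAGATGA"))  # prints "ATKGRGVER"
```

Passing the full gene sequence to translate and gc_fraction should reproduce the protein sequence and the GC content field, provided the assumptions above hold.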