Gene Information |
Locus tag | RoseRS_4330 |
Symbol | |
ID | 5211314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5442373 |
End bp | 5443404 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640597914 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001278618 |
Protein GI | 148658413 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
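
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration that assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(5442373, 5443404)
print(length, protein_length_aa(length))  # prints: 1032 343
```

Under those assumptions the result agrees with the Gene Length (1032 bp) and Protein Length (343 aa) fields.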
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000958951 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.174892 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGGC TGAACAATAC CCTTTACGTC ACCACGCCCG ATACCTATCT CTCGCTTGAC GGCGAGACTA TCGTTGTCAA GAAAGATGCT CACGTTTCAA CCCGACTGCC CCTCCATAAT CTGGAAAACA TCGTCTGTTT CAACTACCAG GGGGTCAGTC CTGCCCTGAT GGGCGCCTGC GTTGATCGCA ACATTGGACT GACATTTCTT GATGCAAACG GGCGATTTCA GGCACGGGTC GGCGGCAGGA CACGCGGCAA CGTGCTGTTG CGCAAAAAAC AATACCGCAT ATCAGAGGAT CGTGAATTAC GCGCAGCGAT TGCCACCTCA TTTATACAGG GAAAAGTTTA CAACTGTCGC AAAGTGCTGG AACGCACGCT GCGCGATCAT ACGCTCCTGA TCGATGTCGA AGTCGTCAGG AACGCATCGG CGGCGCTCAA GGAAGCGCTC ACGTCCATTT CCCGGTGCAG CAGCATAGAA GCGCTGCTGG CAGTCGAAGG CAATGCTGCC AGCGTCTATT TTGGCGTTTT CGACCATCTC GTCCTGCATC AAAAGGACGA CTTTCGCTTC GAGGAACGAT CACGTCGTCC GCCACGCAAC AATATGAACG CACTGCTCTC GTTTCTGTAC ACCCTGCTGA CGAACGAAGC AGTTTCGGCA CTGGAAACAG TTGGGCTTGA CCCCTATGTC GGCTTTCTGC ACACCGACCG ACCGGGTCGA CCATCGCTCG CCCTCGACCT TATCGAGGAA CTACGACCGA TTTTCGCCGA CCGGATGGCG CTTTCGCTGG TTAATCGCAA GCAGATCACG GCGAAAGGCT TCACATCCAA AGAGAGCGGC GGAGTTGTGA TGGACGCCGA TACTCGCAAG GCCGTAATCG GTGCATGGCA GGAGCGGAAG AAAGAGGAGA TTCTCCACCC TTTCCTGAAA GAGCGGATAC CTTTCGGTCT GATCCCGCAC GTCCAGGCGA CGCTTCTGGC GCGGCATCTA CGGGGCGATC TCGACGCCTA CCCGCCGTTT TTCTGGAACT GA
|
Protein sequence | MKRLNNTLYV TTPDTYLSLD GETIVVKKDA HVSTRLPLHN LENIVCFNYQ GVSPALMGAC VDRNIGLTFL DANGRFQARV GGRTRGNVLL RKKQYRISED RELRAAIATS FIQGKVYNCR KVLERTLRDH TLLIDVEVVR NASAALKEAL TSISRCSSIE ALLAVEGNAA SVYFGVFDHL VLHQKDDFRF EERSRRPPRN NMNALLSFLY TLLTNEAVSA LETVGLDPYV GFLHTDRPGR PSLALDLIEE LRPIFADRMA LSLVNRKQIT AKGFTSKESG GVVMDADTRK AVIGAWQERK KEEILHPFLK ERIPFGLIPH VQATLLARHL RGDLDAYPPF FWN
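
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch of that translation, not the pipeline that produced this record. `translate_cds` and the constant names are hypothetical, and the start-codon set is the table 11 initiator list as commonly documented; treat it as an assumption to verify against the NCBI genetic code reference.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Table 11 uses the same
# codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed table 11 initiation codons; any of these is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is translated as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGAAACGGC TGAACAATAC CCTTTACGTC"))  # prints: MKRLNNTLYV
```

The printed residues match the first ten residues of the protein sequence listed in this record. Passing the full gene sequence in the same way should yield the complete protein, provided the sequence shown is the coding strand, as its GTG start and TGA stop suggest.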