Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1870 |
Symbol | |
ID | 5208830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2314536 |
End bp | 2315549 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640595478 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001276209 |
Protein GI | 148656004 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCTCA TCGTTGACCA GTACGGCGTG TTCGTTTCCA AGCACCAGGG GCGCATTCGC GTTGTGAAGG ACAAAGAACG CCTGGCAGAG GTTCCGATCC TCCACCTGGA GCAAATCATG ATCTGTGGCG ACGGCATCGG TCTCAGCAGC GATGTCGTGC GCGTATGCGC AGAGGAAGGC ATCCCCATCC ATTTTGTTGA CAGCATCGGC AACGACTACG GCGCCCTGAT GCACGGCGGC ATTACCGGCA TGGCGCTCAC CCGACGCGCA CAGTTGCGCG CCGGCGACGA TGAGCGTGGT CTGATGCTGG CGCAGGCATT CGCAAGCGGC AAAATCCAGA GTCAGGCCAA CCTGCTGCGC TACGCCGCCA AAAACCGCAA GGAGAGCGAC CCGGACCTGC ACCACGACCT GATGCGCACC GCAACTGAAA TTCTCGACAC GCTGCCGTCG GTGCGCGCTA TGCGCGGCGT GCTCACCGAA GAAACCCGCG CAGCGCTGAT GGGGTTCGAG GGGATGTCCA GCGCGCGCTA CTGGGCAGCC GTGGCGCGCA TCATCCCCGA CGACCTCGCC TGGCCCGGAC GCGAGACGCG CGGTGCGCGC GACCGGTTCA ACCAGGCGCT CAATTATGGG TATGGCATCC TGCAAACGCA GGTGCGCACC GCTCTGATCC TGGCCGGGCT TGATCCACAC GCCGGGTTTC TCCACGCCGA CCGCCCTGGC AAGCCGAGTC TCACGCTCGA CCTGATCGAA GAGTTTCGCC AGGCTGTCGT TGACCGCACC CTGATCGGGC TGGTCAACCG TCAGGTCGAG ATCGGTCAGG GTGACGACGG TTTGCTCGAT GCAGCGACAC GCAAACGCAT CGCCGAGAAG ATTCTTGAGC GACTGGACAG CACCGAGCCG TATGAAGGCA AACGGCAGCC GCTGCGCCAC ATTCTTCAGT GCCAGGCGCG GCATATTGCC ACATTCGTGC GTGGAGAACG CCCAACCTAC GAACCGTTCG TGATGGGATG GTGA
|
Protein sequence | MHLIVDQYGV FVSKHQGRIR VVKDKERLAE VPILHLEQIM ICGDGIGLSS DVVRVCAEEG IPIHFVDSIG NDYGALMHGG ITGMALTRRA QLRAGDDERG LMLAQAFASG KIQSQANLLR YAAKNRKESD PDLHHDLMRT ATEILDTLPS VRAMRGVLTE ETRAALMGFE GMSSARYWAA VARIIPDDLA WPGRETRGAR DRFNQALNYG YGILQTQVRT ALILAGLDPH AGFLHADRPG KPSLTLDLIE EFRQAVVDRT LIGLVNRQVE IGQGDDGLLD AATRKRIAEK ILERLDSTEP YEGKRQPLRH ILQCQARHIA TFVRGERPTY EPFVMGW
|
| |