Gene RoseRS_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1870 
Symbol 
ID5208830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2314536 
End bp2315549 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content63% 
IMG OID640595478 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001276209 
Protein GI148656004 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCTCA TCGTTGACCA GTACGGCGTG TTCGTTTCCA AGCACCAGGG GCGCATTCGC 
GTTGTGAAGG ACAAAGAACG CCTGGCAGAG GTTCCGATCC TCCACCTGGA GCAAATCATG
ATCTGTGGCG ACGGCATCGG TCTCAGCAGC GATGTCGTGC GCGTATGCGC AGAGGAAGGC
ATCCCCATCC ATTTTGTTGA CAGCATCGGC AACGACTACG GCGCCCTGAT GCACGGCGGC
ATTACCGGCA TGGCGCTCAC CCGACGCGCA CAGTTGCGCG CCGGCGACGA TGAGCGTGGT
CTGATGCTGG CGCAGGCATT CGCAAGCGGC AAAATCCAGA GTCAGGCCAA CCTGCTGCGC
TACGCCGCCA AAAACCGCAA GGAGAGCGAC CCGGACCTGC ACCACGACCT GATGCGCACC
GCAACTGAAA TTCTCGACAC GCTGCCGTCG GTGCGCGCTA TGCGCGGCGT GCTCACCGAA
GAAACCCGCG CAGCGCTGAT GGGGTTCGAG GGGATGTCCA GCGCGCGCTA CTGGGCAGCC
GTGGCGCGCA TCATCCCCGA CGACCTCGCC TGGCCCGGAC GCGAGACGCG CGGTGCGCGC
GACCGGTTCA ACCAGGCGCT CAATTATGGG TATGGCATCC TGCAAACGCA GGTGCGCACC
GCTCTGATCC TGGCCGGGCT TGATCCACAC GCCGGGTTTC TCCACGCCGA CCGCCCTGGC
AAGCCGAGTC TCACGCTCGA CCTGATCGAA GAGTTTCGCC AGGCTGTCGT TGACCGCACC
CTGATCGGGC TGGTCAACCG TCAGGTCGAG ATCGGTCAGG GTGACGACGG TTTGCTCGAT
GCAGCGACAC GCAAACGCAT CGCCGAGAAG ATTCTTGAGC GACTGGACAG CACCGAGCCG
TATGAAGGCA AACGGCAGCC GCTGCGCCAC ATTCTTCAGT GCCAGGCGCG GCATATTGCC
ACATTCGTGC GTGGAGAACG CCCAACCTAC GAACCGTTCG TGATGGGATG GTGA
 
Protein sequence
MHLIVDQYGV FVSKHQGRIR VVKDKERLAE VPILHLEQIM ICGDGIGLSS DVVRVCAEEG 
IPIHFVDSIG NDYGALMHGG ITGMALTRRA QLRAGDDERG LMLAQAFASG KIQSQANLLR
YAAKNRKESD PDLHHDLMRT ATEILDTLPS VRAMRGVLTE ETRAALMGFE GMSSARYWAA
VARIIPDDLA WPGRETRGAR DRFNQALNYG YGILQTQVRT ALILAGLDPH AGFLHADRPG
KPSLTLDLIE EFRQAVVDRT LIGLVNRQVE IGQGDDGLLD AATRKRIAEK ILERLDSTEP
YEGKRQPLRH ILQCQARHIA TFVRGERPTY EPFVMGW