Gene RPC_4488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4488 
Symbol 
ID3972403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4997120 
End bp4998019 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content63% 
IMG OID637927599 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_534330 
Protein GI90425960 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.155164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.116013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTGGC GAGGCCTGCA CCTGACGCAG GCGGCAAGGC TGTCGCTTGC CGACGGTCAG 
GTTTGCGTCA GACAGGATGC GGGCGAAGTG CGGCTCGCAC TGGAAGATAT TGCCTGGATC
GTGATCGACA CGCCGCAGGC GACGCTGTCG AGCGCGCTGA TGAGTGCGTG CATGGACGCC
GGCGTCGTGC TGATCTTTAC CGACGAGCGG CACACGCCAT CGGGCGTTGC CTTGCCGTTT
CATCGTCACC ATCGCCAGGG CGCGATCGCG AAGCTTCAGT TCGACGCCAA GGACGGCGTG
AAGCGGCGGC TGTGGCAAGC CATCATTCGC CGCAAGATTC TCAATCAGGC GGCTTCGCTC
TCGGTTCTTA ACCGCCAGAA TTCAGAGACT CTCGCGGAGA TTGCGCGTCA TGTCGAGCCG
GGCGATCCGG AGAACGTCGA GGCCCGCGCG GCGCGCTTCT ATTGGGGCCG TCTGTTTGGG
GATTTCGTGC GCGACGACGA GGGTGATCTT CGCAACAAAA TGCTGAACTA CGGTTATGCC
GTCATGCGCG CCGGCGTTGC GCGGGCGCTG GTCGCCTGCG GATTTCTTCC GGCGTTCGGT
TTGAAGCACG AGAGCGCGGC CAATGCTTTC AACCTCGCGG ACGATATCGT CGAGCCGTTC
CGGCCGTTTG TCGATGGTCT CGCATGGACG ACTCTCGGTG ATCGCGTGGC CAAGAACGGC
GATCTCACGC TGGATGACCG TCGCGCCATG GCCGGCGTGC TGCTGATGAA TGGCCGGGTC
GGGGACGCCA AGGTGTCGCT TCTGGTTGCC GCGGAAATGG CCTCCGCCAG CCTCTGCCGT
GCGCTGGAGT TCGAAAAGCC GGCGTTGCTC GAATTGCCGG AATTGGAGCG CATTTCATGA
 
Protein sequence
MAWRGLHLTQ AARLSLADGQ VCVRQDAGEV RLALEDIAWI VIDTPQATLS SALMSACMDA 
GVVLIFTDER HTPSGVALPF HRHHRQGAIA KLQFDAKDGV KRRLWQAIIR RKILNQAASL
SVLNRQNSET LAEIARHVEP GDPENVEARA ARFYWGRLFG DFVRDDEGDL RNKMLNYGYA
VMRAGVARAL VACGFLPAFG LKHESAANAF NLADDIVEPF RPFVDGLAWT TLGDRVAKNG
DLTLDDRRAM AGVLLMNGRV GDAKVSLLVA AEMASASLCR ALEFEKPALL ELPELERIS