Gene RPC_3653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3653 
Symbol 
ID3972025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4066717 
End bp4068099 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content57% 
IMG OID637926763 
Producttype I restriction enzyme StySPI specificity protein 
Protein accessionYP_533507 
Protein GI90425137 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.234587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.701676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGTG ATCTTCCAAG CGGATGGGTC GCCGCACCAA TTGACGACCT TCGTGCGCTT 
GAGCCCAATG CTATCACTGA TGGTCCCTAT GGCAGCAGTC TCAAGACGAG CCATTACCGA
TCGAGCGGCG CTCGTGTGGT CCGTCTAGGC AATATCGGTT TTCGCAGATT CTTGAGTGCT
GATGCGGTAT ACATTTCTGA GGATCACTTC AAGGCCCTGG TAAAGCATCA CGTCAGGGCC
GGAGACGTGT TGATTGCCGC GTTGGGTGAC CCGGTAGGTC GTTCTTGCAT TGCCCCGTCT
GATATTTCGC CGGCGTTGGT GAAGGCAGAT TGCTTTCGTC TTCGTTGTTC GCCTCACCTT
TCAGCGCCAT TCATAATGCT TTGGTTGAAC TCGGAGTGCG CACGCGAAGC TTTTTCAAGC
GCAGCTCACG GACTTGGACG TGTGCGCATT AACCTATCTG ATTTTCGAAC GACTGTAGTA
CCTGTTCCTC CAGCGACTGA GCAAGGGCGC ATCGTCGCTA AGATCGACAA CCTGTCCGCA
AAGTCCAAAC GCTCCCGCGA TCACCTCGAC CACATCCCCC AGTTGGTCGA GAAGTACAAG
CAGGCGATCT TGGCGGCGGC GTTTCGTGGC GAGCTGACGC ACGAGTGGCG TGTCAATAAC
CTCGACCAAA AGTGGCCGTG GCCGGAATGC TCACTGTCGG ATATAGCAAA CATCGGGACG
GGAGCGACCC CTAAGCGCGG CGAGCAACGC TATTACAGCA ACGGGAACAT TCCGTGGATA
ACCAGCGGCG CCGTAAAACA CGCGGTGGTG CAGGCCGCTG ATGAATACAT CACGGAGGCC
GCAGTACGCG AGACAAACTG CAAGGTATTT CCGGCAGGAA CGATCTTGAT GGCAATGTAC
GGAGAAGGCA AAACGCGAGG CCGTGTAACG GTGCTTGGTA TCAACGCAGC AACAAATCAG
GCCGTAGCTG CTATTCAGGT CAGGGCCGAC AGTCCCGCAG TTCGAGACTT CGTCGTTTGG
CACTTACGCA GCGGATACCT CGAACTTCGT GAAAGGGCGG CAGGTGGGGT TCAACCCAAT
CTCAATCTCG GAATTGTCAA TGCGTGGCGC ATACCGTTGC CCTCTCGTGA TGAACAGATG
GAAGTAGTAC GTCGAGTGCA AAAGGCCTTT GCCTGGATCG ACCGTCTCAC CATCGAAACC
ACCAGCGCAC GCAAGCTGAT CGACCGCCTC GACCAAGCCA TCCTCGCCAA GGCATTCCGG
GGCGAGTTGG TACCGCAGGA CCCGAACGAC GAACCGGCGA GCATCCTCTT AGAGCGCATC
AAGGCCAAAC GCGCGGGCAG TGCTGGGCAC ACCCGGCGAC GTTCTGCGCG GGCCACTTCG
TGA
 
Protein sequence
MTGDLPSGWV AAPIDDLRAL EPNAITDGPY GSSLKTSHYR SSGARVVRLG NIGFRRFLSA 
DAVYISEDHF KALVKHHVRA GDVLIAALGD PVGRSCIAPS DISPALVKAD CFRLRCSPHL
SAPFIMLWLN SECAREAFSS AAHGLGRVRI NLSDFRTTVV PVPPATEQGR IVAKIDNLSA
KSKRSRDHLD HIPQLVEKYK QAILAAAFRG ELTHEWRVNN LDQKWPWPEC SLSDIANIGT
GATPKRGEQR YYSNGNIPWI TSGAVKHAVV QAADEYITEA AVRETNCKVF PAGTILMAMY
GEGKTRGRVT VLGINAATNQ AVAAIQVRAD SPAVRDFVVW HLRSGYLELR ERAAGGVQPN
LNLGIVNAWR IPLPSRDEQM EVVRRVQKAF AWIDRLTIET TSARKLIDRL DQAILAKAFR
GELVPQDPND EPASILLERI KAKRAGSAGH TRRRSARATS