Gene RPC_1792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1792 
Symbol 
ID3972057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1946142 
End bp1948202 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content66% 
IMG OID637924905 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_531670 
Protein GI90423300 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0554398 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA AGAGTGCCGG ACGCAAGAGC CAAAGCCCGC GGCCGCCGCG ACGCCGGCGA 
CCCGCCGCCG CGACCGACGA TGCAACAGTG CAGCCCTCCG CCTCGGGGGC AGAAGCCGCC
GCAACGCCCA CGCCGCGGCA GTTCTCGATT CCGCCGGAAT TGGTCGAGAC CATCCTGCTC
GGTCCCGCCG ACGACCGCCG CGTGCTGCAG GATTCGCCGC TGCTCGGCGA CGTCTGGGCG
GCCTATGCGG CCGATCCCGG ACGGCCGCAG GATCTACTGA TCACCCCGCA CAAGGATGCC
ACCGCCGCCG ACGTCGCGTT GCGGGTCTCG AAGGCGGTGA AGGACATCCA TCCGAAGGCG
CGGCCGAAGA TCGCCTATCT GCAGGGCCTG GTCGCCGCCA AGCTGACCTT CGAGGAGGTG
TTGCGGGTGG TGGTGCCGCT GACGCAATGG TGGTTTCAGC CGCAGGTGCA GGGCCGGATC
GGCGGCACCG CGCCCGAGGT GCTCACGCAA CTGCTCGACC TCGACGCCGG CAAACCGCGC
GGCGATGCGG CGGCCCTGAC CATCACTTCG CTCGACCGCT ACATCGCGCT CGCTGGACTG
ATCTATTGGA CCGAGCGGCA GCCGCGGCCG CAGGATTTCG ACGACCCGGC GACGCCGCTG
GACAAACGGC CGAAGCTGGA GATCGCGCAG GCGTTGCGTC TCTATAAAAA TCATATCCCC
GAGATCATCA GCGGCATGGT CGAACTCTAC GAAACGGTGC AGGCCGCAAG CCAGGCGACC
GCGGCGAAAG CGCCCAGCGA CCCGGAGGAT GATCCCGACA GCAATCCGGG TCTGATCTTT
CAGGTCTCGC TCAATCGCAA GGCGGAGTTG GCGCTGGATC GCTCGGTTCC GTCGGTCAAG
GCCGATGCGG TGCACGCGTT GTTTCGGGTG AAGTGCAAGT CGATCATCTG GGCGGTGCTG
GATTCCGGGA TCGATGGTAC CCACCCGGCG TTCCAGGTGG CGCTGAACAA ACCGAAACCC
GGCGAGCCGC CGATCCCGTC ATCGCGGGTG CGCAAGACCT TCGACTTCAG CAAGATCCGC
GAGATCGTCA GCAACGACAT CGACGATCTC GACGATGTCG AATGCCAGGA GCTTGCCGCG
GCGACCGGAC GGCCGAAAGC TGAGGTGGTC AAATATCTGA AGAAGGTGGC AACCGACGCC
GGCGCCGGCA AGCCGATCGA CTGGGGCATC GTCGAGAAGC TGATCACGCT GAAGCAGCCT
GCGCCGCCGC CGGCCAGTTC GCACGGCACC CACGTTGCCG GCATTCTCGG CGCCGACAAG
ACCGGCGGCG AACTGCCGGA GGGCGGCGGC CTCGCCGACG GCATGTGCCC GGATATCCGG
CTGTATGATT TCCGCGTGCT CGGCAAGAGT TTGGAAGACA CCGAATTCGC CATCATCGCG
GCGCTGCAAT ACATCCGCTA CGTCAACGAG CGGCACAACT ACATCATCAT CCACGGCGCC
AATCTCAGCC TGTCGATCCC GCACAACGTC CGCAACTACG CCTGCGGGCG CACCCCGGTG
TGCAACGAAT GCGAACGGCT GGTCGACAGC GGCGTGGTGG TGGTCGCCGC CGCCGGCAAC
CGCGGCTTCC AGAAATTCGA AACCAACGAG GGCATCTTCG AGAATTACGC GGCGTTCAGC
ATCACCGATC CCGGCAATGG CGACGGCGTC ATCACCGTCG GCTCGACCCA CGGCAACTGG
CCGCAGACCT ACGGCGTCAG CTTCTTCTCC AGCCGCGGGC CGACCGGCGA CGGCCGGCTG
AAGCCGGATC TGGTGGCCCC CGGCGAGCGG GTGCAGTCCA CGGTGCTCGA CCACGGCTGG
GGCGCCGAAT CCGGCACCAG CATGGCGGCG CCGCACGTCT CCGGCGCCGC GGCGATGTTG
CTGGCGCGCT ACGAGGAATT GATCGGACAA CCGCGCCGGG TCAAGCGCAT CCTGTGCGAC
AGCGCCACCG ATCTCGGGCG GGAAAAGACC TTTCAGGGCC ACGGCATGCT CGACGTGCTG
CGGGCCTTTC AATCGATTTG A
 
Protein sequence
MAKKSAGRKS QSPRPPRRRR PAAATDDATV QPSASGAEAA ATPTPRQFSI PPELVETILL 
GPADDRRVLQ DSPLLGDVWA AYAADPGRPQ DLLITPHKDA TAADVALRVS KAVKDIHPKA
RPKIAYLQGL VAAKLTFEEV LRVVVPLTQW WFQPQVQGRI GGTAPEVLTQ LLDLDAGKPR
GDAAALTITS LDRYIALAGL IYWTERQPRP QDFDDPATPL DKRPKLEIAQ ALRLYKNHIP
EIISGMVELY ETVQAASQAT AAKAPSDPED DPDSNPGLIF QVSLNRKAEL ALDRSVPSVK
ADAVHALFRV KCKSIIWAVL DSGIDGTHPA FQVALNKPKP GEPPIPSSRV RKTFDFSKIR
EIVSNDIDDL DDVECQELAA ATGRPKAEVV KYLKKVATDA GAGKPIDWGI VEKLITLKQP
APPPASSHGT HVAGILGADK TGGELPEGGG LADGMCPDIR LYDFRVLGKS LEDTEFAIIA
ALQYIRYVNE RHNYIIIHGA NLSLSIPHNV RNYACGRTPV CNECERLVDS GVVVVAAAGN
RGFQKFETNE GIFENYAAFS ITDPGNGDGV ITVGSTHGNW PQTYGVSFFS SRGPTGDGRL
KPDLVAPGER VQSTVLDHGW GAESGTSMAA PHVSGAAAML LARYEELIGQ PRRVKRILCD
SATDLGREKT FQGHGMLDVL RAFQSI