Gene RPC_0166 details


Gene Information

Locus tag: RPC_0166
Symbol:
ID: 3971271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 181243
End bp: 182601
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 65%
IMG OID: 637923279
Product: carboxyl-terminal protease
Protein accession: YP_530060
Protein GI: 90421690
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
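
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. Neither convention is stated in the record, but both reproduce the listed lengths.

```python
# Values copied from the record above.
start_bp = 181243
end_bp = 182601

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1359 452
```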


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.97074
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCA AGACTTCGGT AATTCTGCTG AGCGTTGCCA CCGGCGCGGC GCTCACATTG 
TTCGTGACCC AACCGCGATC GATCCTGATG GGATCCACCG CGCGGGCCGC GACCTCCGAC
ACCTATCGCC AGCTCAATCT GTTCGGCGAC GTGTTCGAAC GGGTGCGCAG CGACTATGTC
GAAAAGCCCG ACGACTCCAA GCTGGTGGAA TCCGCGATCA GCGGCATGCT GACCGGGCTC
GATCCGCATT CCAGCTACAT GGATGCCAAG AGCTTCCGCG ACATGCAGGT GCAGACCCGC
GGCGAGTTCG GCGGCCTCGG CATCGAGGTC ACCATGGAAG ACGGTCTGAT CAAGGTGGTG
TCGCCGATCG ACGATACCCC GGCCTCGAAG GCCGGCATTA TGGCCAACGA CATCATCACC
AATCTCGACG ACGAGGCGGT GCAGGGCCTG ACGCTCAATC AGGCGGTCGA GAAGATGCGC
GGCCCGGTCA ACACCAAGAT CCGGCTGAAG ATCGTCCGCA AGGGCCAGGA CAATCCGATC
GACGTCACTT TGGTGCGTGA CAACATCCGC GTCCGCTCGG TGCGCGCCCG CGTCGAAGGC
GACGACATCG GCTACATCCG CATCACCACC TTCAACGAGC AGACCACCGA AGGCTTGAAG
CGCGAACTCG CCGCCCTCAC CACCCAGATC GGCAACGACA AGCTGAAGGG CTGGATTCTC
GACCTGCGCA ACAACCCGGG CGGCCTCTTG GAAGAAGCCG TGACGGTGTC GGATGCGTTC
CTTGATCGCG GCGAAATCGT CTCCACCCGC GGCCGCAACG CCGAAGAAAC CCAGCGCCGC
GCCGCCCATG GGGGCGACCT CGCCAAGGGC AAGCAGGTCA TCGTGCTGAT CAATGGCGGC
TCGGCTTCGG CGTCGGAAAT CGTCGCCGGC GCGCTGCAGG ATCACAAGCG CGCCACGCTG
GTCGGCACCC GCTCGTTCGG CAAGGGCTCG GTGCAGACCA TCATTCCGCT CGGAAGCGGC
AACGGCGCGC TGCGGCTGAC CACGGCGCGC TACTTCACGC CGTCCGGCAA GTCGATCCAG
GCCAAGGGCA TCACCCCGGA CATCGAGGTG CTGCAGGACG TGCCCGACGA GATCAAGTCG
CGCACCGACA CCAAGGGCGA AGCCTCGCTG CGCGGCCATC TGAAGGCCGA AGGCGACGAG
AAGACCGGGT CGCAATCCTA CGTGCCGCCG GAAGCCAAGG ACGACAAGGC GTTGAAGATG
GCGGCCGACC TGCTGCACGG CGTCAAGGTC AACGCCACCG CGCCGGCCAC CGGCGACAAG
GCGGCGATCG ACAAGCCGGC CGGCAAGGTC GAGAACTGA
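
The GC content field can in principle be recomputed from the gene sequence. This is a minimal sketch: the helper name `gc_content` is hypothetical, and the record does not say how the listed percentage was rounded. The example uses only the first row of the sequence, so its result is not expected to match the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group the bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "ATGATGCGCA AGACTTCGGT AATTCTGCTG AGCGTTGCCA CCGGCGCGGC GCTCACATTG"
print(round(gc_content(first_row), 1))  # prints: 58.3
```

Passing the full 1359 bp sequence to the same function gives the whole-gene value for comparison with the GC content field.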
 
Protein sequence
MMRKTSVILL SVATGAALTL FVTQPRSILM GSTARAATSD TYRQLNLFGD VFERVRSDYV 
EKPDDSKLVE SAISGMLTGL DPHSSYMDAK SFRDMQVQTR GEFGGLGIEV TMEDGLIKVV
SPIDDTPASK AGIMANDIIT NLDDEAVQGL TLNQAVEKMR GPVNTKIRLK IVRKGQDNPI
DVTLVRDNIR VRSVRARVEG DDIGYIRITT FNEQTTEGLK RELAALTTQI GNDKLKGWIL
DLRNNPGGLL EEAVTVSDAF LDRGEIVSTR GRNAEETQRR AAHGGDLAKG KQVIVLINGG
SASASEIVAG ALQDHKRATL VGTRSFGKGS VQTIIPLGSG NGALRLTTAR YFTPSGKSIQ
AKGITPDIEV LQDVPDEIKS RTDTKGEASL RGHLKAEGDE KTGSQSYVPP EAKDDKALKM
AADLLHGVKV NATAPATGDK AAIDKPAGKV EN