Gene RPC_1575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1575 
Symbol 
ID3972923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1712528 
End bp1714597 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content66% 
IMG OID637924691 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_531456 
Protein GI90423086 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0027772 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGCGC GCAGCAATCC GGCCGATGGC GCCGGCGACG AATTTCTGTG GCTCGAAGAG 
ATCGAGGGCG AGCGCGCGGT TCAATGGGTC GCGGCGCAGA ATGCGCGCAC CGATGCGCAA
TTGCGCGATG CCGCCTACGC GGCGGATTTC GACGCCGCGC TAAAAATCCT CAATGCCGAC
GACCGCATCC CGTTCGTCAG CAAATCCGGG GATCATTTGT ACAATTTCTG GAAGGACGAA
GCCCATCCGC GCGGGCTGTG GCGGCGCACC ACGCTTGCAT CCTATCGCAG CGCCGCGCCG
GAGTGGGAGA TCCTGCTCGA CATCGACGCG CTGAACGAGC GAGAGGGAAT CTCCTGGGCG
TTTGCCGGCG CGGCCCGTTC GCCGGATAAG TCCCGCGCGC TGGTCAGCCT GTCGTTCAAC
GGCACCGACG CCGTCGAGCT GCGCGAGTTC GATCTTGCGA CCAAGCGCTT CGTCGACGGC
GGCTTCCAAA TTCCGCAGGC CAAAACCCGC GCCGACTGGC TCGACGGCGA CACCATCCTG
TTCGGCAGCG CGCTGAACGC CGACGACAGC ACCGCGGCCG GCTACGCCCG GCTGGTGCGC
AAATGGCGCC GCGGCACGCC GCTCAGCGAC GCCGAAGTCG TGTTCGAGGT TGAGAAGCAG
GACGTCGCCG CCTGGTTCGG CGTCAACCGA AGGCCGACGC ACGAAGCCGT CGCCTATTGG
CGGGCGCTGG ATTTCACCCG GTCGCAGATT TTCGTCGAGC CGCAGCACGG CGCGTTCGCC
GGGCAGCGGC TACGGCTGGA GCTGCCAGAG CAGGTGTCGA TCTCGCTGGA GGCCAGCGAT
CTATTCGTCA GCCCGAAGCA GGACTGGCGC GTCGGCGACA ACACCATCGT CGCCGGCGCG
CTGGCGGTGA TCGCGCTGGA TCGCTTTCTT GAAGGTAGCC GCGATTTCGC AATCGTGTTT
CAGCCGACGC CGACCCGCTC GCTGCAATCC TGGCTGGAAA CCCGCCACGG CGTGGTGCTG
CAGATCCTCG ACGACGTCCG CGGCCGGCTC GAACTCGCCA GCCGCGGCAA CAAGGGTTGG
GCCATGCGCG CGCTGCCCGA TCTGCCAGAC AATGCCTCGA TCTATGCGGA AAATTTCGGC
GGCGAAGACG AGCCGGCGCT CGGCAGCGAA ATCGTGCTGA CGGTCACCGG CTTCGACCGG
CCGACCACCA CGGCGCTATG GAACGGCGAA GGCGCGCCGC AGCTGTTGAA GCGCGCGCCG
GCGTCGTTCG ACTCCACCGG CATCGAGGTC GCGCAGCGTC ACGCGATCGC CGCCGACGGC
ACCAAGATTC CGTACTTCCT GATCGGCAAG AATCTTTCGG CCGGCGGGCC GCCGCGGCCG
ACGATTCTGT ATGGTTATGG CGGCTTCGAA GTGTCGCTGA CGCCGGCCTA CATGGGCATC
GTCGGCAAGC TGTGGCTGGA GCAAGGAAAC CTCTACGCAG TGGCGAATAT CCGCGGCGGC
GGCGAATTCG GCCCAAGCTG GCATCTCGCC TCGCGCAAGG CCACCAAGCA CGTCGCGCAT
GACGATTTCG CCGCGGTGGC GCGCGACCTC GCTGCCTCAG GCGTCACCAC CGCGCAAAAG
CTCGCGTGTC ACGGCGGCAG CAACGGCGGC TTGTTGGTCG GCACCATGCT GACGCGCTAT
CCGGAATTGT TCGGCGCGGT GTGGTGCAGC GTGCCGCTGC TCGACATGGC GCGCTACACC
AAGCTGCTCG CGGGACAGAG CTGGATCGCC GAATATGGCG ATCCGGAAAA TCCCGAGGAG
TGGGCGTTCA TCCAAAAGTA CTCGCCGTAT CATCTGGCCA GCGCCGCGAA GACCTATCCG
CCGATCTTCA TCACCACCAA CCGCACCGAC GATCGCGTGC ATCCCGGCCA CGCCCGCAAG
ATGGCGGCGC GGCTTTCGGC ACTGGGTCAG CCGGTGTGGT TCAACGAGAC CGTCGCCGGC
GGCCATTCCG GCGCGGTCGA CAACACCAAG CAGGCGCAAA GCCAGGCGCT CGGCTTTGCG
TTTCTGCGCA GCACGATCTG CAAGGGGTGA
 
Protein sequence
MNARSNPADG AGDEFLWLEE IEGERAVQWV AAQNARTDAQ LRDAAYAADF DAALKILNAD 
DRIPFVSKSG DHLYNFWKDE AHPRGLWRRT TLASYRSAAP EWEILLDIDA LNEREGISWA
FAGAARSPDK SRALVSLSFN GTDAVELREF DLATKRFVDG GFQIPQAKTR ADWLDGDTIL
FGSALNADDS TAAGYARLVR KWRRGTPLSD AEVVFEVEKQ DVAAWFGVNR RPTHEAVAYW
RALDFTRSQI FVEPQHGAFA GQRLRLELPE QVSISLEASD LFVSPKQDWR VGDNTIVAGA
LAVIALDRFL EGSRDFAIVF QPTPTRSLQS WLETRHGVVL QILDDVRGRL ELASRGNKGW
AMRALPDLPD NASIYAENFG GEDEPALGSE IVLTVTGFDR PTTTALWNGE GAPQLLKRAP
ASFDSTGIEV AQRHAIAADG TKIPYFLIGK NLSAGGPPRP TILYGYGGFE VSLTPAYMGI
VGKLWLEQGN LYAVANIRGG GEFGPSWHLA SRKATKHVAH DDFAAVARDL AASGVTTAQK
LACHGGSNGG LLVGTMLTRY PELFGAVWCS VPLLDMARYT KLLAGQSWIA EYGDPENPEE
WAFIQKYSPY HLASAAKTYP PIFITTNRTD DRVHPGHARK MAARLSALGQ PVWFNETVAG
GHSGAVDNTK QAQSQALGFA FLRSTICKG