Gene RPC_3259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3259 
Symbol 
ID3971771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3608218 
End bp3609711 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content65% 
IMG OID637926370 
Productpeptidase S1C, Do 
Protein accessionYP_533120 
Protein GI90424750 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.324344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.259222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGTG CGATCTCAGC CCTTAGCCGC CGCCTGCGCC CGATCGTCGT GGCCGTTGGC 
CTTGCCTCTG CCGCCGCGTT CAGCGCCGCC CCGGCGCAGG CCCGCGGTCC GGACGGCATC
GCCGACGTCG CCGAAAAGGT GATCGACGCG GTGGTCAATA TCTCGACCAC GCAGACCATC
GAAGCCAAGG CCGGAGCCGG CGAGGGCAAG GGGGCGGCGC CGCAATTGCC GCCGGGATCG
CCGTTCGAGG AGTTCTTCGA CGACTTCTTC AAGAACCGCC GCGGCGGCGA GAAGGGCAGC
GGGCCGCGCA AGACCAATTC GCTGGGCTCC GGCTTCATCG TCGACACCGC CGGCATCGCC
GTGACCAACA ATCACGTCAT TGCGGACGCC GACGAGATCA ACATCATCAT GAACGACGGC
ACCAAGATCA AGGCGGAGCT GGTCGGCGTC GACAAGAAGA CCGATCTCGC CGTCTTGAAG
TTCAAGCCGC CGGCCAAGCC GCTGGTGGCG GTGAAGTTCG GCGACAGCGA CAAGTTGCGG
CTTGGCGAAT GGGTGATCGC GATCGGCAAC CCGTTCTCGC TCGGCGGCAC GGTGACCGCG
GGCATCGTCT CGGCGCGCAA CCGCGACATC AATTCCGGGC CCTATGACAG CTACATCCAG
ACCGACGCCG CGATTAATCG CGGCAATTCC GGCGGCCCGC TGTTCAACCT CGACGGCGAA
GTGATCGGCG TCAACACGCT GATCATCTCG CCGTCCGGCG GCTCGATCGG CATCGGCTTC
GCGGTGCCGT CGAAGACCGT GATCGGCGTG GTGGATTCGC TGCGGCAGTT CGGCGAATTG
CGCCGCGGCT GGCTCGGCGT GCGGATCCAG CAGGTCACCG ACGAGATCGC CGAGAGCCTC
AACATCAAGC CCGCGCGGGG AGCATTGATT GCCGGCGTTG AAGACAAGGG ACCGGCCAAG
CCCGCCGGCA TCGAGCCCGG CGACGTCGTC ATCAGGTTCG ACGGCAAGGA CATCAAGGAG
CCGAAGGATC TGTCGCGCGT GGTGGCCGAC ACCGCGGTCG GCAAGGCGGT CGACGTCGTC
ATTATCCGCA AGGGCAAGGA AGAGACCAAG CAGGTCACGC TCGGCCGGCT CGACGATGGC
GAGAAGCCGG TGCAGGCTTC GGTGAAGAGC CAGCCCGAAG CGGAAAAGCC GGTGACCCAG
AAGGCGCTCG GCCTCGACCT CGCCTCGCTC AGCAAAGAGC AGCGCGCCAA GTTCAAGATC
AAGGACAGCG TCAAGGGCGT GCTGATCACC AGCGTCGACA ACGGTTCGGA TGCGGCGGAG
AAGCGTTTGA GCGCCGGCGA CGTCATCGTC GAAGTGGCGC AGGAAACCGT CGGCAACGCC
AGCGACGTCA AGAAGCGGAT CGAGGCGATC AAGAAGGACG GCAAGAAATC GGTGCTGCTG
TTGGTCTCCA ACGGCGACGG CGAGTTGCGC TTCGTGGCGC TTGGCGTGCA GTAA
 
Protein sequence
MTGAISALSR RLRPIVVAVG LASAAAFSAA PAQARGPDGI ADVAEKVIDA VVNISTTQTI 
EAKAGAGEGK GAAPQLPPGS PFEEFFDDFF KNRRGGEKGS GPRKTNSLGS GFIVDTAGIA
VTNNHVIADA DEINIIMNDG TKIKAELVGV DKKTDLAVLK FKPPAKPLVA VKFGDSDKLR
LGEWVIAIGN PFSLGGTVTA GIVSARNRDI NSGPYDSYIQ TDAAINRGNS GGPLFNLDGE
VIGVNTLIIS PSGGSIGIGF AVPSKTVIGV VDSLRQFGEL RRGWLGVRIQ QVTDEIAESL
NIKPARGALI AGVEDKGPAK PAGIEPGDVV IRFDGKDIKE PKDLSRVVAD TAVGKAVDVV
IIRKGKEETK QVTLGRLDDG EKPVQASVKS QPEAEKPVTQ KALGLDLASL SKEQRAKFKI
KDSVKGVLIT SVDNGSDAAE KRLSAGDVIV EVAQETVGNA SDVKKRIEAI KKDGKKSVLL
LVSNGDGELR FVALGVQ