Gene RPC_3417 details

Gene Information

Locus tag: RPC_3417
Symbol:
ID: 3970461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3804796
End bp: 3806199
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 68%
IMG OID: 637926528
Product: peptidase S1C, Do
Protein accession: YP_533276
Protein GI: 90424906
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
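
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon (neither is stated in the record); the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3804796, 3806199)
print(length, protein_length_aa(length))  # 1404 467
```

Under these assumptions the computed values agree with the Gene Length (1404 bp) and Protein Length (467 aa) fields.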


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.966592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.139479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCGA CACGCGCGCT CACCGCCCTG GTGCTGGCCT TGATTCTCTC CGTTCCGGCG 
CAGGCGCAGC CGCAGGATCG CCGGGTGCCG TCCTCGCCCG CCGAACTGAA GCTCAGCTAC
GCGCCGATCG TGCAGCGGGT GCAGCCGGCG GTGGTCAACG TCTACGCCGC CAAGGTGGTG
CAGAACCGCA ACCCGCTGTT GGAGGATCCG ATCTTCCGCC GCTTCTTCGG CGTGCCCGGC
CAGCAGCCGG AGCAGATGCA GCGCTCGCTC GGCTCGGGCG TGATGGTCGA TCCGTCGGGC
CTGGTGGTGA CCAACAACCA CGTCATCGAG GGCGCCGATC AGGTCAAGGT GTCTTTGTCC
GACAAGCGGG AATTCGAGGC CGAGATCGTG CTGAAGGACA GCCGCACCGA CCTCGCGGTG
CTGCGGCTGA AAGAGGTCGC CGGCGAGAAA TTCCCGACGC TGGATTTCTC CAACTCCGAT
GAACTCTTGG TCGGCGACGT GGTGCTGGCG ATCGGCAATC CGTTCGGCGT CGGCCAGACC
GTGACCCACG GCATCATCTC GGCGTTGGCG CGCACCCAGG TCGGCATCAC CGACTACCAA
TTCTTCATTC AGACCGACGC CGCGATCAAT CCCGGCAATT CCGGCGGCGC CTTGGTCGAC
GTCAACGGCC GGCTGGTCGG CATCAACACG GCTATCTTCT CGCGCTCCGG CGGCTCGCAG
GGCATCGGCT TTGCGATTCC GGCCAACATG GTGCGGGTGG TGGTGGCTTC GGCGAAGAGC
GGCGGCAAGG AGGTCAAGCG GCCGTGGCTC GGCGCGCGGC TGCAGGCGGT GACGCCGGAG
ATCGCCGAAA CGCTCGGTCT CAAGCTGCCG AGCGGCGCGC TGGTGGCGAG CGTCACCGCG
GGCAGCCCCT CGGCCCGCGC AGGATTGAAA CTCTCCGACC TGATCATTGC CATCGACGGC
TTCACCGTCG ACGACCCCAA CGCGTTCGAT TATCGCTTCG TCACCAGGCC GCTCGGCGGC
GCCGCGCAGG TCGACGTGCA GCGCGGCGGC AAGCTGGTCA AGCTGGCGAT CCCGCTGGAG
ACCGCGCCGG ACACCGGGCG CGACGAAATC GTGGTGAAGT CGCGCTCGCC GTTCCAGGGC
GCCAAGGTCG CCAACATCTC GCCGGCGCTG GCGGACGAAT TGCGGCTCGA TCCCAGCGTC
GAGGGCGTGG TGGTGATCGA TCTCGCCGAC GATGCGACGG CCGCCGGCGT CGGCTTCCAG
AAGGGCGACA TCATCATCGC CGTCAACAAC AAGAGGATCG CCCGCACCGC CGACCTCGAG
CGGATCGCCG CCGAACCCTC GCGGCTGTGG CGCATCACCG TGGTGCGCGG CGGCCAGCAG
ATCAACGTCA CGCTCGGCGG ATGA
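
The GC content field can be recomputed from the gene sequence. This is a minimal Python sketch, assuming GC content is defined as the fraction of G and C bases over the full sequence (the record does not say how its 68% figure was derived or rounded); `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(b in "GC" for b in bases) / len(bases)


# First three 10-base blocks of the gene sequence above (20 of 30 bases are G or C).
fragment = "ATGACTTCGA CACGCGCGCT CACCGCCCTG"
print(f"{gc_content(fragment):.1%}")  # 66.7%
```

Passing the full 1404-base sequence instead of the fragment gives a value that can be compared with the GC content field.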
 
Protein sequence
MTSTRALTAL VLALILSVPA QAQPQDRRVP SSPAELKLSY APIVQRVQPA VVNVYAAKVV 
QNRNPLLEDP IFRRFFGVPG QQPEQMQRSL GSGVMVDPSG LVVTNNHVIE GADQVKVSLS
DKREFEAEIV LKDSRTDLAV LRLKEVAGEK FPTLDFSNSD ELLVGDVVLA IGNPFGVGQT
VTHGIISALA RTQVGITDYQ FFIQTDAAIN PGNSGGALVD VNGRLVGINT AIFSRSGGSQ
GIGFAIPANM VRVVVASAKS GGKEVKRPWL GARLQAVTPE IAETLGLKLP SGALVASVTA
GSPSARAGLK LSDLIIAIDG FTVDDPNAFD YRFVTRPLGG AAQVDVQRGG KLVKLAIPLE
TAPDTGRDEI VVKSRSPFQG AKVANISPAL ADELRLDPSV EGVVVIDLAD DATAAGVGFQ
KGDIIIAVNN KRIARTADLE RIAAEPSRLW RITVVRGGQQ INVTLGG
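
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a minimal, dependency-free Python illustration, not the database's own method. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene sequence, and its final in-frame line.
print(translate("ATGACTTCGA CACGCGCGCT CACCGCCCTG"))  # MTSTRALTAL
print(translate("ATCAACGTCA CGCTCGGCGG ATGA"))        # INVTLGG
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above.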