Gene RPD_3156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3156 
Symbol 
ID4023661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3508837 
End bp3510231 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content66% 
IMG OID637963357 
Productpeptidase S1C, Do 
Protein accessionYP_570283 
Protein GI91977624 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.967436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.845897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCGA TCCGAACGCT GACCGCACTT TGTTTGTCGA TCGCGCTGGC GACGCCGGTC 
GCGGCGCAGG AGCGGCGGCT GCCGTCGTCG CTGGGCGAGG TCAAGCTGAG CTACGCGCCA
ATCGTCCAGC ACGCCCAGCC GGCCGTGGTG AACGTCTACG CCGCCAAGGT GGTGCAGAAC
CGCAATCCGC TGCTCGAAGA TCCGATCTTT CGCCGCTTCT TCGGCGGCGG CGGCCCGCAG
CCGGAGCAGA TCCAGCGCTC GCTCGGCTCT GGCGTGATGG TCGATCCGTC GGGTCTGGTC
GTCACCAACA ATCACGTCAT CGACGGCGCC GATCAGGTCA AGGTGGCGCT CGCCGACAAG
CGCGAGTTCG AAGCCGAGAT CGTGCTGAAG GACAGCCGCA CCGATCTGGC GGTGCTGCGG
CTCAAGGATA CCAAGGAAAA ATTCGCGACG CTCGAACTTT CGAATTCCGA TGATCTGCTG
GTCGGCGACC TCGTCCTCGC AATGGGCAAT CCGTTCGGCG TCGGTCAGAC CGTGACGCAC
GGCATCGTCT CGGCGCTGGC GCGCACCCAG GTCGGCATCA CCGACTATCA GTTCTTCGTT
CAGACCGACG CCGCGATCAA TCCCGGCAAT TCCGGCGGCG CGCTGGTCGA TATGACCGGC
AAGCTGGTCG GCATCAATAC CGCGATCTTC TCGCGCTCCG GCGGCTCGCA GGGCATCGGC
TTTGCGATCC CGGCCAACAT GGTGCGCGTG GTGATCGCCT CGGCCAAGGG CGGCGGCAAG
GCGGTGAAAC GGCCATGGCT CGGCGCGCGG CTGCAGGCGG TGACGCCGGA GATCGCCGAG
ACGCTCGGCC TGAAGCGGCC GAGTGGCGCG CTGGTGGCGA GCGTCACCAA GGGAAGTCCC
TCGGAGAAGG CCGGTTTGAA ACTGTCCGAT CTGATCGTCG CGGTCGACGG CTTCCCGATC
GATGATCCGA ATGCGTTCGA CTATCGCTTC GCGACACGGC CGCTCGGCGG CACCGCACAG
ATCGACGCGC AGCGCGCCGG CAAGCCGGTG AAGCTCACCA TCGCGCTCGA GACCGCGCCG
GACACCGGCC GCGACGAGAT CGTGCTGACC GCGCGCTCGC CGTTCCAGGG CGCCAAGATC
GCCAACATCT CGCCGGCGAT CGCCGACGAG ATGCGGCTCG ACCCGAGCGT CGAAGGCGTG
GTCGTCACGG AACTCGCCGA CGACGCCACC GCCGCGAATG TCGGTTTCCA GAAGGGCGAC
ATTATCGTCG CGGTCAACAA CAAGCGGATC GGCAAAACCA GCGACCTCGA GCGGATCACC
AACGAATCCG CGCGACTGTG GCGCATCACG CTGGTCCGCG GCGGCCAGCA GATCAACGTC
ACGCTCGGCG GATGA
 
Protein sequence
MISIRTLTAL CLSIALATPV AAQERRLPSS LGEVKLSYAP IVQHAQPAVV NVYAAKVVQN 
RNPLLEDPIF RRFFGGGGPQ PEQIQRSLGS GVMVDPSGLV VTNNHVIDGA DQVKVALADK
REFEAEIVLK DSRTDLAVLR LKDTKEKFAT LELSNSDDLL VGDLVLAMGN PFGVGQTVTH
GIVSALARTQ VGITDYQFFV QTDAAINPGN SGGALVDMTG KLVGINTAIF SRSGGSQGIG
FAIPANMVRV VIASAKGGGK AVKRPWLGAR LQAVTPEIAE TLGLKRPSGA LVASVTKGSP
SEKAGLKLSD LIVAVDGFPI DDPNAFDYRF ATRPLGGTAQ IDAQRAGKPV KLTIALETAP
DTGRDEIVLT ARSPFQGAKI ANISPAIADE MRLDPSVEGV VVTELADDAT AANVGFQKGD
IIVAVNNKRI GKTSDLERIT NESARLWRIT LVRGGQQINV TLGG