Gene RPD_3349 details


Gene Information

Locus tag: RPD_3349
Symbol:
ID: 4023860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3713026
End bp: 3714525
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 65%
IMG OID: 637963554
Product: peptidase S1C, Do
Protein accession: YP_570474
Protein GI: 91977815
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
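
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3713026
end_bp = 3714525

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is a stop codon
# and does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1500
print(protein_length)  # 499
```

Under that assumption, both results agree with the Gene Length (1500 bp) and Protein Length (499 aa) fields.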


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.181903
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCTG CGATCCCTGC CTTCAGCTTC CGGAACAGGC CCCTGCTGAC GGCGATCTGC 
CTCGGGGCAG CCGTTGCCTT GAGCGCGCCG GCGCCGTCCC ACGCCCGCGG TCCCGAGGGC
ATCGCGGACG TCGCCGAAAA GGTGATTGAC GCCGTGGTCA ACATCTCGAC CACCCAGACC
GTTGAAGCCA AGAATTCACC CGGCGAAGGG AAGGGCGCCA CGCCGAATCT GCCGCCGGGT
TCGCCGTTCG AGGAGTTCTT CGACGACTTC TTCAAGAACC GCCGTGGCGG CGAGAAGGGC
GGCGGGCCGC GCAAGACCAA TTCACTCGGC TCCGGCTTCA TCGTCGACAC CGCCGGCGTC
GCCGTGACCA ACAATCACGT CATTGCCGAC GCCGATGAGA TCAATCTCAT CATGAACGAC
GGCACTAAGA TCAAAGCCGA ACTGGTCGGC GTCGACAAGA AGACCGACCT CGCGGTGCTG
AAGTTCAAGC CGCCGGCGAA CAAGCCGCTC ACCGCCGTGA AGTTCGGCGA CTCCGACAAG
CTGCGGCTCG GCGAGTGGGT GGTGGCGATC GGCAACCCGT TCTCGCTCGG CGGCACAGTG
ACTGCCGGCA TCGTCTCGGC GCGCAACCGC GACATCAATT CCGGGCCTTA TGACAGCTAC
ATCCAGACCG ATGCTGCGAT CAATCGCGGC AATTCCGGAG GCCCGCTGTT CAACCTCAAC
GCCGAAGTCA TCGGCGTCAA CACGCTGATC ATCTCGCCGT CCGGCGGCTC GATCGGCATC
GGTTTCGCGG TGCCGTCGAA GACTGTGGTC GGTGTGGTCG ATCAACTCCG GCAGTTCGGC
GAGCTGCGCC GCGGTTGGCT CGGCGTGCGG ATCCAGCAGG TCACCGACGA AATCGCCGAG
AGCCTGAATA TCAAGCCGGC GCGCGGCGCG TTGGTCGCTG GCATCGACGA CAAGGGGCCG
GCGAAACCGG CCGGCATCGA GCCCGGCGAC GTGGTCGTCA AGTTCGACGG CAAGGACGTC
AAGGAGCCGA AGGATCTGTC TCGCGTGGTC GCCGACACAG CGGTCGGCAA GACCGTCGAC
GTGGTGATCA TCCGCAAGGG CAAGGAAGAG ACCAAGCAGG TCACGCTCGG CCGTCTCGAC
GACGGCGCCA AGCCGCAGCC GGCCTCGGCG AAGTCACAGC CTGAACCGGA AAAGCCGGTG
ACGCAGAAGG CGCTCGGGCT CGACCTCGCG GCGCTGTCGA AGGATCTGCG CGGCCGCTAC
AAGATCAAGG AAACCGTCAA GGGCGTGGTG GTGGTCGGCG TCGACAATGG CTCCGACGCC
GCCGAGAAGC GGCTGTCGGC CGGCGACGTG ATCGTCGAGG TCGCGCAGGA AGCGGTGACC
AGCGCCGCCG ACATCAAGAA GCGCGTCGAC CAGCTTAAGA AGGACGGCAA GAAGTCGGTC
CTGCTGCTGG TCGCCAATGG CGAGGGCGAG CTGCGCTTCG TGGCGCTCAG CCTGCAATAG
 
Protein sequence
MPAAIPAFSF RNRPLLTAIC LGAAVALSAP APSHARGPEG IADVAEKVID AVVNISTTQT 
VEAKNSPGEG KGATPNLPPG SPFEEFFDDF FKNRRGGEKG GGPRKTNSLG SGFIVDTAGV
AVTNNHVIAD ADEINLIMND GTKIKAELVG VDKKTDLAVL KFKPPANKPL TAVKFGDSDK
LRLGEWVVAI GNPFSLGGTV TAGIVSARNR DINSGPYDSY IQTDAAINRG NSGGPLFNLN
AEVIGVNTLI ISPSGGSIGI GFAVPSKTVV GVVDQLRQFG ELRRGWLGVR IQQVTDEIAE
SLNIKPARGA LVAGIDDKGP AKPAGIEPGD VVVKFDGKDV KEPKDLSRVV ADTAVGKTVD
VVIIRKGKEE TKQVTLGRLD DGAKPQPASA KSQPEPEKPV TQKALGLDLA ALSKDLRGRY
KIKETVKGVV VVGVDNGSDA AEKRLSAGDV IVEVAQEAVT SAADIKKRVD QLKKDGKKSV
LLLVANGEGE LRFVALSLQ
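
The sequences above are printed in space-separated blocks of ten characters. A minimal Python sketch of how such blocks might be cleaned, translated, and checked for GC content follows. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard one, whose codon-to-residue assignments are shared by translation table 11; table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown in this record.
first_block = "ATGCCCGCTG CGATCCCTGC CTTCAGCTTC CGGAACAGGC CCCTGCTGAC GGCGATCTGC"
print(translate(first_block))  # MPAAIPAFSFRNRPLLTAIC
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein Length fields.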