Gene RPD_2011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2011 
Symbol 
ID4022493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2249476 
End bp2251056 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content67% 
IMG OID637962204 
Productpeptidase S1C, Do 
Protein accessionYP_569147 
Protein GI91976488 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.963752 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC GAAATACCGA TCTTCCCTCG CAGCCGTCCC GCCAGCCGGA TCGCCGATCG 
CTGCTGTCAG CGCGCAAATT CGCGCTGATG GCCTCGGTGG TCGCCGGTCT CGGCGCGGGC
GCGTTCGGCC TGTCGCAGAC CTCGACGCCG GTCGATCTGT TCAGCACGCC GGCGCATGCG
CAGGTCGGCA ACAAGGTCAA CCAGGCTCAG CAGCCGGTGG GCTTCGCCGA TATCGTCGAG
AAGGTGAAGC CGTCGGTGAT CTCGGTGAAG GTCAACATCG CCGAGAAGAC GGCGAAGAAC
GAGGATCGCG CCGAAGACTC GCCGTTTCAG CCGGGCTCCC CGATGGAGCG CTTCTTCCGC
CGCTTCGGCG GTGAGATGCC CCCCGGTATG CGCGGCCATC GCGGCGGCGG CACGATGACC
GGGCAGGGCT CGGGCTTCTT CATCTCGGCT GACGGCTACG CCGTGACCAA CAATCACGTC
GTCGACGGCG CCGACAAGGT CGAGGTCACC ACCGACGACG GCAAGACCTA CAAGGCCAAG
GTCATCGGCA CCGACCAGCG TACCGATCTG GCGCTGATCA AGGCCGAGGG CCGCACCGAC
TTCCCGTTCG CCAAGCTGTC CGAGGGCAAG CCGCGGATCG GTGATTGGGT GCTCGCGGTC
GGCAATCCGT TCGGCCTCGG CGGCACCGTC ACCGCCGGCA TCGTCTCGGC CTCCGGCCGC
GACATCGGCA ATGGTCCGTA CGACGATTTC ATCCAGATCG ACGCGCCGGT GAACAAGGGC
AATTCCGGTG GCCCGGCGTT CGACACCAAC GGCGAAGTGA TGGGCGTCAA CACCGCGATC
TACTCGCCGT CCGGCGGCAG CGTCGGCATC GCGTTCTCGA TCCCCGCCAG CACCGTCAAG
ACGGTGGTGC AGCAGCTCAA GGACAAGGGT TCGGTGAGCC GCGGCTGGAT TGGCGTGCAG
ATCCAGCCGG TCACGCCGGA GATCGCCGAC AGCCTCGGGC TGAAGAAGCC CGACGGCGCG
CTGGTCGCCG AGCCGCAGCC CAATGGTCCG GCGGCGAAGG CGGGCATCGA ATCCGGCGAC
GTCATCACCG CGGTCAACGG CACGCCGGTG AAGGACGCGC GCGAACTCGC CCGAACCATC
GGCGGCTTCG CGCCGGGCAA TACGGTGAAG CTCACCGTGG TGCACAAGGG CGCGGACCGC
GAGCTCAACC TGACGCTCGG CCAATTGCCG AACCAGGTCG AGGCCAAGGT CAATGATGGT
GGCGACAACG GCAACAGTTC CAGCCGAGGC ACAGAGGTGC CGAGGCTCGG CCTGACGGTC
GCGCCGGCCA GCTCGGTCGC AGGCGCCGGC AAGGACGGCG TCGTCGTCAC CGACGTCGAT
CCCAAGAGCG CCGCAGCCGA CCGCGGCTTC AAGGAAGGCG ATGTGATTCT CGAGGTCGCG
GGCAAGAACG TGGCCAGCCC CGGCGATGTT CGCGAGGCGA TCAACACCGC CAAGGCCGAC
AACAAGAACA GCGTGCTGAT CCGGGTTCGC TCGGGTGGTT CGTCGCGTTT CGTCGCGGTG
CCGATCTCCG CCAAGGGCTG A
 
Protein sequence
MTDRNTDLPS QPSRQPDRRS LLSARKFALM ASVVAGLGAG AFGLSQTSTP VDLFSTPAHA 
QVGNKVNQAQ QPVGFADIVE KVKPSVISVK VNIAEKTAKN EDRAEDSPFQ PGSPMERFFR
RFGGEMPPGM RGHRGGGTMT GQGSGFFISA DGYAVTNNHV VDGADKVEVT TDDGKTYKAK
VIGTDQRTDL ALIKAEGRTD FPFAKLSEGK PRIGDWVLAV GNPFGLGGTV TAGIVSASGR
DIGNGPYDDF IQIDAPVNKG NSGGPAFDTN GEVMGVNTAI YSPSGGSVGI AFSIPASTVK
TVVQQLKDKG SVSRGWIGVQ IQPVTPEIAD SLGLKKPDGA LVAEPQPNGP AAKAGIESGD
VITAVNGTPV KDARELARTI GGFAPGNTVK LTVVHKGADR ELNLTLGQLP NQVEAKVNDG
GDNGNSSSRG TEVPRLGLTV APASSVAGAG KDGVVVTDVD PKSAAADRGF KEGDVILEVA
GKNVASPGDV REAINTAKAD NKNSVLIRVR SGGSSRFVAV PISAKG