Gene RPB_3013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3013 
Symbol 
ID3910812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3436452 
End bp3437879 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content67% 
IMG OID637884919 
Productpeptidase M48, Ste24p 
Protein accessionYP_486626 
Protein GI86750130 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0874514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.639267 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATG TGATGATTGA AACTCTGCCC GGAGGATTGC GCCGGAAGGC CTGCGCGCAG 
GCGTCGCGGC TGGTGGCGAT CCTGAGCGCC GCCGCGCTGG CGCTCGCCCC GGTTCCCGGC
CTCGCGCAGG CGCCGCAGCC GAAGGGGCCG CCGCTGCTGC GCGACACCGA GATCGAGAAT
CTGCTGCGCG ACTATACGCG ACCGATCCTG CGCGTCGCCG GCCTCGAAAA GCAGAACATC
CAGATCGCCA TCATCAACGA TCCGAATTTC AACGCCTTCG TCGCCGACGG CCGCCGCATC
TTCGTCAATT ACGGTGCGCT GATGCAGTCG CAAACCCCGA ACCAGTTGAT CGGCGTGCTG
GCGCACGAGA CCGGCCATCT CGCCGGCGGC CATCTGTCCA AGCTCCGCAC CCAGCTCGCG
CAAGCACAGA CGCAGATGAT CGTGGCGATG CTGCTCGGCG TCGGCGCGAT GGTGGCGGGC
TCCAAGGCCG GCCCGAACAG CGGCGCCGGC AATATCGGTG CGGCCGCGAT CTCGGCGCCG
CAGGAATTGA TCCGGCGCAA TCTGCTGTCC TATCAGCGGC AGCAGGAAGA GAACGCCGAC
AAGGCCGCAG TGAAATTTCT CGACGCCACC GGCCAGTCGG CGAAGGGCAT GTACGAAACG
TTCCGTCGTT TCACCGACGA GAGCCTGTTC GCCGCGCGCG GCGCCGATCC TTATGCGCAG
TCGCATCCGA TGCCGGCCGA ACGCGTCCGC GCGCTGGAAG AGCTGGCGCG CTCCAGCCCG
AATTGGGACA AGAAGGACGA CGCCGCACTG CAGCTCCGCC ACGACATGAT GCGTGCCAAG
ACGTCCGGCT TCATGGAGCG TCCCGACACC GTTTACCGGC GCTATCCGTC GTCGAACACC
AGCCTGCCCG CGCGCTACGC CCGCGCCATC TCGACCTATC TGCACGGCGA TCCGCGCTCG
GCGCTGGCCC AGATCGACGG CCTGATTCAG GCCGAGCCGA ACAATCCCTA TTTCTACGAG
TTGCGCGGCC AGGCGCTGCT CGAGGGCGGT CGGCCGCAGG AAGCGATCGC GCCACTGCGC
AAGGCGTTGT CGCTCAGCCG CAGCGCCCCG CTGATCGAGA TGCTGCTCGG CCAGGCGCTG
GTGGCCTCGG GCAGCGCCGC CTCGACCGAA GAGGCGATCC GGATTCTGAA GTCGGCGCTG
TCGCGCGAGG CTGAAGCGCC GCTCGGCTAC AGCCAACTCG CGATGGCCTA TGGCCGCAAG
GGCGACTACG CCGAAGCCGA TCTCGCGTCG GCCCAGGCGG CCTTTCTGCG CGGCGACAAC
AAGACCGCGC GCGCACTCGC GGCGCGCGCC AAGACCCGCT TCCCGGTCGG CTCGCCGGGC
TGGGTCAAGG CGGACGATAT CGTCGAAGCG AAATCAACAT CCAAATAG
 
Protein sequence
MPNVMIETLP GGLRRKACAQ ASRLVAILSA AALALAPVPG LAQAPQPKGP PLLRDTEIEN 
LLRDYTRPIL RVAGLEKQNI QIAIINDPNF NAFVADGRRI FVNYGALMQS QTPNQLIGVL
AHETGHLAGG HLSKLRTQLA QAQTQMIVAM LLGVGAMVAG SKAGPNSGAG NIGAAAISAP
QELIRRNLLS YQRQQEENAD KAAVKFLDAT GQSAKGMYET FRRFTDESLF AARGADPYAQ
SHPMPAERVR ALEELARSSP NWDKKDDAAL QLRHDMMRAK TSGFMERPDT VYRRYPSSNT
SLPARYARAI STYLHGDPRS ALAQIDGLIQ AEPNNPYFYE LRGQALLEGG RPQEAIAPLR
KALSLSRSAP LIEMLLGQAL VASGSAASTE EAIRILKSAL SREAEAPLGY SQLAMAYGRK
GDYAEADLAS AQAAFLRGDN KTARALAARA KTRFPVGSPG WVKADDIVEA KSTSK