Gene Rsph17025_2983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2983 
Symbol 
ID5084583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3049532 
End bp3050881 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content68% 
IMG OID640484555 
Productcarboxyl-terminal protease 
Protein accessionYP_001169174 
Protein GI146279015 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.44239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.186198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAAT ACATGATGGC CGCGCTGGGC GGCACTGTCG CCGGCGTGCT GATGGCGACC 
CAGGTGGCCG GGCCCCTCAT CGCGCAGGAG CAGCAGCGGT CGAAGACGGT TTATGAACAG
CTCGACCTGT TCGGCGACAT CTTCGAGCGC ATCCGCGCGC AATATGTCGA GGAAGTCGAG
ACCGACAAGC TGATCGAGGC GGCGATCAAC GGGATGCTGA CCTCGCTCGA CCCCCATTCC
AGCTATCTGC CGCCCGATGA TTTCAACGAC ATGCAGGTGC AGACCCGCGG CGAGTTCGGC
GGGCTCGGCA TCGAGGTCAC GCAGGAAGAG GGCTTCGTCA AGGTCGTCTC GCCGATGGAC
GGCACGCCCG CCGATGCGGC CGGGATCGAG GCCGGCGACT TCATCACCCA TGTGAACGGC
GAGACGGTGT TGGGCCTCAC GCTCGATCAG GCGGTGGACA TGATGCGCGG GCCGGTGGGT
TCGGAAATCA TCATCACCGT GGTGCGCGAG GGCACGGCCG AGCCGTTCGA CGTCTCGATC
ATCCGCGACA CGATCAAGCT GGTCGCGGCC CGCAGCCGGG TGGTGGGCAA TACCGTCGTG
GTGCGCCTGA CCACCTTCAA CGACCAGACC TTCTCGGGCC TCAAGGAGGG GCTGGAGAGC
GGTGCCAAGG AACTCGGCGG GCTCGACAAG GTCAACGGGA TCGTCCTCGA CCTGCGCAAC
AACCCGGGCG GGCTGCTCAC GCAGGCGATC CAGGTCTCGG ACGCCTTCCT CGACAAGGGC
GAGATCGTCT CGACGCGGGG CCGCGCCGCG GGCGACGGCG AGCGGTTCAA CGCGACGGCC
GGTGACCTGA TCGGCGGCAA GCCGATGGTC GTGCTGATCA ACGGCGGGTC GGCCTCGGCA
TCGGAGATCG TGGCGGGCGC GCTGCAGGAT CACCGCCGCG CCATCGTCGT CGGCACCAAG
AGCTTCGGCA AGGGATCGGT CCAGACGGTG ATCCCGCTGC GCGGCGAGGG CGCGATGCGG
CTGACCACCG CGCGCTACTA CACACCGTCG GGCCGCTCGA TCCAGGCGCT GGGCGTGGCG
CCGGACATCG TGGTGAACCA GCCGCCGAGC CGGCCCGCCG GGACCGAGGA AGAGGACGCC
GCCGCGCCGG GCCCGGCGGC CCGCAACCGC TCGGAGGCGG ATCTGCGGGG CGTGCTCTCG
AACGATTCGA TGAGCGATGA CGAAAGGAAG CAGCTCGAAG CCGAGCGGGC GCGCGCCGAA
GAGGCGGCCA AGCTGCGGGA CGAGGATTAC CAGCTGGCCT ATGCGGTCGA TATCCTCAAG
GGCCTCGCCG CCATCGAAGT GAAACCGTGA
 
Protein sequence
MRKYMMAALG GTVAGVLMAT QVAGPLIAQE QQRSKTVYEQ LDLFGDIFER IRAQYVEEVE 
TDKLIEAAIN GMLTSLDPHS SYLPPDDFND MQVQTRGEFG GLGIEVTQEE GFVKVVSPMD
GTPADAAGIE AGDFITHVNG ETVLGLTLDQ AVDMMRGPVG SEIIITVVRE GTAEPFDVSI
IRDTIKLVAA RSRVVGNTVV VRLTTFNDQT FSGLKEGLES GAKELGGLDK VNGIVLDLRN
NPGGLLTQAI QVSDAFLDKG EIVSTRGRAA GDGERFNATA GDLIGGKPMV VLINGGSASA
SEIVAGALQD HRRAIVVGTK SFGKGSVQTV IPLRGEGAMR LTTARYYTPS GRSIQALGVA
PDIVVNQPPS RPAGTEEEDA AAPGPAARNR SEADLRGVLS NDSMSDDERK QLEAERARAE
EAAKLRDEDY QLAYAVDILK GLAAIEVKP