Gene Rsph17029_2591 details


Gene Information

Locus tag: Rsph17029_2591
Symbol:
ID: 4897635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2728318
End bp: 2729664
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 68%
IMG OID: 640113190
Product: carboxyl-terminal protease
Protein accession: YP_001044465
Protein GI: 126463351
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
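
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
START_BP = 2728318
END_BP = 2729664


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end inclusive."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1347, matching "Gene Length"
print(protein_length(length_bp))  # 448, matching "Protein Length"
```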


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.837496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.635906
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAAAT ACATGATGGC CGCGCTCGGC GGCACGGTGG CCGGCGTCCT GCTGGCGACC 
CAGGTGGCGG GGCCCCTCAT CGCGCAGGAA CAGCAGCGGT CGAAATCGGT CTACGAGCAG
CTCGACCTGT TCGGCGACAT CTTCGAGCGC ATCCGCGCGC AGTATGTCGA GGAAGTGGAG
ACCGACAAGC TGATCGAGGC CGCGATCAAC GGGATGCTGA CCTCGCTCGA TCCGCATTCG
AGCTACCTGC CGCCCGACGA TTTCGACGAC ATGCAGGTAC AGACCCGCGG CGAGTTCGGC
GGCCTCGGCA TCGAGGTCAC GCAGGAGGAA GGCTTCGTCA AGGTGGTCTC GCCGATGGAC
GGCACGCCCG CGGATGCGGC CGGCATCCAG TCCGGCGACT TCATCACCCA TGTGAACGGC
GAATCCGTTC TGGGCCTGAC GCTCGATCAG GCGGTGGACA TGATGCGCGG GCCGGTCGGC
TCCGAGATCC TCATCACGGT GGTGCGCGAG GGCACCCCCG AGCCCTTCGA CGTCTCGATC
GTCCGCGACA CGATCAAGCT CGTGGCCGCC CGCAGCCGCG TGGTGGGCAA CACGGTCGTC
GTGCGGCTGA CCACTTTCAA TGACCAGACC TTCTCGGGCC TGAAGGAGGG TCTGGAGAAG
GAGATCAAGG CGCTCGGCGG CGAGGACAAG ATCAACGGCG TGGTGCTCGA CCTGCGCAAC
AACCCCGGCG GGCTCCTGAC CCAGGCGATC CAGGTCTCGG ACGCCTTCCT CGACAAGGGC
GAGATCGTCT CGACCCGCGG CCGCGCCGCG GGCGACGGCG AGCGGTTCAA CGCGACGCCG
GGCGATCTGA TCGACGGCAA GCCCATGGTC GTGCTCATCA ACGGCGGGTC GGCCTCGGCC
TCGGAGATCG TGGCGGGCGC GCTGCAGGAC CATCGCCGCG CCATCGTCGT GGGCACCAAG
AGCTTTGGCA AGGGATCGGT CCAGACCGTG ATCCCGCTGC GGGGCGAGGG GGCCATGCGG
CTGACCACGG CGCGCTACTA CACGCCCTCG GGCCGCTCGA TCCAGGCGCT CGGCGTGGCG
CCCGACATCG TGGTGAACCA GCCGCCCGCG AAGCCCGCCG TCCCCGAGGA GGAGGAGACG
CCCGCGACCA GCGCCGCCCG CAACCGGTCG GAGGCCGACC TGCGCGGCGT CCTGTCGAAC
GATTCGATGA CCGAGGACGA GAAGAAGCAG CTCGAGGCCG ATCGCGCCCG GGCCGAGGAA
TCCGCGAAGC TCCGCGACGA GGATTACCAG CTCGCCTATG CGGTGGACAT CCTCAAGGGC
CTCTCGGCCA TCGAAGTGAA GCCCTGA
 
Protein sequence
MRKYMMAALG GTVAGVLLAT QVAGPLIAQE QQRSKSVYEQ LDLFGDIFER IRAQYVEEVE 
TDKLIEAAIN GMLTSLDPHS SYLPPDDFDD MQVQTRGEFG GLGIEVTQEE GFVKVVSPMD
GTPADAAGIQ SGDFITHVNG ESVLGLTLDQ AVDMMRGPVG SEILITVVRE GTPEPFDVSI
VRDTIKLVAA RSRVVGNTVV VRLTTFNDQT FSGLKEGLEK EIKALGGEDK INGVVLDLRN
NPGGLLTQAI QVSDAFLDKG EIVSTRGRAA GDGERFNATP GDLIDGKPMV VLINGGSASA
SEIVAGALQD HRRAIVVGTK SFGKGSVQTV IPLRGEGAMR LTTARYYTPS GRSIQALGVA
PDIVVNQPPA KPAVPEEEET PATSAARNRS EADLRGVLSN DSMTEDEKKQ LEADRARAEE
SAKLRDEDYQ LAYAVDILKG LSAIEVKP
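
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch of how the protein sequence could be re-derived from the gene sequence and how a GC fraction could be computed. It assumes the sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (amino-acid assignments
# are the same for the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAGGAAAT ACATGATGGC CGCGCTCGGC"
print(translate(first_blocks))  # MRKYMMAALG, the first block of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the reported GC content.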