Gene RPB_0552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0552 
Symbol 
ID3909591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp621810 
End bp623303 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content68% 
IMG OID637882440 
Productpeptidase M48, Ste24p 
Protein accessionYP_484174 
Protein GI86747678 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.125277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0387817 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGCAT CCGAGGGACG GCGCAGCATC TGGAGAAGCC GCCGCTGGCT GGCTGTGCCT 
GTCCTGGCGG GCGCGCTGGC GCTGGCCGGC TGCGGCGATT TTCGCCGCTT CGAGACCGCG
TCGATTCCCT CCAGCACGCC GGCGGCAAAG CCGTCGCGTC CGGCGGCGCA ATCGCCGGCG
GCCGAGCGCG AGCACGAGCG CATCCTCGCC ACCTATGGCG GCGCCTATGA CGATCCGAGG
CTCGAAGCGC TGATCACCGC GACGGTCGAT CGGCTGGTGG CGGCGTCCGA CCGTCCCGAC
CTCACCTACA AGGTGACGAT CCTGAATTCC GGCGCCGTCA ACGCCTTCGC GCTGCCGACC
GGGCAGCTCT ACGTCACCCG CGGGCTGGTC GCGCTCGCCA GCGACACCTC GGAACTGTCG
TCGGTGCTGT CGCACGAGAT GGCGCATGTG CTGGCCAAGC ACGCCGCGAT CCGGGAGGAC
CAGGCGCGCC AGGCGGCGCT GGTCACCCGC GTCGTCACCG ACATGGGCAC CGATCCGGAG
ATGACCGCGC TGGCGCTGGC CAAGACCAAG CTGTCGATGG CGAGCTTCTC GCGGCAGCAG
GAGCTCGAGG CCGACGGCAT CGGCGTCGGC ATCTCGGCGC GCGCCCAGTT CGATCCGTTC
GGAGCTTCGC GTTTCCTCAC CGCGATGGAG CGCAACGCGG CGCTGAAGGC GGGCCGCGGC
GATGCGCGCT CGCAGGACTT CCTGGCGTCG CACCCGGCGA CGCCGGAGCG GGTGCGCAAC
GCGCAGAACA ACGCCCGGCA ATACGCCTCG CCGGAGCAGA CCGCCAAGGG CGAGCGCGAC
CGTGAGACCT ATCTCAACGC CATCGACAAC ATCGTCTATG GCGAGGACCC GAGCGAGGGC
TTCGTCCGCG GCCGCCGCTT CCTGCATCCC AAGCTCGGCT TCACCTTCCA GGTGCCGGAG
AGCTTCACGC TCGACAACAC CGCGCAGGCG GTGATCGGCA TCCGCGAAGG CGGCAGCCAG
GCGATGCGGT TCGACGTGGT GCGGGTGCCG GCGGAACAGT CGCTCGGCGA CTACCTCAAT
TCCGGCTGGA TGGAGAACGT CGACAAGAGT TCGACCGAAG AACTAAGCAT CAACGGCTTT
CCGACCGCCT CGGTGGCGGC GCGCGGCGAT CAGTGGCAGT TCAAGGTCTA TGCGTTGCGG
TTCGGCAGCG ACGTCTATCG CTTCATCTTC GCGACCCGGC AGAAATCGGC CGAAAGCGAC
CGCAATTCGC GCGACACCGT GAATTCGTTC CGACGTCTGA CGCTCGACGA GATCCAGGCG
GCGCGGCCGT TGCGGATCAA GGTGATCACC GTACAGCCGG GCGACACGGT GGAATCGCTG
TCGCACCGGA TGTCCGGCGT CGACCGCCCG CTCGACCGCT TCCGGGTGCT GAACGGCCTC
GACGCCAACG CCACCGTGAA GCCGCGCGAT CTGGTCAAGA TCGTGGTGGA TTAA
 
Protein sequence
MIASEGRRSI WRSRRWLAVP VLAGALALAG CGDFRRFETA SIPSSTPAAK PSRPAAQSPA 
AEREHERILA TYGGAYDDPR LEALITATVD RLVAASDRPD LTYKVTILNS GAVNAFALPT
GQLYVTRGLV ALASDTSELS SVLSHEMAHV LAKHAAIRED QARQAALVTR VVTDMGTDPE
MTALALAKTK LSMASFSRQQ ELEADGIGVG ISARAQFDPF GASRFLTAME RNAALKAGRG
DARSQDFLAS HPATPERVRN AQNNARQYAS PEQTAKGERD RETYLNAIDN IVYGEDPSEG
FVRGRRFLHP KLGFTFQVPE SFTLDNTAQA VIGIREGGSQ AMRFDVVRVP AEQSLGDYLN
SGWMENVDKS STEELSINGF PTASVAARGD QWQFKVYALR FGSDVYRFIF ATRQKSAESD
RNSRDTVNSF RRLTLDEIQA ARPLRIKVIT VQPGDTVESL SHRMSGVDRP LDRFRVLNGL
DANATVKPRD LVKIVVD