Gene RPB_1019 details

Gene Information

Locus tag: RPB_1019
Symbol:
ID: 3909143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1167546
End bp: 1169123
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 69%
IMG OID: 637882912
Product: peptidase S1C, Do
Protein accession: YP_484640
Protein GI: 86748144
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
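
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(1167546, 1169123)
print(length)                           # 1578
print(expected_protein_length(length))  # 525
```

Under these assumptions, the 1578 bp span corresponds to 526 codons, one of which is the stop codon, giving the 525 aa reported.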


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCAGA ACAAGACGGA TCGCAACGAC ATCGACGACA CCCCGAAATC CGAGGCGCAA 
CTACGCCGGA TCATGAGGCC GCGGCGCTTT GCTCTGCTGG CGTCCGCCGC AGCGCTGAGC
GCCGCGCTGG TCACCGGCGG ATATATGACG CACGAGTTTC CGTCGTTCGC GTCGCCCGCG
CTCGCGGCCG AAGCGGCTGC GCCGACCGGC ATGCCTTCCG GCTTCGGCGA TCTCGTCAGC
AAGGTGAAGC CGGCGGTGAT CTCGGTGCGC GTCAAGATCG ACGAGGACAG CCGCACCACG
CCGCTGATCC GCAACGACGA CAGCGACGAC AATGCCAATC CGGGGATGGG CGACGATCAG
TTGCGTCCGT TCCTCAAGCA GTTCGGCTTC CGCGGCGAAG GCGCCATGCC GAAGCGGCAC
GAGATGATCA CCGGCGAGGG CTCCGGCTTC TTCATCTCGC CGGACGGCTA CGCGGTCACC
AACTATCACG TCGTCGATCA CGCCAAGTCG GTGCAGGTCA CGACCGATGA CGGCGCCGTC
CACACCGCCA AGGTCGTCGG CACCGACAAG AAGACCGATC TGGCGCTGAT CAAGGTCGAC
GGCAAGAGCG ACTTCACTTA CGTGAAATTC GCCGATCAGC CGGCGCGGGT CGGCGACTGG
GTGGTGGCGG TCGGCAATCC GTTCGGCCTC GGCGGCACCG TCACGGCCGG CATCGTCTCG
GCGCGCGGCC GCGACATCGG CTCCGGTCCC TATGACGACT ACGTCCAGAT CGACGCGCCG
ATCAACAAGG GCAATTCCGG CGGCCCGGCG TTCGACACCA GCGGCAACGT CATCGGCGTC
AACACCGCGA TCTATTCGCC CTCCGGCGGC TCGGTCGGCA TCGGCTTCGA CATTCCGGCG
GCGACCGCCA AGCTCGTCAT CGCGCAGTTG AAGGACAAGG GCGTGGTGAC GCGCGGTTGG
CTCGGCGTGC AGGTGCAGCC GGTGACCGCC GAGATCGCCG ACAGCATGGG GCTGAAGCAG
GCCCGCGGCG CGCTGGTCGA CAGCCCGCAG GACGGCAGCC CGGCCGCCAA GGCGGGGATC
GCCGCCGGCG ACGTCATCAC CGCGGTCAAT GGCGCGGAGA TCAAGGACTC GCGCGCGCTG
GCGCGGACCA TCAGCATGAT GGCGCCGGGC AGCGCGGTGA AGCTCGACGT GCTGCACAAG
GGTGACAGCA AGACTGTGAC GCTCAGCCTG GCGGAGATGC CGAACGAGAG CGCGAAAGTC
GCCGACAGCG GCGATGCCGG GCGCGAGTCC GGCCGGCCCT ATCTCGGCCT GCGCGTCGCG
CCGGCCAGCG AGGTTGGCGA CTCCGGCCAG AAGGGCGTCG TCGTCACCGG CGTCGACCCC
GAAGGCCCGG CGGCCGATCG CGGGCTGCGC AGCGGCGACG TCATCCTCGA CGTCGGCGGC
AAGCCGGTCG CCAATTCCGG CGACGTCCGC GACGCGCTGA AGCAGGCCAG CGAGCACGGC
AAGAAGACCG TGCTGATGCG GGTCAAGACC GCGGATTCCG CCGCGCGCTT CGTCGCGGTG
CCGATCGCCA AAGGCTGA
 
Protein sequence
MNQNKTDRND IDDTPKSEAQ LRRIMRPRRF ALLASAAALS AALVTGGYMT HEFPSFASPA 
LAAEAAAPTG MPSGFGDLVS KVKPAVISVR VKIDEDSRTT PLIRNDDSDD NANPGMGDDQ
LRPFLKQFGF RGEGAMPKRH EMITGEGSGF FISPDGYAVT NYHVVDHAKS VQVTTDDGAV
HTAKVVGTDK KTDLALIKVD GKSDFTYVKF ADQPARVGDW VVAVGNPFGL GGTVTAGIVS
ARGRDIGSGP YDDYVQIDAP INKGNSGGPA FDTSGNVIGV NTAIYSPSGG SVGIGFDIPA
ATAKLVIAQL KDKGVVTRGW LGVQVQPVTA EIADSMGLKQ ARGALVDSPQ DGSPAAKAGI
AAGDVITAVN GAEIKDSRAL ARTISMMAPG SAVKLDVLHK GDSKTVTLSL AEMPNESAKV
ADSGDAGRES GRPYLGLRVA PASEVGDSGQ KGVVVTGVDP EGPAADRGLR SGDVILDVGG
KPVANSGDVR DALKQASEHG KKTVLMRVKT ADSAARFVAV PIAKG
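
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The sequences on this page are printed in space-separated blocks of ten, so whitespace is stripped first.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
print(translate("ATGAATCAGA ACAAGACGGA TCGCAACGAC"))  # MNQNKTDRND
# Last six codons of the gene sequence, ending in the TGA stop codon.
print(translate("CCGATCGCCAAAGGCTGA"))                # PIAKG
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` on the same input can be compared against the reported GC content.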