Gene RPB_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3547 
Symbol 
ID3911349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4059770 
End bp4061440 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content70% 
IMG OID637885449 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_487153 
Protein GI86750657 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.777163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATC GAGCATGGCA AAGGCCGGTA TGGATTGCGG CGGTGGCGGC GGCGACGCTG 
GTGATGACGG CGGGGGTCGC ATTTTCCCAA TCGCTGGCGC CTCCACCTCT GACCAACCTT
CCGCCAGTCA CCACGGCGCC GACCATCACG CCGTCGGTCG GCCCGACCAT TCCCGCGCTA
CGCCCGCCAG GGCCGGACGT GCTGCCGTCG GTCGAGCGTG AGTATCGGCG GCTGCGCGAT
GCGCCGCTGC CGCCCTCGCG CTGCAACTAT CTCGACAATG GCCGAACCTG CGCCAAGCTG
TGGGTGATCG GCGATTGGCG GTCCGTGACC AAGCACAAGC GCGGCAAGGC AAAGGCCGAT
CGCGTCCGCC GGGCTCCGCC GCAAATCGTC GCCGCGCGCG GGCAGCGCCC GGCGGTCGCC
GCTCGCGACC ATGTTCCCGG CGAAGTGCTG ATCGAATTCG ACGGCGGTCT GTCCGAGCAG
CAACTCCGTG CGCTGGCGCG ACGGCACCGG CTCACGCGCG TCAGCGTTGA CACCGTCGCA
CTGGTCGGGA CCCGCATCGG ACTGTTTCGG ATCAACGATC GCCGCTCGCC GCAAACGGTC
GCACGCGCAC TCGCCGCTGA CGGTCGGATA CGCGCCGCGC AGGCCAATTT CGTCTACACG
CTGCAGAGCG ATGCCAAGCC CGCTGCCGCC GACAATTCGA TGCTGTATCC GCACGGCCGG
CTGCGGCTGG CCGAGGCGCA TGCGCTGGCG CGCGGACGCG GCATTGTCGT TGCGGTGATC
GATTCCGGCG TCGACATCGC CCATCCGGAA CTCGTCGGTG CGCTCGCCGG TTCGTTCGAC
CCGCTCGGCA GCAAGGAGAA GCCGCATCAG CACGGCACCG GCGTCGCCGG GGCGATCGTC
GCGCGCGCCA AACTCACCGG TGGCGCACCG GAGGCGAAAA TACTGGCGAT CCGGGCGTTC
GGCGAGGGCA AGACCGGAAG CCAGAGCAGC ACGTCTTATC TGATCCTCAA AAGCCTCGAT
ATCGCCCTCG GTCATGGCGC CCGAATCGTC AATATGAGCT TTGCCGGATC GCAGGACCCG
CTGATTGCGC GCGGCGTCGC GGCTGCTGCA GCGCGGGGGA TCGTGATGAT TGCCGCCGCC
GGCAATGCCG GGCCGAAGTC CCCGCCATTG TATCCTGCGG CGCTGCCGGG CGTGATCGCG
GTCAGCGCGA CGAGTCCGGG CGACACGTTG TTTGCGGCAT CGAACCGTGG CCCGCAAATC
GCCGTGGCAG CTCCCGGCGT CGACGTCCTG CTGCCGGCGC CCGACGGCAA ATACGAGGTG
ATGACGGGGA CGTCGTTCTC TGCAGCATTC GTCAGCGGTA TCGCCGCGCT GATGATCGAG
CGCAATCCCA CGCTCGTGCC CGATCAGGTG CGGGCCGTTC TGAGCCAGAC CGCCCGCGAT
CTTGGAGCGC CGGGCCGTGA CGATCTGTTC GGGGCCGGCG AAGCCGATGC ATTTGCTGCC
CTGTCGCGGG TGGACAGCCT GGCGGCCCCG CTGGTTGCGG CGCCCGCGAC CGCGCCGCAG
CCCTCTGTCG CGACAGCGCA ACCTGCGACG CCCGGCGCTG CGTTGGCAGC GCCCGCCGGG
GCAGAGCAGG AGGTCGGCCC GGCGCCGGCC GCCGCGATCC CGGCCCGCTG A
 
Protein sequence
MSDRAWQRPV WIAAVAAATL VMTAGVAFSQ SLAPPPLTNL PPVTTAPTIT PSVGPTIPAL 
RPPGPDVLPS VEREYRRLRD APLPPSRCNY LDNGRTCAKL WVIGDWRSVT KHKRGKAKAD
RVRRAPPQIV AARGQRPAVA ARDHVPGEVL IEFDGGLSEQ QLRALARRHR LTRVSVDTVA
LVGTRIGLFR INDRRSPQTV ARALAADGRI RAAQANFVYT LQSDAKPAAA DNSMLYPHGR
LRLAEAHALA RGRGIVVAVI DSGVDIAHPE LVGALAGSFD PLGSKEKPHQ HGTGVAGAIV
ARAKLTGGAP EAKILAIRAF GEGKTGSQSS TSYLILKSLD IALGHGARIV NMSFAGSQDP
LIARGVAAAA ARGIVMIAAA GNAGPKSPPL YPAALPGVIA VSATSPGDTL FAASNRGPQI
AVAAPGVDVL LPAPDGKYEV MTGTSFSAAF VSGIAALMIE RNPTLVPDQV RAVLSQTARD
LGAPGRDDLF GAGEADAFAA LSRVDSLAAP LVAAPATAPQ PSVATAQPAT PGAALAAPAG
AEQEVGPAPA AAIPAR