Gene RPB_2041 details

Gene Information

Locus tag: RPB_2041
Symbol:
ID: 3909856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2320339
End bp: 2321835
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 67%
IMG OID: 637883934
Product: peptidase S1C, Do
Protein accession: YP_485659
Protein GI: 86749163
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
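
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2320339
end_bp = 2321835

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # gene length in bp
print(protein_length)  # protein length in aa
```

Under these assumptions the computed values agree with the listed gene length (1497 bp) and protein length (498 aa).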


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTTG CGATCACCGC CTTGAGCTTC CGGTTCAAGC CTCTGCTGAC GGCTTTGTGC 
CTCGGCGCGG CTGCCGCCTT GACCGCGGCG CCGGCGCAGG CCCGCGGCCC CGAGGGCATC
GCGGACGTCG CCGAGAAGGT CATCGACGCC GTGGTCAACA TCTCGACCAG CCAGACCGTC
GAGGCCAAGA GTGCGCCCAG CGAGGGCAAC AGCGCCAAGC CGAATCTGCC GCCGGGGTCG
CCCTTCGAGG AGTTTTTCGA GGACTTCTTC AAGAACCGCC GCGGCGAGAA GGGCGGTGGC
GGCCCGCGCA AGACCAACTC GCTCGGCTCG GGCTTCATCG TCGACACCGC CGGCATTGCC
GTGACCAACA ATCACGTCAT CGCCGACGCC GACGAGATCA ACCTGATCAT GAACGACGGC
ACCAAAATCA AGGCGGAGCT GGTCGGCGTC GACAAGAAGA CCGATCTGGC GGTGCTGAAG
TTCAAGCCGC CGGCGAACAA GCCGCTGGTG GCGGTGAAGT TCGGCGACAG TGACAAGCTG
CGGCTCGGCG AATGGGTGGT GGCGATCGGC AACCCGTTCT CGCTCGGCGG CACGGTCACC
GCCGGCATCG TCTCGGCGCG CAACCGCGAC ATCAATTCGG GGCCGTATGA CAGCTACATC
CAGACCGACG CCGCGATCAA TCGCGGCAAT TCCGGCGGCC CGCTGTTCAA CCTCGACGGC
GAAGTCATCG GCGTCAACAC GCTGATCATC TCGCCGTCCG GCGGCTCGAT CGGCATCGGA
TTCGCGGTGC CTTCGAAGAC CGTGGTCGGG GTGGTCGATC AGCTCCGCCA GTTCGGCGAG
CTGCGCCGCG GCTGGCTCGG CGTGCGGATC CAGCAGGTCA CCGACGAGAT CGCCGAAAGC
CTCAACATCA AGCCGGCGCG CGGCGCGCTG GTCGCCGGCA TCGACGACAA GGGCCCGGCC
AAGCCCGCCG GCATCGAGCC CGGCGACGTC GTCGTCAAGT TCGACGGCAA GGACGTCAAG
GAGCCGAAGG ATCTGTCGCG CGTGGTCGCC GACACGGCGG TCGGCAAGAC CGTCGACGTG
GTGATCATCC GCAAGGGCAA GGAAGAGACC AAGCAGGTCA CGCTCGGCCG CCTCGACGAC
GGCGCCAAGC CGCAGCCGGC CTCCGCGAAG TCGCAGCCGG AGCCGGAAAA GCCGGTGACA
CAGAAGGCGC TCGGGCTCGA CCTCGCCGCG CTGTCGAAGG ACCTGCGCGG CAAGTACAAG
ATCAAGGACA GCGTCAAGGG CGTCGTCGTG GTCGGCGTCG ACACCGGCTC CGATGCCGCC
GAGAAGCGGC TGTCGGCCGG CGACGTGATC GTCGAAGTGG CGCAGGAAGC GGTCACCAGC
GCCGCCGATA TCAAGAAGCG GATCGATCAG GTCAAGAAGG ACGGCAAGAA GTCGGTGCTG
CTGCTGGTTT CGAACGGAGC CGGCGAACTG CGCTTCGTGG CGCTCAGCCT GCAATAG
 
Protein sequence
MPVAITALSF RFKPLLTALC LGAAAALTAA PAQARGPEGI ADVAEKVIDA VVNISTSQTV 
EAKSAPSEGN SAKPNLPPGS PFEEFFEDFF KNRRGEKGGG GPRKTNSLGS GFIVDTAGIA
VTNNHVIADA DEINLIMNDG TKIKAELVGV DKKTDLAVLK FKPPANKPLV AVKFGDSDKL
RLGEWVVAIG NPFSLGGTVT AGIVSARNRD INSGPYDSYI QTDAAINRGN SGGPLFNLDG
EVIGVNTLII SPSGGSIGIG FAVPSKTVVG VVDQLRQFGE LRRGWLGVRI QQVTDEIAES
LNIKPARGAL VAGIDDKGPA KPAGIEPGDV VVKFDGKDVK EPKDLSRVVA DTAVGKTVDV
VIIRKGKEET KQVTLGRLDD GAKPQPASAK SQPEPEKPVT QKALGLDLAA LSKDLRGKYK
IKDSVKGVVV VGVDTGSDAA EKRLSAGDVI VEVAQEAVTS AADIKKRIDQ VKKDGKKSVL
LLVSNGAGEL RFVALSLQ
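
The relationship between the two sequences above can be illustrated with a short sketch that strips the 10-character grouping, computes the GC fraction, and translates the DNA codon by codon. This is a minimal example, not the tool that produced this record. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares; table 11 also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this example). The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, and only the first line of the gene sequence is used to keep the example short.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: standard sense-codon assignments, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in an ungrouped DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCCGTTG CGATCACCGC CTTGAGCTTC CGGTTCAAGC CTCTGCTGAC GGCTTTGTGC"
dna = clean(first_line)
peptide = translate(dna)
print(peptide)
print(gc_fraction(dna))
```

Applied to the full gene sequence instead of the first line, the same helpers can be used to compare the translation with the listed protein sequence and the GC fraction with the listed GC content.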