Gene RPB_2323 details


Gene Information

Locus tag: RPB_2323
Symbol:
ID: 3908954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2671433
End bp: 2672824
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 68%
IMG OID: 637884220
Product: peptidase S1C, Do
Protein accession: YP_485939
Protein GI: 86749443
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
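
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed 1392 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2671433, 2672824)
print(length)                     # 1392
print(protein_length_aa(length))  # 463
```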


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.052989
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.164438
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCTG TCCGCATCTT CACGGCCGTC CTGCTCTCGC TCGCGGCGTC GGCCCCGGCG 
ACCGCGCAGG AGCGGCGGCT GCCGGCGTCG CAGGCCGAGA TCAAGCTCAG CTACGCGCCG
ATCGTGCAGC ACGCGCAGCC GGCGGTGGTG AACGTCTACG CCGCCAAGGT GGTGCAGAAC
CGCAATCCGC TCTTGGAAGA CCCGATCTTC CGCCGCTTCT TCGGCGGCGG CCCGCAGCCC
GAGCAGATCC AGCGCTCGCT CGGAAGCGGC GTGATGGTCG ATCCGTCGGG CCTCGTCGTC
ACCAACAATC ACGTCATCGA CGGCGCCGAT CAGGTCAAGG TCGCGCTCGC CGACAAGCGC
GAGTTCGAGG CCGAGATCGT GCTGAAGGAC AGCCGCACCG ATCTCGCGGT GCTGCGCCTC
AAGGATACCA GGGAGAAATT CGCCACCCTC GAACTGGCCA ATTCGGACGA GCTGCTGGTC
GGCGATCTGG TGCTGGCGTT GGGCAATCCG TTCGGCGTCG GCCAGACCGT GACGCACGGC
ATCGTCTCGG CGCTGGCGCG CACCCAGGTC GGCATCACCG ACTATCAGTT CTTCATCCAG
ACCGACGCGG CGATCAATCC CGGCAATTCC GGCGGCGCGC TGGTCGACAT GACCGGCAAG
CTGGTCGGCA TCAACACCGC GATCTTTTCG CGCTCCGGCG GCTCGCAGGG CATCGGCTTC
GCGATCCCGA CCAACATGGT GCGGGTGGTG ATCGCCTCGG CCAAGAGCGG CGGCAAGGCG
GTGAAGCGGC CGTGGCTCGG CGCGCGGCTG CAGGCGGTGA CGCCGGAGAT CGCCGAGACG
CTCGGTCTGA AGCGGCCCAG CGGCGCGCTG GTGGCGAGCG TCACCAAGGG CAGCCCGTCC
GACAAGGCGG GGCTGAGACT GTCCGATCTG ATCGTCGGGG TCGACGGCTT TGCGATCGAT
GATCCCAACG CGTTCGACTA TCGCTTCGCC ACCCGCCCGC TCGGCGGCAC CGCGCAGATC
GACGTGCAGC GCGGCGGCAA GCCGCTCAAG CTCAGCATCA CGCTGGAGAC CGCGCCGGAC
ACCGGCCGCG ACGAGATCGT GCTGACCGCG CGCTCGCCGT TCCAGGGTGC CAGGATCGCC
AACATCTCGC CGGCGATCGC CGACGACCTG CGGCTCGACC CCAGCGTCGA GGGCGTGGTC
GTGACCGATC TCGCCGACGG CGCCACCGCG GCGAGCGTCG GTTTCCAGAA GGGCGACATC
ATCGTCGCGG TCAACAACAA GCGCATCGGC AAGACCAGCG ACCTCGAACG CATCACCAAC
GAGTCGTTCC GCCTGTGGCG CATCACCGTC GTTCGCGGCG GCCAGCAGAT CAACGTCACG
CTCGGCGGAT GA
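
The GC content reported for this gene can in principle be recomputed from the sequence above. The sketch below is one simple way to do so, assuming GC content means the percentage of G and C bases among all bases, with the blank-separated blocks of ten joined before counting. The helper name `gc_content` is hypothetical, and only the first row of the sequence is used as sample input; pasting the full sequence would give the gene-wide value.

```python
def gc_content(seq):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGTCCGCTG TCCGCATCTT CACGGCCGTC CTGCTCTCGC TCGCGGCGTC GGCCCCGGCG"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```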
 
Protein sequence
MSAVRIFTAV LLSLAASAPA TAQERRLPAS QAEIKLSYAP IVQHAQPAVV NVYAAKVVQN 
RNPLLEDPIF RRFFGGGPQP EQIQRSLGSG VMVDPSGLVV TNNHVIDGAD QVKVALADKR
EFEAEIVLKD SRTDLAVLRL KDTREKFATL ELANSDELLV GDLVLALGNP FGVGQTVTHG
IVSALARTQV GITDYQFFIQ TDAAINPGNS GGALVDMTGK LVGINTAIFS RSGGSQGIGF
AIPTNMVRVV IASAKSGGKA VKRPWLGARL QAVTPEIAET LGLKRPSGAL VASVTKGSPS
DKAGLRLSDL IVGVDGFAID DPNAFDYRFA TRPLGGTAQI DVQRGGKPLK LSITLETAPD
TGRDEIVLTA RSPFQGARIA NISPAIADDL RLDPSVEGVV VTDLADGATA ASVGFQKGDI
IVAVNNKRIG KTSDLERITN ESFRLWRITV VRGGQQINVT LGG
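
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following minimal sketch shows how the two relate, using only the Python standard library. Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts; since this gene begins with ATG, the sketch does not handle alternative start codons. The `translate` helper and the codon-table construction are illustrative and are not taken from the source database.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence from the record.
print(translate("ATGTCCGCTG TCCGCATCTT CACGGCCGTC CTGCTCTCGC TCGCGGCGTC GGCCCCGGCG"))
# MSAVRIFTAVLLSLAASAPA
print(translate("CTCGGCGGAT GA"))
# LGG
```

Both outputs match the start and end of the protein sequence listed above.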