Gene RPB_3443 details


Gene Information

Locus tag: RPB_3443
Symbol:
ID: 3911245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3948654
End bp: 3950240
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 68%
IMG OID: 637885346
Product: peptidase S1C, Do
Protein accession: YP_487050
Protein GI: 86750554
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
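
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the gene record above.
START_BP = 3948654
END_BP = 3950240
GENE_LENGTH_BP = 1587
PROTEIN_LENGTH_AA = 528


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming both ends are inclusive."""
    return end - start + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: 3950240 - 3948654 + 1 = 1587 bp, and 1587 / 3 - 1 = 528 aa.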


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.203092
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0980299
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACC GTTCTTCCGA TCTCACCTCG CAGCCGACCA GCCCGGTGCG CCGTTCGCTG 
TTCTCGGCGC GCAAATTCGC GCTGATGGCG TCGGTGGTGG CCGGCCTCGG CGCGGGCGCG
TTCGGCCTGT CGCAGGTCCC GGCGTCGGGC GATCTGTTCA GCACGGCGGC GCAGGCGCAG
GTCGGCAACG AGGTCCACAA GGCGCAGCAG CCCGTCGGCT TCGCCGACAT CGTCGAGAAG
GTGAAGCCCT CGGTGATCTC GGTGAAGGTC AACATCGCCG AAAAAGTTGC GAAGACCGAC
GAGCGCAGCG AGCGCAGCGA AGAATCGCCG TTCGCGCCCG GCTCGCCGAT GGAGCGCTTC
TTCCGCCGCT TCGGCGGTGA AATGCCGCCC GGCATGCGCG GCCATCGCGG CGGCGGCATG
ATGACGGGGC AGGGCTCGGG CTTCTTCATC ACCGCCGACG GCTACGCCGT CACCAACAAT
CACGTCGTCG ACGGCGCCGA CAAGGTCGAA GTCACCACCG ACGACGGCAA GACCTACAAG
GCCAAGGTGA TCGGCACCGA CCAGCGCACC GATCTGGCGC TGATCAAAGC CGAAGGCCGC
ACCGACTTCC CGTTCGCCAA GCTGTCCGAA GGCAAGCCGC GGATCGGCGA CTGGGTGCTC
GCGGTCGGCA ATCCGTTCGG CCTCGGCGGC ACCGTCACCG CCGGCATCGT CTCGGCCTCC
GGCCGCGACC TCGGCAACGG TCCGTATGAC GATTTCATCC AGATCGACGC GCCGGTGAAC
AAAGGCAATT CCGGCGGTCC GGCGTTCGAC GTCAATGGCG AAGTGATGGG TGTCAACACG
GCGATCTACT CGCCGTCGGG CGGCAGCGTC GGCATCGCGT TCTCGATCCC GGCCTCGACC
GTCAAGGCGG TGGTGCAGCA GCTCAAGGAC AAGGGCTCCG TCAGCCGTGG CTGGATCGGC
GTCCAGATCC AGCCGGTGAC GCCGGAGATC GCCGACAGCC TCGGGCTGAA GAAGCCGGAC
GGCGCGTTGG TGGCCGAGCC GCAGCCCAAC GGCCCGGCCG CCAAGGCCGG CATCGAATCC
GGCGACGTCA TCACCGCGGT CAACGGCGCG CCGGTGAAGG ACGCGCGCGA GCTCGCCCGC
ACCATCGGCG GCTTCGCGCC GGGCAATACG GTGAAGCTCA CCGTGTTCCA CAAGGGCGCG
GATCGGGAAC TCAGCCTGAC GCTCGGCCAA TTGCCGAACC AGGTCGAGGC CAAGGCCAAT
CTCGACGGCG ACAACGGTCG CCAATCCAGC CGCGGCACCG AAGTGCCGAG GCTCGGCCTG
ACGGTCGCGC CGGCCAGTTC GGTCGCCGGT GCCGGCAAGG ATGGCGTGGT GGTCACCGAC
GTCGATCCGA AGAGCGCCGC AGCCGATCGC GGCTTCAAGG AAGGCGACGT GATCCTCGAG
GTCGCGGGCA AGAACGTGGC GAGCCCGGGT GACGTCCGCG ACGCCATCAA CACCGCCAAG
AACGACAACA AGAACAGCGT GCTGATCCGG GTCCGCTCGG GTGGTTCGTC ACGTTTCGTC
GCGGTGCCGA TCTCGGCCAA GGGCTGA
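
The GC content reported in the gene record can be recomputed from the gene sequence once the display spacing is removed. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence length; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo uses only the first display line rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First display line of the gene sequence, used only as a short demo.
first_line = "ATGACCGACC GTTCTTCCGA TCTCACCTCG CAGCCGACCA GCCCGGTGCG CCGTTCGCTG"
demo = clean_sequence(first_line)
print(len(demo), f"{gc_fraction(demo):.0%}")
```

Passing the complete gene sequence to `clean_sequence` and then `gc_fraction` gives the value to compare against the reported percentage.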
 
Protein sequence
MTDRSSDLTS QPTSPVRRSL FSARKFALMA SVVAGLGAGA FGLSQVPASG DLFSTAAQAQ 
VGNEVHKAQQ PVGFADIVEK VKPSVISVKV NIAEKVAKTD ERSERSEESP FAPGSPMERF
FRRFGGEMPP GMRGHRGGGM MTGQGSGFFI TADGYAVTNN HVVDGADKVE VTTDDGKTYK
AKVIGTDQRT DLALIKAEGR TDFPFAKLSE GKPRIGDWVL AVGNPFGLGG TVTAGIVSAS
GRDLGNGPYD DFIQIDAPVN KGNSGGPAFD VNGEVMGVNT AIYSPSGGSV GIAFSIPAST
VKAVVQQLKD KGSVSRGWIG VQIQPVTPEI ADSLGLKKPD GALVAEPQPN GPAAKAGIES
GDVITAVNGA PVKDARELAR TIGGFAPGNT VKLTVFHKGA DRELSLTLGQ LPNQVEAKAN
LDGDNGRQSS RGTEVPRLGL TVAPASSVAG AGKDGVVVTD VDPKSAAADR GFKEGDVILE
VAGKNVASPG DVRDAINTAK NDNKNSVLIR VRSGGSSRFV AVPISAKG
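
The protein sequence can be rederived from the gene sequence by codon-by-codon translation. The sketch below is an illustration, not the database's own pipeline: it builds the standard codon assignments, which translation table 11 shares for elongation, and it does not special-case the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; table 11 uses the same
# assignments and differs only in which codons may act as start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACCGACC GTTCTTCCGA TCTCACCTCG"))  # MTDRSSDLTS
```

The output matches the first block of the protein sequence above. Translating the full gene sequence the same way gives a string that can be compared with the listed protein after its display spaces are removed.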