Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3013 |
Symbol | |
ID | 3910812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3436452 |
End bp | 3437879 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884919 |
Product | peptidase M48, Ste24p |
Protein accession | YP_486626 |
Protein GI | 86750130 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0874514 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.639267 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAATG TGATGATTGA AACTCTGCCC GGAGGATTGC GCCGGAAGGC CTGCGCGCAG GCGTCGCGGC TGGTGGCGAT CCTGAGCGCC GCCGCGCTGG CGCTCGCCCC GGTTCCCGGC CTCGCGCAGG CGCCGCAGCC GAAGGGGCCG CCGCTGCTGC GCGACACCGA GATCGAGAAT CTGCTGCGCG ACTATACGCG ACCGATCCTG CGCGTCGCCG GCCTCGAAAA GCAGAACATC CAGATCGCCA TCATCAACGA TCCGAATTTC AACGCCTTCG TCGCCGACGG CCGCCGCATC TTCGTCAATT ACGGTGCGCT GATGCAGTCG CAAACCCCGA ACCAGTTGAT CGGCGTGCTG GCGCACGAGA CCGGCCATCT CGCCGGCGGC CATCTGTCCA AGCTCCGCAC CCAGCTCGCG CAAGCACAGA CGCAGATGAT CGTGGCGATG CTGCTCGGCG TCGGCGCGAT GGTGGCGGGC TCCAAGGCCG GCCCGAACAG CGGCGCCGGC AATATCGGTG CGGCCGCGAT CTCGGCGCCG CAGGAATTGA TCCGGCGCAA TCTGCTGTCC TATCAGCGGC AGCAGGAAGA GAACGCCGAC AAGGCCGCAG TGAAATTTCT CGACGCCACC GGCCAGTCGG CGAAGGGCAT GTACGAAACG TTCCGTCGTT TCACCGACGA GAGCCTGTTC GCCGCGCGCG GCGCCGATCC TTATGCGCAG TCGCATCCGA TGCCGGCCGA ACGCGTCCGC GCGCTGGAAG AGCTGGCGCG CTCCAGCCCG AATTGGGACA AGAAGGACGA CGCCGCACTG CAGCTCCGCC ACGACATGAT GCGTGCCAAG ACGTCCGGCT TCATGGAGCG TCCCGACACC GTTTACCGGC GCTATCCGTC GTCGAACACC AGCCTGCCCG CGCGCTACGC CCGCGCCATC TCGACCTATC TGCACGGCGA TCCGCGCTCG GCGCTGGCCC AGATCGACGG CCTGATTCAG GCCGAGCCGA ACAATCCCTA TTTCTACGAG TTGCGCGGCC AGGCGCTGCT CGAGGGCGGT CGGCCGCAGG AAGCGATCGC GCCACTGCGC AAGGCGTTGT CGCTCAGCCG CAGCGCCCCG CTGATCGAGA TGCTGCTCGG CCAGGCGCTG GTGGCCTCGG GCAGCGCCGC CTCGACCGAA GAGGCGATCC GGATTCTGAA GTCGGCGCTG TCGCGCGAGG CTGAAGCGCC GCTCGGCTAC AGCCAACTCG CGATGGCCTA TGGCCGCAAG GGCGACTACG CCGAAGCCGA TCTCGCGTCG GCCCAGGCGG CCTTTCTGCG CGGCGACAAC AAGACCGCGC GCGCACTCGC GGCGCGCGCC AAGACCCGCT TCCCGGTCGG CTCGCCGGGC TGGGTCAAGG CGGACGATAT CGTCGAAGCG AAATCAACAT CCAAATAG
|
Protein sequence | MPNVMIETLP GGLRRKACAQ ASRLVAILSA AALALAPVPG LAQAPQPKGP PLLRDTEIEN LLRDYTRPIL RVAGLEKQNI QIAIINDPNF NAFVADGRRI FVNYGALMQS QTPNQLIGVL AHETGHLAGG HLSKLRTQLA QAQTQMIVAM LLGVGAMVAG SKAGPNSGAG NIGAAAISAP QELIRRNLLS YQRQQEENAD KAAVKFLDAT GQSAKGMYET FRRFTDESLF AARGADPYAQ SHPMPAERVR ALEELARSSP NWDKKDDAAL QLRHDMMRAK TSGFMERPDT VYRRYPSSNT SLPARYARAI STYLHGDPRS ALAQIDGLIQ AEPNNPYFYE LRGQALLEGG RPQEAIAPLR KALSLSRSAP LIEMLLGQAL VASGSAASTE EAIRILKSAL SREAEAPLGY SQLAMAYGRK GDYAEADLAS AQAAFLRGDN KTARALAARA KTRFPVGSPG WVKADDIVEA KSTSK
|
| |