Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3691 |
Symbol | |
ID | 4024207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4118230 |
End bp | 4119309 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637963895 |
Product | peptidase M48, Ste24p |
Protein accession | YP_570813 |
Protein GI | 91978154 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGTG ACGTTTCCGC CCCGCCGCCG GCGGTCTTCT TCGACGGCGC GTCGAGCCGG CGCCGGCCGG TGACGCTGGC GTTCTCCGAT CGGCTCGAGA TCCTGCAGGA CGGCCGCACG CTGGCGGCGT GGCCGTTCGC AGATATTCGC CGCGCCGACG GCGCTCCCGG CCTGTTGCGG CTCGGCTGCG TTTCCGCACC GGCGCTGGCC CGGCTGGAGG TCCCCGATCC CGCCATTGCG CAACAGCTTG CCGCGCGCTG CAGCTATCTC GATGCCGACG TTCCCCAACG TCACGGCGTC CGCGCCATCG TCGGCTGGTC GTTGGCCGCG ATCGTATCGC TGGTTCTGGT GTCGGTTTAC GGCATGCCGC TGATCGCCGA TCGCCTGGCG CCGCTGTTGC CGCAGGCGTT CGAGCGCCGC GTCGGCGACG TCGCCGACCG GCAGATCAGG ACGTTGTTCG GCGACAAGGT CTGCGACCGG CCGGCCGGGC AGGCGGCGTT CGCCGTGTTG GTCGAGAAGC TGCGCGCCGC CGGCAGCATC GGCGAAACGG TGCAGCCGGC GGTGCTGTCG AGCGAGATCT CCAACGCCAT CGCGCTGCCG GGCGGCCGTG TCTATCTGTT CAGCGCGCTG CTCGACAAGG CGGACAATCC CGACGAGATC GCCGGCGTGC TCGCGCATGA ATTCGGCCAT GTCGCGCGCC GCGACAACAT GCGGCATCTG ATCCGCGAAG GCGGCAGTTC GTTCCTGATC GGTCTGTTGT TCGGCGACGT CACCGGGTCG GGCGCGCTGA TCTTCGCCTC GCGCACGCTG CTCAATTCGT CCTACTCGCG CGAAGCCGAA CACGACGCCG ACAGCTTCGC CATCGGCGTG ATGCACGGCC TCGGCCGGCC GGTGAAGCCG ATGGGCGAGC TGCTGTTCCG GGTCACCGGC AAGCAGCGCG ATTCGAGCAT CAGCATTCTG GCCAGTCATC CACTGACCGA GGACCGGCTG GCGCGGATGA GCGCCGAAGC TGCGATGTCG CCCGGCGCGC CGCTGCTCTC TGCCGAGCAG TGGCAGGCCC TGAAGGCGAT CTGCAAGTAG
|
Protein sequence | MMSDVSAPPP AVFFDGASSR RRPVTLAFSD RLEILQDGRT LAAWPFADIR RADGAPGLLR LGCVSAPALA RLEVPDPAIA QQLAARCSYL DADVPQRHGV RAIVGWSLAA IVSLVLVSVY GMPLIADRLA PLLPQAFERR VGDVADRQIR TLFGDKVCDR PAGQAAFAVL VEKLRAAGSI GETVQPAVLS SEISNAIALP GGRVYLFSAL LDKADNPDEI AGVLAHEFGH VARRDNMRHL IREGGSSFLI GLLFGDVTGS GALIFASRTL LNSSYSREAE HDADSFAIGV MHGLGRPVKP MGELLFRVTG KQRDSSISIL ASHPLTEDRL ARMSAEAAMS PGAPLLSAEQ WQALKAICK
|
| |