Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3156 |
Symbol | |
ID | 4023661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3508837 |
End bp | 3510231 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963357 |
Product | peptidase S1C, Do |
Protein accession | YP_570283 |
Protein GI | 91977624 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.967436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.845897 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCGA TCCGAACGCT GACCGCACTT TGTTTGTCGA TCGCGCTGGC GACGCCGGTC GCGGCGCAGG AGCGGCGGCT GCCGTCGTCG CTGGGCGAGG TCAAGCTGAG CTACGCGCCA ATCGTCCAGC ACGCCCAGCC GGCCGTGGTG AACGTCTACG CCGCCAAGGT GGTGCAGAAC CGCAATCCGC TGCTCGAAGA TCCGATCTTT CGCCGCTTCT TCGGCGGCGG CGGCCCGCAG CCGGAGCAGA TCCAGCGCTC GCTCGGCTCT GGCGTGATGG TCGATCCGTC GGGTCTGGTC GTCACCAACA ATCACGTCAT CGACGGCGCC GATCAGGTCA AGGTGGCGCT CGCCGACAAG CGCGAGTTCG AAGCCGAGAT CGTGCTGAAG GACAGCCGCA CCGATCTGGC GGTGCTGCGG CTCAAGGATA CCAAGGAAAA ATTCGCGACG CTCGAACTTT CGAATTCCGA TGATCTGCTG GTCGGCGACC TCGTCCTCGC AATGGGCAAT CCGTTCGGCG TCGGTCAGAC CGTGACGCAC GGCATCGTCT CGGCGCTGGC GCGCACCCAG GTCGGCATCA CCGACTATCA GTTCTTCGTT CAGACCGACG CCGCGATCAA TCCCGGCAAT TCCGGCGGCG CGCTGGTCGA TATGACCGGC AAGCTGGTCG GCATCAATAC CGCGATCTTC TCGCGCTCCG GCGGCTCGCA GGGCATCGGC TTTGCGATCC CGGCCAACAT GGTGCGCGTG GTGATCGCCT CGGCCAAGGG CGGCGGCAAG GCGGTGAAAC GGCCATGGCT CGGCGCGCGG CTGCAGGCGG TGACGCCGGA GATCGCCGAG ACGCTCGGCC TGAAGCGGCC GAGTGGCGCG CTGGTGGCGA GCGTCACCAA GGGAAGTCCC TCGGAGAAGG CCGGTTTGAA ACTGTCCGAT CTGATCGTCG CGGTCGACGG CTTCCCGATC GATGATCCGA ATGCGTTCGA CTATCGCTTC GCGACACGGC CGCTCGGCGG CACCGCACAG ATCGACGCGC AGCGCGCCGG CAAGCCGGTG AAGCTCACCA TCGCGCTCGA GACCGCGCCG GACACCGGCC GCGACGAGAT CGTGCTGACC GCGCGCTCGC CGTTCCAGGG CGCCAAGATC GCCAACATCT CGCCGGCGAT CGCCGACGAG ATGCGGCTCG ACCCGAGCGT CGAAGGCGTG GTCGTCACGG AACTCGCCGA CGACGCCACC GCCGCGAATG TCGGTTTCCA GAAGGGCGAC ATTATCGTCG CGGTCAACAA CAAGCGGATC GGCAAAACCA GCGACCTCGA GCGGATCACC AACGAATCCG CGCGACTGTG GCGCATCACG CTGGTCCGCG GCGGCCAGCA GATCAACGTC ACGCTCGGCG GATGA
|
Protein sequence | MISIRTLTAL CLSIALATPV AAQERRLPSS LGEVKLSYAP IVQHAQPAVV NVYAAKVVQN RNPLLEDPIF RRFFGGGGPQ PEQIQRSLGS GVMVDPSGLV VTNNHVIDGA DQVKVALADK REFEAEIVLK DSRTDLAVLR LKDTKEKFAT LELSNSDDLL VGDLVLAMGN PFGVGQTVTH GIVSALARTQ VGITDYQFFV QTDAAINPGN SGGALVDMTG KLVGINTAIF SRSGGSQGIG FAIPANMVRV VIASAKGGGK AVKRPWLGAR LQAVTPEIAE TLGLKRPSGA LVASVTKGSP SEKAGLKLSD LIVAVDGFPI DDPNAFDYRF ATRPLGGTAQ IDAQRAGKPV KLTIALETAP DTGRDEIVLT ARSPFQGAKI ANISPAIADE MRLDPSVEGV VVTELADDAT AANVGFQKGD IIVAVNNKRI GKTSDLERIT NESARLWRIT LVRGGQQINV TLGG
|
| |