Gene Information |
Locus tag | RPD_3349 |
Symbol | |
ID | 4023860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3713026 |
End bp | 3714525 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637963554 |
Product | peptidase S1C, Do |
Protein accession | YP_570474 |
Protein GI | 91977815 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
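The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes the stop codon; both assumptions match the numbers in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3713026, 3714525)  # Start bp and End bp from the record
print(length)                  # 1500
print(protein_length(length))  # 499
```

Because the gene is on the minus strand, the start codon sits at the End bp coordinate on the replicon; the length calculation is unaffected by strand.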
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.181903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCTG CGATCCCTGC CTTCAGCTTC CGGAACAGGC CCCTGCTGAC GGCGATCTGC CTCGGGGCAG CCGTTGCCTT GAGCGCGCCG GCGCCGTCCC ACGCCCGCGG TCCCGAGGGC ATCGCGGACG TCGCCGAAAA GGTGATTGAC GCCGTGGTCA ACATCTCGAC CACCCAGACC GTTGAAGCCA AGAATTCACC CGGCGAAGGG AAGGGCGCCA CGCCGAATCT GCCGCCGGGT TCGCCGTTCG AGGAGTTCTT CGACGACTTC TTCAAGAACC GCCGTGGCGG CGAGAAGGGC GGCGGGCCGC GCAAGACCAA TTCACTCGGC TCCGGCTTCA TCGTCGACAC CGCCGGCGTC GCCGTGACCA ACAATCACGT CATTGCCGAC GCCGATGAGA TCAATCTCAT CATGAACGAC GGCACTAAGA TCAAAGCCGA ACTGGTCGGC GTCGACAAGA AGACCGACCT CGCGGTGCTG AAGTTCAAGC CGCCGGCGAA CAAGCCGCTC ACCGCCGTGA AGTTCGGCGA CTCCGACAAG CTGCGGCTCG GCGAGTGGGT GGTGGCGATC GGCAACCCGT TCTCGCTCGG CGGCACAGTG ACTGCCGGCA TCGTCTCGGC GCGCAACCGC GACATCAATT CCGGGCCTTA TGACAGCTAC ATCCAGACCG ATGCTGCGAT CAATCGCGGC AATTCCGGAG GCCCGCTGTT CAACCTCAAC GCCGAAGTCA TCGGCGTCAA CACGCTGATC ATCTCGCCGT CCGGCGGCTC GATCGGCATC GGTTTCGCGG TGCCGTCGAA GACTGTGGTC GGTGTGGTCG ATCAACTCCG GCAGTTCGGC GAGCTGCGCC GCGGTTGGCT CGGCGTGCGG ATCCAGCAGG TCACCGACGA AATCGCCGAG AGCCTGAATA TCAAGCCGGC GCGCGGCGCG TTGGTCGCTG GCATCGACGA CAAGGGGCCG GCGAAACCGG CCGGCATCGA GCCCGGCGAC GTGGTCGTCA AGTTCGACGG CAAGGACGTC AAGGAGCCGA AGGATCTGTC TCGCGTGGTC GCCGACACAG CGGTCGGCAA GACCGTCGAC GTGGTGATCA TCCGCAAGGG CAAGGAAGAG ACCAAGCAGG TCACGCTCGG CCGTCTCGAC GACGGCGCCA AGCCGCAGCC GGCCTCGGCG AAGTCACAGC CTGAACCGGA AAAGCCGGTG ACGCAGAAGG CGCTCGGGCT CGACCTCGCG GCGCTGTCGA AGGATCTGCG CGGCCGCTAC AAGATCAAGG AAACCGTCAA GGGCGTGGTG GTGGTCGGCG TCGACAATGG CTCCGACGCC GCCGAGAAGC GGCTGTCGGC CGGCGACGTG ATCGTCGAGG TCGCGCAGGA AGCGGTGACC AGCGCCGCCG ACATCAAGAA GCGCGTCGAC CAGCTTAAGA AGGACGGCAA GAAGTCGGTC CTGCTGCTGG TCGCCAATGG CGAGGGCGAG CTGCGCTTCG TGGCGCTCAG CCTGCAATAG
|
Protein sequence | MPAAIPAFSF RNRPLLTAIC LGAAVALSAP APSHARGPEG IADVAEKVID AVVNISTTQT VEAKNSPGEG KGATPNLPPG SPFEEFFDDF FKNRRGGEKG GGPRKTNSLG SGFIVDTAGV AVTNNHVIAD ADEINLIMND GTKIKAELVG VDKKTDLAVL KFKPPANKPL TAVKFGDSDK LRLGEWVVAI GNPFSLGGTV TAGIVSARNR DINSGPYDSY IQTDAAINRG NSGGPLFNLN AEVIGVNTLI ISPSGGSIGI GFAVPSKTVV GVVDQLRQFG ELRRGWLGVR IQQVTDEIAE SLNIKPARGA LVAGIDDKGP AKPAGIEPGD VVVKFDGKDV KEPKDLSRVV ADTAVGKTVD VVIIRKGKEE TKQVTLGRLD DGAKPQPASA KSQPEPEKPV TQKALGLDLA ALSKDLRGRY KIKETVKGVV VVGVDNGSDA AEKRLSAGDV IVEVAQEAVT SAADIKKRVD QLKKDGKKSV LLLVANGEGE LRFVALSLQ
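The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene is on the minus strand) and that whitespace between the 10-base blocks is cosmetic. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The names `CODON_TABLE`, `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGCCCGCTG CGATCCCTGC CTTCAGCTTC"
print(translate(first_blocks))  # MPAAIPAFSF
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow a comparison against the Protein sequence and GC content fields of the record.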