Gene Information |
Locus tag | RPC_3259 |
Symbol | |
ID | 3971771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3608218 |
End bp | 3609711 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637926370 |
Product | peptidase S1C, Do |
Protein accession | YP_533120 |
Protein GI | 90424750 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
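The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a stop codon that is not counted in the protein length; neither convention is stated in the record, although both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3608218
END_BP = 3609711
GENE_LENGTH_BP = 1494
PROTEIN_LENGTH_AA = 497


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3609711 - 3608218 + 1 gives 1494 bp, and 1494 / 3 - 1 gives 497 residues, matching the Gene Length and Protein Length fields.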
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.324344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.259222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGTG CGATCTCAGC CCTTAGCCGC CGCCTGCGCC CGATCGTCGT GGCCGTTGGC CTTGCCTCTG CCGCCGCGTT CAGCGCCGCC CCGGCGCAGG CCCGCGGTCC GGACGGCATC GCCGACGTCG CCGAAAAGGT GATCGACGCG GTGGTCAATA TCTCGACCAC GCAGACCATC GAAGCCAAGG CCGGAGCCGG CGAGGGCAAG GGGGCGGCGC CGCAATTGCC GCCGGGATCG CCGTTCGAGG AGTTCTTCGA CGACTTCTTC AAGAACCGCC GCGGCGGCGA GAAGGGCAGC GGGCCGCGCA AGACCAATTC GCTGGGCTCC GGCTTCATCG TCGACACCGC CGGCATCGCC GTGACCAACA ATCACGTCAT TGCGGACGCC GACGAGATCA ACATCATCAT GAACGACGGC ACCAAGATCA AGGCGGAGCT GGTCGGCGTC GACAAGAAGA CCGATCTCGC CGTCTTGAAG TTCAAGCCGC CGGCCAAGCC GCTGGTGGCG GTGAAGTTCG GCGACAGCGA CAAGTTGCGG CTTGGCGAAT GGGTGATCGC GATCGGCAAC CCGTTCTCGC TCGGCGGCAC GGTGACCGCG GGCATCGTCT CGGCGCGCAA CCGCGACATC AATTCCGGGC CCTATGACAG CTACATCCAG ACCGACGCCG CGATTAATCG CGGCAATTCC GGCGGCCCGC TGTTCAACCT CGACGGCGAA GTGATCGGCG TCAACACGCT GATCATCTCG CCGTCCGGCG GCTCGATCGG CATCGGCTTC GCGGTGCCGT CGAAGACCGT GATCGGCGTG GTGGATTCGC TGCGGCAGTT CGGCGAATTG CGCCGCGGCT GGCTCGGCGT GCGGATCCAG CAGGTCACCG ACGAGATCGC CGAGAGCCTC AACATCAAGC CCGCGCGGGG AGCATTGATT GCCGGCGTTG AAGACAAGGG ACCGGCCAAG CCCGCCGGCA TCGAGCCCGG CGACGTCGTC ATCAGGTTCG ACGGCAAGGA CATCAAGGAG CCGAAGGATC TGTCGCGCGT GGTGGCCGAC ACCGCGGTCG GCAAGGCGGT CGACGTCGTC ATTATCCGCA AGGGCAAGGA AGAGACCAAG CAGGTCACGC TCGGCCGGCT CGACGATGGC GAGAAGCCGG TGCAGGCTTC GGTGAAGAGC CAGCCCGAAG CGGAAAAGCC GGTGACCCAG AAGGCGCTCG GCCTCGACCT CGCCTCGCTC AGCAAAGAGC AGCGCGCCAA GTTCAAGATC AAGGACAGCG TCAAGGGCGT GCTGATCACC AGCGTCGACA ACGGTTCGGA TGCGGCGGAG AAGCGTTTGA GCGCCGGCGA CGTCATCGTC GAAGTGGCGC AGGAAACCGT CGGCAACGCC AGCGACGTCA AGAAGCGGAT CGAGGCGATC AAGAAGGACG GCAAGAAATC GGTGCTGCTG TTGGTCTCCA ACGGCGACGG CGAGTTGCGC TTCGTGGCGC TTGGCGTGCA GTAA
|
Protein sequence | MTGAISALSR RLRPIVVAVG LASAAAFSAA PAQARGPDGI ADVAEKVIDA VVNISTTQTI EAKAGAGEGK GAAPQLPPGS PFEEFFDDFF KNRRGGEKGS GPRKTNSLGS GFIVDTAGIA VTNNHVIADA DEINIIMNDG TKIKAELVGV DKKTDLAVLK FKPPAKPLVA VKFGDSDKLR LGEWVIAIGN PFSLGGTVTA GIVSARNRDI NSGPYDSYIQ TDAAINRGNS GGPLFNLDGE VIGVNTLIIS PSGGSIGIGF AVPSKTVIGV VDSLRQFGEL RRGWLGVRIQ QVTDEIAESL NIKPARGALI AGVEDKGPAK PAGIEPGDVV IRFDGKDIKE PKDLSRVVAD TAVGKAVDVV IIRKGKEETK QVTLGRLDDG EKPVQASVKS QPEAEKPVTQ KALGLDLASL SKEQRAKFKI KDSVKGVLIT SVDNGSDAAE KRLSAGDVIV EVAQETVGNA SDVKKRIEAI KKDGKKSVLL LVSNGDGELR FVALGVQ
|
| |
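The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it builds the codon table from the compact NCBI base ordering (T, C, A, G), whose amino acid assignments are the same for translation table 11 and the standard code (table 11 differs in which start codons are permitted, which this sketch does not model). `HEAD` and `TAIL` are short fragments copied from the start and end of the gene sequence; all function names are illustrative.

```python
from itertools import product

# Amino acid assignments in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 24 bases of the gene sequence in the record.
HEAD = "ATGACCGGTG CGATCTCAGC CCTTAGCCGC"
TAIL = "TTCGTGGCGC TTGGCGTGCA GTAA"

if __name__ == "__main__":
    print(translate(HEAD))  # first ten residues of the protein
    print(translate(TAIL))  # last seven residues, before the TAA stop
```

Passing the full gene sequence to `translate` and `gc_fraction` is the natural way to compare against the Protein sequence and GC content fields; the fragments are used here only to keep the example short.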