Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1841 |
Symbol | |
ID | 3971721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1996874 |
End bp | 1998460 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637924954 |
Product | peptidase S1C, Do |
Protein accession | YP_531719 |
Protein GI | 90423349 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATC GCCCCACCGA ATTGTCCTCG CTGCCGTCCT ATCGGCCGTT GCGCCGCTCG CTGTTCTCGG CGCGCAAGTT TGCGCTGATG GCGTCGGTGG TCGCAGGCCT CGGCGCCGGC CTCTATGGCC TCAGCCCCGC GCCAGACCAT TTCAATGTGC TCAGCACCGC CGCCCACGCC CAGGTCAACA ACGAGGTGCG CAAGGTGGCG CAGCCGGTCG GCTTCGCCGA CATTGTCGAG CGGGTGAAGC CCTCGGTGAT TTCGGTCAAG GTCAACATCA ACGAGAAGGT CGCCAAGAAC GACGACAGCG CCGAGGATTC GCCGTTCCAG CCAGGCTCGC CGATGGAGCG CTTCTTCCGT CGCTTCGGCG GCCCGGATGG CATGCCCCCT GGATTGCGCG GCGGTCGTGG CGGCCGCGGC GCGGTGACCG GGCAAGGCTC CGGCTTCTTC ATCTCGGATG ACGGCTATGC GGTCACCAAC AACCACGTGG TCGATGGCGC CGATAAGGTC GAAGTCACCA CCGACGACGG CCGTACCTTC AAGGCCAAGG TGATCGGCAC CGATCCGCGC ACCGATCTGG CGCTGATCAA GGTCGAAGGC GGCAACAATT TCCCGTTCGC CAAATTGGCC GAAGGCAAGC CGCGGATTGG CGACTGGGTG CTCGCCGTCG GTAACCCGTT CGGGCTCGGC GGCACCGTGA CCGCCGGCAT CGTCTCGGCC TCTGGCCGCG ACATCGGCAA CGGCCCCTAT GACGATTTCA TCCAGATCGA TGCCCCGGTG AACAAGGGCA ACTCGGGTGG ACCGGCGTTC GACACCTCCG GCGAGGTGAT GGGCGTCAAC ACCGCGATCT ATTCGCCGTC CGGCGGTAGC GTCGGCATCG CGTTCTCGAT CCCCGCTTCG ACCGTGAAGA CGGTGGTCGC CCAGCTCAAG GACAAGGGCT CGGTCAGCCG CGGCTGGATC GGCGTGCAGA TTCAACCCGT GACCCAGGAG ATCGCCGACA GCCTCGGCTT GAAGAAGGCC GACGGCGCGC TGGTCGCCGA GCCGCAGGCT GACGGTCCGG CCGCCAAAGC CGGCATCCAA TCCGGCGACG TCATCACCGC GGTGAACGAC ACCCCGGTCA AGGATGCCCG TGAACTCGCC CGCACCATCG GCGGCTTTGC GCCGGGCAAC TCGGTGAAGC TCAACGTCAT CCACAAGGGC CAGGACAAGG TGGTCAACCT CACTCTCGGG CAGTTGCCGA ACACGATCGA AGCCAAGGCC GATGTCGACC GCGGCGATCA CGGCGACGCC AAGCGGGGCT CCGACATTCC GCGGCTCGGC CTGACGCTGG CGCCCGCCGG CACCGTAGCC GGCGCCGGCA AGGATGGCGT TGTCGTCACC GAGGTCGACC CGAAGAGTGC GGCCGCCGAG CGCGGCTTCA AGGAAGGCGA CGTCATTCTG GAAGTGGCCG GCAAGAGCGT CGCCAGCCCG ACCGAGGTGC GCGAAGCCCT GGCGTCGGCG AAGACCGAGA ACAAGAACAG CGTGCTGATC AGGGTTCGCA GCGGTGGCTC GTCGCGTTTC GTGGCGGTGC CGTTGGCAAA GGGCTGA
|
Protein sequence | MTDRPTELSS LPSYRPLRRS LFSARKFALM ASVVAGLGAG LYGLSPAPDH FNVLSTAAHA QVNNEVRKVA QPVGFADIVE RVKPSVISVK VNINEKVAKN DDSAEDSPFQ PGSPMERFFR RFGGPDGMPP GLRGGRGGRG AVTGQGSGFF ISDDGYAVTN NHVVDGADKV EVTTDDGRTF KAKVIGTDPR TDLALIKVEG GNNFPFAKLA EGKPRIGDWV LAVGNPFGLG GTVTAGIVSA SGRDIGNGPY DDFIQIDAPV NKGNSGGPAF DTSGEVMGVN TAIYSPSGGS VGIAFSIPAS TVKTVVAQLK DKGSVSRGWI GVQIQPVTQE IADSLGLKKA DGALVAEPQA DGPAAKAGIQ SGDVITAVND TPVKDARELA RTIGGFAPGN SVKLNVIHKG QDKVVNLTLG QLPNTIEAKA DVDRGDHGDA KRGSDIPRLG LTLAPAGTVA GAGKDGVVVT EVDPKSAAAE RGFKEGDVIL EVAGKSVASP TEVREALASA KTENKNSVLI RVRSGGSSRF VAVPLAKG
|
| |