Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_0355 |
Symbol | |
ID | 3719036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 2084795 |
End bp | 2086276 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640071565 |
Product | serine protease |
Protein accession | YP_353430 |
Protein GI | 77463926 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.459558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGTCTC ACGCCATTAC CATCGCCCGC CGCATCCACC CGGTGCCCGC GTCGGTCTCG CGTCTCTTCC TCGCGCTGAT GCTCGGGCTC GCCCTGGCGC TGGCGCAGGC CGTGGCGGTC AAGGCGCAGA ACGCTCCCGC AAGTTTCGCA GGCCTCGCCG AGAAGATCAG CCCGGCCGTC GTGAACATCA CGACCTCGAC CGTCGTGGCG GCACCCACGC AGAATTCGCC CCTCGTGCCC GAAGGCTCGC CCTTCGAGGA TTTCTTCCGT GACTTCATGG ACCCGCAGAA CCGCGGCGAG GGCCCGCGCC GCTCCGAGGC GCTGGGTTCG GGCTTCGTGA TCTCGGAAGA CGGCTACATC GTCACCAACA ATCATGTCAT CGAAGGGGCC GACGACATCC AGATCGAGTT CTTCTCGGGC AAGAAGCTCG AGGCGAAGCT CGTCGGCACC GATCCGAAGA CCGACATCGC GCTGCTGAAG GTCGATGGGA ACCAGCCGCT GCCCTTCGTG AGCTTCGGCA ACTCCGACCT CGCCCGCGTT GGCGACTGGG TCGTGGCGAT GGGCAACCCG CTGGGGCAGG GCTTCTCGGT CTCGGCCGGC ATCGTGTCGG CGCGCAACCG GGCCCTCTCC GGCACCTACG ACGATTACAT CCAGACCGAC GCCGCCATCA ACCGCGGCAA TTCGGGCGGT CCGCTGTTCA ACATGGACGG GCAGGTGATC GGCGTGAACA CGGCGATCCT GTCGCCGAAC GGCGGCTCGA TCGGCATCGG CTTCTCGATG GCCTCGAACG TGGTGGTGAA GGTCGTGCAG CAGCTGCGCG AGTTCGGCGA GACCCGCCGC GGCTGGCTCG GCGTGCGGAT CCAGGACGTG ACCCCCGACG TGGCCGAGGC GATGGGCCTC ACCGAGGCCA AAGGCGCCCT CGTGACCGAC GTGCCGGAAG GCCCCGCGAA AGAGGCCGGC ATGCAGTCTG GCGACGTGAT CGTGACCTTC GATAGCGCGC CCGTGGCGGA CACCCGCGAT CTGGTGCGCC GGGTGGCCGA TGCGCCCATT GGCGAGGCGG TGCGTGTCAT CGTGATGCGC GAAGGCAAGA CCCGGACCCT GTCGGTGACG CTCGGGCGTC GCGAGGAAGC CGAGAACGAA GGCCCCGAGG CACCCGGCGC GACCGAGCCG ACGGAACCGT CGACGGCCGA TCTTCTGGGC CTGACCGTGG CGCCGCTCAC GGCCGAGCAG GCCGGAGAGC TGGGCCTGCC CGGCGGCACC GAGGGGCTTG CCGTGACGGA TGTCGATCCG GCCTCCGAGG CCTATTCCAA GGGCTTGCGC GAGGGAGACG TGATCACCGA GGCCGGCCAG CAGAAAGTGG TCTCGATCAA GGATCTGCAG GACCGTGTGA CCGAGGCGCG GGAGGCGGGG CGGAAATCGC TGCTCCTGCT GATCCGCCGC GGCGGCGATC CGCGTTTCGT GGCCCTGACG GTCAGCGAGT AG
|
Protein sequence | MQSHAITIAR RIHPVPASVS RLFLALMLGL ALALAQAVAV KAQNAPASFA GLAEKISPAV VNITTSTVVA APTQNSPLVP EGSPFEDFFR DFMDPQNRGE GPRRSEALGS GFVISEDGYI VTNNHVIEGA DDIQIEFFSG KKLEAKLVGT DPKTDIALLK VDGNQPLPFV SFGNSDLARV GDWVVAMGNP LGQGFSVSAG IVSARNRALS GTYDDYIQTD AAINRGNSGG PLFNMDGQVI GVNTAILSPN GGSIGIGFSM ASNVVVKVVQ QLREFGETRR GWLGVRIQDV TPDVAEAMGL TEAKGALVTD VPEGPAKEAG MQSGDVIVTF DSAPVADTRD LVRRVADAPI GEAVRVIVMR EGKTRTLSVT LGRREEAENE GPEAPGATEP TEPSTADLLG LTVAPLTAEQ AGELGLPGGT EGLAVTDVDP ASEAYSKGLR EGDVITEAGQ QKVVSIKDLQ DRVTEAREAG RKSLLLLIRR GGDPRFVALT VSE
|
| |