Gene Information |
Locus tag | Rsph17025_0887 |
Symbol | |
ID | 5083613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 904019 |
End bp | 905500 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640482444 |
Product | protease Do |
Protein accession | YP_001167095 |
Protein GI | 146276936 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
| |
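The coordinate, gene length, and protein length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the function names `feature_length_bp` and `protein_length_aa` are illustrative and not part of any database API.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Protein length for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp 904019, End bp 905500.
gene_length = feature_length_bp(904019, 905500)
print(gene_length)                     # 1482
print(protein_length_aa(gene_length))  # 493
```

Both results agree with the Gene Length (1482 bp) and Protein Length (493 aa) fields of the record.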
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0626906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.172287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGTCTC ACGCCATTTC CATCGCCCGC CGGAAGGAAC CGGTGCCCAT CGCCGCATGG CGCCTTTTCC TCGCGCTGAT GCTGGGCCTG GGGCTGGCGC TTGCGCAGGC GGTGTCGGCC CATGCGCAGG GGGCCCCGGC CAGCTTCGCC GGTCTCGCCG AAAAGATCAG CCCGGCGGTG GTGAACATCA CCACCTCGAC CGTTGTCGCG GCACCCACGC AGAGTTCTCC GCTCGTGCCC GAAGGCTCGC CCTTCGAGGA TTTCTTCCGC GACTTCATGG ACCCGCAGAA CCGCGAAGGT GGACCGCGCC GCTCCGAGGC GCTGGGCTCG GGCTTCGTGA TCTCGGAAGA CGGCTTCATC GTGACCAACA ACCATGTCAT CGAAGGGGCG GACGACATCC AGATCGAGTT CTTCTCGGGC AACAAGCTCG AGGCGAAGCT CGTGGGCACC GATCCCAAGA CCGACATCGC CCTGCTCAAG GTCTCGAGCA ACCAGCCGCT CCCGTTCGTG AGCTTCGGCA ACTCGGATCT CGCGCGGGTG GGCGACTGGG TGGTGGCGAT GGGCAACCCT CTGGGGCAGG GCTTTTCGGT CTCGGCCGGG ATCATCTCGG CGCGCAACCG GGCGCTCTCG GGCACCTACG ATGACTACAT CCAGACCGAC GCCGCCATCA ACCGCGGCAA CTCGGGCGGG CCGCTGTTCA ATCTCGACGG TCAGGTGATC GGCGTGAACA CGGCGATCCT CTCGCCCAAC GGCGGCTCGA TCGGGATCGG CTTCTCGATG GCCTCGAACG TGGTGGTGAA GGTCGTCCAG CAACTTCGCG AGTTCGGCGA GACGCGGCGC GGCTGGCTCG GCGTGCGGAT CCAGGACGTG ACCCCGGACG TGGCCGAGGC GATGGGGCTG GCCGAGGCGA AGGGGGCGCT GGTGACGGAC GTGCCCGACG GGCCTGCGAA AGAGGCCGGA ATGCAGTCGG GCGACGTGAT CGTGACCTTT GACAAGGCGC CGGTGGCCGA CACCCGCGAT CTCGTGCGCC GCGTGGCGGA CGCCCCGATC GGTGAGGCCG TGCGCGTGGT CGTGATGCGT GAAGGCAAGA CCCGCACGCT CTCCGTGGTG CTCGGGCGCC GGGAGGAAGC CGAGGGTGAG GGCCCGGCGG CGTCCGTCGA GTCTGCCCCG ACGGAACCTT CGACCGCCAA CCTTCTGGGC CTGACAGTGG CTCCGCTGAC GGCCGAGCAG GCCGCCGAGC TGGGTCTGCC GCCCGGCACC GAGGGGCTCG CGGTGACGGA TGTGGACACG GCCTCCGAGG CCTATTCCAA GGGGCTGCGC GAGGGGGATG TCATCACCGA GGCGGGTCAG CAGAAGGTCA TGACGATCAA GGATCTCCAG GACCGCGTGG ACGAGGCCCG CGAGGCCGGG CGCAAGTCGC TCCTGCTTCT AATCCGACGG GGCGGTGACC CACGCTTCGT GGCTCTCACG ATCACCGAAT GA
|
Protein sequence | MQSHAISIAR RKEPVPIAAW RLFLALMLGL GLALAQAVSA HAQGAPASFA GLAEKISPAV VNITTSTVVA APTQSSPLVP EGSPFEDFFR DFMDPQNREG GPRRSEALGS GFVISEDGFI VTNNHVIEGA DDIQIEFFSG NKLEAKLVGT DPKTDIALLK VSSNQPLPFV SFGNSDLARV GDWVVAMGNP LGQGFSVSAG IISARNRALS GTYDDYIQTD AAINRGNSGG PLFNLDGQVI GVNTAILSPN GGSIGIGFSM ASNVVVKVVQ QLREFGETRR GWLGVRIQDV TPDVAEAMGL AEAKGALVTD VPDGPAKEAG MQSGDVIVTF DKAPVADTRD LVRRVADAPI GEAVRVVVMR EGKTRTLSVV LGRREEAEGE GPAASVESAP TEPSTANLLG LTVAPLTAEQ AAELGLPPGT EGLAVTDVDT ASEAYSKGLR EGDVITEAGQ QKVMTIKDLQ DRVDEAREAG RKSLLLLIRR GGDPRFVALT ITE
|
| |
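The protein sequence begins with M although the gene sequence begins with GTG, because under translation table 11 an alternative start codon is read as methionine at the first position. The sketch below illustrates this on the first and last bases of the gene sequence and also shows how a GC fraction can be computed. It is a hand-rolled illustration, not the pipeline used to produce this record: the names `translate_table11` and `gc_fraction` are hypothetical, and the start codon set is the one listed for NCBI translation table 11 as far as I am aware.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_table11(seq: str, first_is_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table11("GTGCAGTCTC ACGCCATTTC CATCGCCCGC"))  # MQSHAISIAR
# Last 12 bases, translated as an internal fragment ending in the TGA stop codon.
print(translate_table11("ATCACCGAAT GA", first_is_start=False))  # ITE
```

Passing the full gene sequence to `translate_table11` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields; only the two short fragments shown here were checked by hand.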