Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1999 |
Symbol | |
ID | 4896200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2117282 |
End bp | 2118802 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640112593 |
Product | protease Do |
Protein accession | YP_001043875 |
Protein GI | 126462761 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0697759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAGTTGG CCGCTCAGCT ATCTCGAAAG GAGTCGGGTG TGCAGTCTCA CGCCATTACC ATCGCCCGCC GCATCCACCC GGTGCCCGCG TCGGTCTCGC GTCTCTTCCT CGCGCTGATG CTCGGGCTCG CCCTGGCGCT GGCGCAGGCC GTGGCTGTCA AGGCGCAGAA CGCTCCCGCA AGTTTCGCAG GCCTCGCCGA GAAGATCAGC CCGGCCGTCG TGAACATTAC GACCTCGACC GTCGTGGCGG CACCCACGCA GAATTCGCCC CTCGTGCCCG AAGGCTCGCC CTTCGAGGAT TTCTTCCGTG ACTTCATGGA CCCGCAGAAC CGCGGCGAGG GCCCGCGCCG CTCCGAGGCG CTGGGCTCCG GCTTCGTGAT CTCGGAAGAC GGCTACATCG TCACCAACAA CCATGTCATC GAAGGGGCCG ACGACATTCA GATCGAGTTC TTCTCGGGCA AGAAGCTCGA GGCGAAGCTC GTCGGCACCG ATCCGAAGAC CGACATCGCG CTGCTGAAGG TCGATGGGAA CCAGCCGCTG CCCTTCGTGA GCTTCGGCAA CTCGGACCTC GCCCGCGTCG GCGACTGGGT CGTGGCGATG GGCAACCCGC TGGGGCAGGG CTTCTCGGTC TCGGCCGGCA TCGTGTCGGC GCGCAACCGG GCCCTCTCCG GCACCTACGA CGATTACATC CAGACCGACG CCGCCATCAA CCGCGGCAAT TCGGGCGGCC CGCTGTTCAA CATGGACGGG CAGGTGATCG GCGTAAACAC GGCGATCCTG TCGCCGAACG GCGGCTCGAT CGGCATCGGC TTCTCGATGG CCTCGAACGT GGTGGTGAAG GTCGTCCAGC AGCTGCGCGA GTTCGGCGAG ACCCGCCGCG GCTGGCTCGG CGTGCGGATC CAGGACGTGA CCCCCGACGT GGCCGAGGCG ATGGGCCTCA CCGAGGCCAA AGGCGCCCTC GTGACCGACG TGCCGGAAGG CCCTGCGAAA GAGGCCGGCA TGCAGTCGGG CGACGTGATC GTGACCTTCG ATAGCGCGCC CGTGGCGGAC ACCCGCGATC TGGTGCGCCG GGTGGCCGAT GCGCCCATTG GCGAAGCGGT GCGGGTCATC GTGATGCGCG AAGGCAAGAC CCGGACCCTG TCGGTGACGC TGGGGCGTCG CGAGGAAGCC GAGAACGAAG GCCCCGAGGC ACCCGGCGCG GCCGAGCCGA CCGAGCCGTC GACGGCCGAT CTTCTGGGCC TGACCGTGGC GCCGCTCACG GCCGAGCAGG CCGGAGAGCT GGGCCTGCCC GGCGGCACCG AGGGGCTTGC GGTGACGGAC GTCGATCCGG CCTCAGAGGC CTATTCCAAG GGCTTGCGCG AGGGGGACGT GATCACCGAA GCCGGCCAGC AGAAAGTGGT CTCGATCAAG GATCTGCAGG ACCGTGTGAC CGAGGCGCGG GAGGCGGGGC GGAAATCGCT GCTCCTGCTG ATCCGCCGCG GCGGCGATCC GCGTTTCGTG GCCCTGACGG TCAGCGAGTA G
|
Protein sequence | MQLAAQLSRK ESGVQSHAIT IARRIHPVPA SVSRLFLALM LGLALALAQA VAVKAQNAPA SFAGLAEKIS PAVVNITTST VVAAPTQNSP LVPEGSPFED FFRDFMDPQN RGEGPRRSEA LGSGFVISED GYIVTNNHVI EGADDIQIEF FSGKKLEAKL VGTDPKTDIA LLKVDGNQPL PFVSFGNSDL ARVGDWVVAM GNPLGQGFSV SAGIVSARNR ALSGTYDDYI QTDAAINRGN SGGPLFNMDG QVIGVNTAIL SPNGGSIGIG FSMASNVVVK VVQQLREFGE TRRGWLGVRI QDVTPDVAEA MGLTEAKGAL VTDVPEGPAK EAGMQSGDVI VTFDSAPVAD TRDLVRRVAD APIGEAVRVI VMREGKTRTL SVTLGRREEA ENEGPEAPGA AEPTEPSTAD LLGLTVAPLT AEQAGELGLP GGTEGLAVTD VDPASEAYSK GLREGDVITE AGQQKVVSIK DLQDRVTEAR EAGRKSLLLL IRRGGDPRFV ALTVSE
|
| |