Gene Information |
Locus tag | Rsph17025_2507 |
Symbol | |
ID | 5084858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 2556689 |
End bp | 2558074 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640484069 |
Product | protease Do |
Protein accession | YP_001168700 |
Protein GI | 146278541 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
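The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions match the values listed but are not stated by the record itself. The function names are illustrative only.

```python
# Sanity check of the record's length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the stop codon is not counted in the protein length.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(2556689, 2558074)
    print(length)                  # 1386
    print(protein_length(length))  # 461
```

Both results agree with the Gene Length (1386 bp) and Protein Length (461 aa) fields.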
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0254484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAGG CCTTGCTTGC CCTCCTTTTG ACCGCCGCCC TGTTTCCCCT GCCGATTCTG GCCGAGACAC GGATCCCGCA AAGCGCGGCC GAGATCTCGC TGACCTTTGC CCCGGTTGTC CGGTCGGCGG CCCCGGCGGT GGTGAACATC TACGCCACGC GCGTTGTGGC CCAGCGGGTG AGCCCCTTTG CGGCCGACCC GTTCTTCGAC CAGTTGTTCC GTGACTTCGG ACGCCGCCAG CCGCGGGTTC AGAACTCGCT CGGCTCGGGC GTGATTGTAG CCGCGGACGG CATCGTGGTG TCGAACTACC ATGTCGTCGG GCAGGCCGAT GCGATCCGTG TCGTGCTGAA CGACCGCCGC GAGTATGAGG CCGAGGTGAT GCTGTCCGAT CAGGACAGCG ATCTTGCCGT GCTGAAGCTG AGGGATGCGC ACGATCTGCC CTATCTGTCG CTGCGCGATT CGGACGGGAT CGAGGTGGGC GAACTGGTGC TCGCGATCGG CAACCCGTTC GGCGTGGGCC AGACGGTGTC GCAGGGAATC GTCTCGGGGC TTGCGCGGTC GGGCCTTTCG ATCGACGGCG GGCGGGGCTA CTTCATCCAG ACCGACGCGG CCATCAATCC CGGCAACTCG GGCGGGGCGC TGGTCGATAC GGCCGGAAGG CTCGTAGGCA TCAACACCGC GATCCTCACC CAGTCGGGCG GGTCGAACGG GATCGGCTTC GCCATCCCCG CCAACCTGGT GCGCAGCTTC CTGGCGCAGG CCGCGGCGGG CGAGAGCCGC TTTCAGCGCC CGTGGGCGGG GGTGAACGGG CAGGTGGTCG ATGCGGCGAT GGCCGAGGCG ATGGGGCTCG AGCGGCCCGA GGGAGTCGTC CTGACCGAAC TCGATCCCGA AAGCCCGTTC CGCGCGGCGG GCCTGCGCTC GGGTGATGTG GTGGTGGCGC TTGGCGGCCA GCGGACGGAC AGTCCGCAGG AGGTGATGTT CCGCCTGTCC TCCATGGGGA TCGGCTCGCG CACCACGGTC ACCTATCTGC GCGACGGCCA GACGCGCGAG GCCGAGGTTG CGCTGATGGC GCCGCCCGAC AATCCCCCGC GTGAGACGCT GACGCTGCGC GAGACGGCGC TTGCGGGCCT CACGGTCGAG CGGCTGAACC CGGCCGTCCG GGCCGAGATG AACCTGCCGC TGACGCTGGA AGGTGTCGTG GTGCGGGGCG CCGAGGCGCA CGCGGCGCGG GCCGGGCTCC GGCCGGGCGA TCTCCTTCTG GAGATCAACG GCCGCCGCAT CGAGCGGTCG CGCGATGTGC TGGTGGCGGC GCGGGCACAG TCGCGCTGGT GGCAGATCGA CGTGCTGCGC GACGGCCAGT CGCTGCGTCT CCGCTTCCGT CTGTAG
|
Protein sequence | MRQALLALLL TAALFPLPIL AETRIPQSAA EISLTFAPVV RSAAPAVVNI YATRVVAQRV SPFAADPFFD QLFRDFGRRQ PRVQNSLGSG VIVAADGIVV SNYHVVGQAD AIRVVLNDRR EYEAEVMLSD QDSDLAVLKL RDAHDLPYLS LRDSDGIEVG ELVLAIGNPF GVGQTVSQGI VSGLARSGLS IDGGRGYFIQ TDAAINPGNS GGALVDTAGR LVGINTAILT QSGGSNGIGF AIPANLVRSF LAQAAAGESR FQRPWAGVNG QVVDAAMAEA MGLERPEGVV LTELDPESPF RAAGLRSGDV VVALGGQRTD SPQEVMFRLS SMGIGSRTTV TYLRDGQTRE AEVALMAPPD NPPRETLTLR ETALAGLTVE RLNPAVRAEM NLPLTLEGVV VRGAEAHAAR AGLRPGDLLL EINGRRIERS RDVLVAARAQ SRWWQIDVLR DGQSLRLRFR L
|
| |
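The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a minimal sketch. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for internal codons (the two tables differ in permitted start codons). The helper names (`clean`, `gc_fraction`, `translate`, `reverse_complement`) are hypothetical, and the reverse-complement helper reflects an assumption that, because the gene lies on the minus strand, the sequence shown is the reverse complement of the replicon's forward strand over the listed coordinates.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G) paired with the standard
# amino-acid string; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record (blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def reverse_complement(seq: str) -> str:
    """Reverse complement, e.g. to map a minus-strand gene to the forward strand."""
    return clean(seq).translate(str.maketrans("ACGT", "TGCA"))[::-1]


if __name__ == "__main__":
    # First 30 bases of the gene sequence in the record.
    print(translate("ATGCGCCAGG CCTTGCTTGC CCTCCTTTTG"))  # MRQALLALLL
```

Pasting the full gene sequence into `translate` and `gc_fraction` should allow the Protein sequence and GC content fields to be compared against the computed values.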