Gene Information |
Locus tag | RSP_1742 |
Symbol | |
ID | 3718948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 323186 |
End bp | 324571 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640069899 |
Product | serine protease |
Protein accession | YP_351790 |
Protein GI | 77462286 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.378867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCACG CTCTGCTTGC CTTTTCCCTG ATCGCTCTCC TGTCGCCGCT TGCCGCGCCT GCCGAGACGC GCCTGCCCGA GAGCGCCGCC GAGATTTCGC TCTCCTTCGC GCCGGTGGTG CGCTCGGCGG CACCTGCGGT CGTGAACATC TATGCCACGC GGGTGGTCGA GCAGCGCGTG AGCCCCTTTG CCGCCGATCC GTTCTTCGAC CAGCTCTTCC GGGATTTCGG TCGCCGCCAG CCGCGGGTGC AGAACTCGCT GGGCTCCGGC GTGATCGTCT CCGGGGACGG GATCGTGGTG TCGAACTATC ATGTGGTGGG ACAGGCCGAC GCCATCCGGG TCGTGCTGAA CGACCGGCGC GAATATGAGG CCGAGGTGAT GCTCGCCGAT CAGGACAGCG ACCTTGCGGT GCTGAAGCTG AAGGAGGCCA GGGATCTGCC GCATCTGGGG CTGCGCGATT CCGACGGCGT CGAGGTGGGG GAGCTGGTGC TGGCCATCGG CAACCCGTTC GGGGTGGGCC AGACCGTGTC GCAGGGCATC GTCTCGGGGC TCGCGCGCTC GGGTCTCTCG ATCGACGGCG GGCGCGGCTA TTTCATCCAG ACCGATGCCG CCATCAACCC CGGCAACTCG GGCGGCGCGC TGGTCGATAC GGCGGGGCGG CTCGTAGGGA TCAACACCGC GATCCTCACC CAGTCGGGCG GCTCGAACGG GATCGGCTTC GCCATTCCCG CCAATCTGGT GCGCAGCTTC CTTGCACAGG CGGAGGCGGG CGAGGCGCAC TTCCAGCGTC CCTGGGCAGG GGTGAACGGG CAGGCGGTCG ATGCAAGCAT GGCCGAGGCG ATGGGGCTCG AACGCCCGGA AGGGGTGGTG CTGACCGAGC TCGATCCCGA GAGCCCGTTC CGCGCCGCGG GCCTGCGCGC GGGCGATGTG GTGGTGGCGC TGGAGGGGCA GCGCACCGAC AGCCCGCAGG AGGTGATCTT CCGGCTCTCC TCCTTGGGCA TCGGCGCGCG CGCCACGGTG AGCTATCTGC GCGACGGCGA GACGCGCGAG GCCGAGATCG CGCTGGTCGT GGCGCCCGAC AAGCCGCCCC GCGAGACGGT GGCGCTGCGC GAGACGGTCC TTGCCGGGCT CACGGTCGAG CGGCTCAATC CCGCGGTGCG GGCCGAGCTG AACCTGCCCC TGACCCTCGA AGGGGTGGTG GTGCGCGCCT CCGAGGCGAC GGCGGCGCAG ACGGGCCTCC GGCCGGGCGA CATCCTGCTC GAGATCAACG GCCGCCGGAT CGAGCGCCCG CGCGATGTGG AGCGCGCCGC GCAGGAGCGG GTGCGCTGGT GGCAGATCGA CGTTCTCCGC GACGGCAAGC CGCTGCGACT GCGCTTCCGT CTCTGA
|
Protein sequence | MRHALLAFSL IALLSPLAAP AETRLPESAA EISLSFAPVV RSAAPAVVNI YATRVVEQRV SPFAADPFFD QLFRDFGRRQ PRVQNSLGSG VIVSGDGIVV SNYHVVGQAD AIRVVLNDRR EYEAEVMLAD QDSDLAVLKL KEARDLPHLG LRDSDGVEVG ELVLAIGNPF GVGQTVSQGI VSGLARSGLS IDGGRGYFIQ TDAAINPGNS GGALVDTAGR LVGINTAILT QSGGSNGIGF AIPANLVRSF LAQAEAGEAH FQRPWAGVNG QAVDASMAEA MGLERPEGVV LTELDPESPF RAAGLRAGDV VVALEGQRTD SPQEVIFRLS SLGIGARATV SYLRDGETRE AEIALVVAPD KPPRETVALR ETVLAGLTVE RLNPAVRAEL NLPLTLEGVV VRASEATAAQ TGLRPGDILL EINGRRIERP RDVERAAQER VRWWQIDVLR DGKPLRLRFR L
|
| |
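The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and that the terminal stop codon (TGA here) is not counted in the protein length. The function names are hypothetical and are not part of any IMG or NCBI tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage over A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(323186, 324571)   # 1386
length_aa = protein_length(length_bp)     # 461
print(length_bp, length_aa)
```

Passing the space-separated gene sequence from the Sequence section to `gc_percent` gives a value that can be compared against the reported GC content field.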