Gene Information |
Locus tag | Rsph17029_0388 |
Symbol | |
ID | 4897569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 400776 |
End bp | 402161 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640110972 |
Product | protease Do |
Protein accession | YP_001042276 |
Protein GI | 126461162 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
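The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in Protein Length; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 400776
END_BP = 402161


def gene_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1386, matches Gene Length
print(protein_length(length_bp))  # 461, matches Protein Length
```

Under these assumptions, 1386 bp corresponds to 462 codons, of which 461 encode amino acids and the last is the stop codon.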
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCACG CTCTGCTTGC CTTTTCCCTG ATCGCTCTCC TGTCGCCGCT TGCCGCGCCC GCCGAGACGC GCCTGCCCGA GAGCGCCGCC GAGATTTCGC TCTCCTTTGC GCCGGTGGTG CGCTCGGCGG CACCTGCGGT CGTGAACATC TATGCCACGC GGGTGGTCGA GCAGCGCGTG AGCCCCTTTG CCGCCGATCC GTTCTTCGAC CAGCTCTTCC GGGATTTCGG TCGCCGCCAG CCGCGGGTGC AGAACTCGCT GGGCTCCGGC GTGATCGTCT CCGGCGACGG GATCGTGGTG TCGAACTATC ATGTGGTGGG ACAGGCGGAT GCGATCCGGG TCGTGCTGAA CGACCGGCGC GAATATGAGG CCGAGGTGAT GCTCGCCGAT CAGGACAGCG ACCTTGCGGT ACTGAAGTTG AAGGAGGCTG CGGATCTGCC GCATCTGGGG CTGCGCGATT CCGACGGCGT CGAGGTGGGG GAGCTGGTGC TGGCCATCGG CAACCCGTTC GGAGTGGGCC AGACCGTATC GCAGGGCATC GTCTCGGGGC TCGCGCGCTC GGGTCTCTCG ATCGACGGCG GGCGCGGCTA TTTCATCCAG ACCGACGCCG CCATCAACCC CGGCAACTCG GGCGGCGCAC TGGTCGATAC TGCGGGGCGG CTCGTGGGGA TCAACACGGC GATCCTCACC CAGTCGGGCG GCTCGAACGG GATCGGCTTC GCCATTCCCG CCAATCTCGT GCGCAGCTTC CTCGCGCAGG CGGAGGCGGG CGAGGCGCGC TTCCAGCGTC CCTGGGCCGG GGTCAACGGG CAGGCGGTCG ATGCAAGCAT GGCCGAGGCG ATGGGGCTCG AACGCCCGGA AGGGGTGGTG CTGACCGAGC TCGATCCCGA GAGCCCGTTC CGCGCCGCGG GCCTGCGCGC GGGCGATGTG GTGGTGGCGC TGGAGGGGCA GCGCACCGAC AGCCCGCAGG AGGTGATCTT CCGGCTCTCC TCCTTGGGCA TCGGCGCGCG CGCCACGGTG AGCTATCTGC GCGACGGCGA GACGCGCGAG GCCGAGATCG CGCTGGTCGT GGCGCCCGAC AAGCCGCCCC GCGAGACGGT GGCGCTGCGC GAGACGGTCC TTGCCGGGCT CACGGTCGAG CGGCTCAATC CCGCGGTGCG GGCCGAGCTG AACCTGCCCC TGACCCTCGA AGGGGTGGTG GTGCGCGCCT CCGAGGCGAC GGCGGCGCAG ACGGGCCTCC GGCCGGGCGA CATCCTGCTC GAGATCAACG GCCGCCGGAT CGAGCGCCCG CGCGATGTGG AGCGCGCGGC GCAGGAGCGG GTGCGCTGGT GGCAGATCGA CGTTCTCCGC GACGGCAAGC CGCTGCGACT GCGCTTCCGT CTCTGA
|
Protein sequence | MRHALLAFSL IALLSPLAAP AETRLPESAA EISLSFAPVV RSAAPAVVNI YATRVVEQRV SPFAADPFFD QLFRDFGRRQ PRVQNSLGSG VIVSGDGIVV SNYHVVGQAD AIRVVLNDRR EYEAEVMLAD QDSDLAVLKL KEAADLPHLG LRDSDGVEVG ELVLAIGNPF GVGQTVSQGI VSGLARSGLS IDGGRGYFIQ TDAAINPGNS GGALVDTAGR LVGINTAILT QSGGSNGIGF AIPANLVRSF LAQAEAGEAR FQRPWAGVNG QAVDASMAEA MGLERPEGVV LTELDPESPF RAAGLRAGDV VVALEGQRTD SPQEVIFRLS SLGIGARATV SYLRDGETRE AEIALVVAPD KPPRETVALR ETVLAGLTVE RLNPAVRAEL NLPLTLEGVV VRASEATAAQ TGLRPGDILL EINGRRIERP RDVERAAQER VRWWQIDVLR DGKPLRLRFR L
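The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic stop-codon check. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the example uses only the first two blocks and the final nine bases of the gene sequence rather than the full record.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGCGTCACG CTCTGCTTGC")
print(fragment)               # ATGCGTCACGCTCTGCTTGC
print(gc_fraction(fragment))  # 0.6 for this 20-base fragment only

# Final nine bases of the gene sequence: the last codon is TGA.
print(ends_with_stop("CGTCTCTGA"))  # True
```

Applying `gc_fraction` to the complete cleaned gene sequence should give a value close to the GC content listed in the Gene Information table, which is reported as a rounded percentage.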