Gene Information |
Locus tag | RSP_3242 |
Symbol | |
ID | 3721847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | + |
Start bp | 299440 |
End bp | 300891 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640072915 |
Product | putative trypsin-like serine protease |
Protein accession | YP_354755 |
Protein GI | 77465252 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
| |
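The coordinate and length fields in the record above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a stop codon that is not counted in Protein Length; the function names `gene_length_bp` and `protein_length_aa` are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the RSP_3242 record
length = gene_length_bp(299440, 300891)
print(length)                     # 1452
print(protein_length_aa(length))  # 483
```

Both results agree with the Gene Length (1452 bp) and Protein Length (483 aa) fields, which is consistent with the inclusive-coordinate assumption.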
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.263085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGC CCACCCCGCG TTCCTCGCTG AAGGCCGTCC TGATCGCCAG CACCCTCATC ACCGCCGGTG TCGCCGGCAC GGCCCTGCCG CCGACCGCGG CCCGGGCCGA GGTGCCGATG CAGGGCTATG CCGATCTCGT CGCCCGCGTC TCGCCCGCCG TCGTCTTCAT CGAGGTGACG GCCAAGTCGA AGGAGTCCAC GCCGATGGCG GGCTCTCCCT TCGAGGAATT CCTCCGCCGC TTCGGCGAGA TCGACCCGCA GTTCCGCATG CCGCAGGCCC CCGAGGGCGG GCAGGTCATG CACGGGCTCG GGTCGGGCTT CCTGATCTCG CAGGACGGCA TCATCGTCAC CAACAACCAT GTGGTCGAAA ATGCGACCGA CATGAAGGTC AAGCTCGAGG ACGGCCGCGA GTTCAAGGCC GAAGTCGTGG GCACCGATCC GATGACCGAC ATCGCGGTGA TCCGGCTGAA GGATGCCAAG GACCTGCCCT TCGTCGAGCT TGGCGACAGC GAGAAGCTGC GCGTGGGGGA TGCGGTGGTG GCCGTCGGCA ACCCGTTCGG GCTGGGCGGC ACCGTGACCT CGGGCATCGT CTCGGCCATG GGGCGCAACA TCAACTCGGG CCCCTACGAC GACTACATCC AGACCGACGC GGCCATCAAC CGCGGCAACT CGGGCGGCCC GCTGTTCGAC ACCGAGGGCA AGGTCGTCGG CATGAACACC GCGATCTTCT CGCCCTCCGG CGGCTCGGTG GGCATCGGCT TCTCGATCCC CGCGAACACG GTCAAGGATG TCGTGGCGCA GCTTCAGGAC AAGGGTTCGG TCTCGCGCGG CTGGCTCGGG GTCACGGTTC AGGGCATGAC TCCCGAGATC GCTCAGGCCA TGGGGCTTGA GGGGCGCGAC GGGGCCCTTG TGGCCGAGGT GCAGCAGGGT AGCCCCGCCG ACGAGGGCGG TCTCGAGAGC GGCGACGTCA TCACGGCGGT GAACGGTCAG GAGCTGACGG AGCGGGCGAG CCTGCCGCGG CTGATCGCGG CCATCCCGAA CGGCGAGAAA GCCCAACTCA CGGTCCAGCG CGATGGACGC CAGCAGGAGA TGACCGTGAC GATCGGAGAA CTGACCCCCG ACCGGGCGCA GGTCGCCTCG GCCGAGTCGC CCGAAGGGCT CGGCGGGCCG CTCGGCATCG AGGTCCAGCC GCTCGAGCCC GCGCTGGCAC GCCAGCTCGG CCTGCCGGAC GGCGCCTCGG GCGTCGTGGT GACGGCGGTC GATCCCTCGG GGCCGAACGC CGACCGGCTC GCGCCGGGCG ACGTGATCCA GGAAGCGGCC GGCCACCCGA TCGAGACGCC GCGCGATCTG GCCTCGGCGA TGCGCGAGGC GCGCGGCAAG GGCGTGATGC TGATGAAGGT GCTGCGGCAG GGCAACCCGG TCTATGTGGG CGCCGAAGTG GCCTCGTCCT GA
|
Protein sequence | MPMPTPRSSL KAVLIASTLI TAGVAGTALP PTAARAEVPM QGYADLVARV SPAVVFIEVT AKSKESTPMA GSPFEEFLRR FGEIDPQFRM PQAPEGGQVM HGLGSGFLIS QDGIIVTNNH VVENATDMKV KLEDGREFKA EVVGTDPMTD IAVIRLKDAK DLPFVELGDS EKLRVGDAVV AVGNPFGLGG TVTSGIVSAM GRNINSGPYD DYIQTDAAIN RGNSGGPLFD TEGKVVGMNT AIFSPSGGSV GIGFSIPANT VKDVVAQLQD KGSVSRGWLG VTVQGMTPEI AQAMGLEGRD GALVAEVQQG SPADEGGLES GDVITAVNGQ ELTERASLPR LIAAIPNGEK AQLTVQRDGR QQEMTVTIGE LTPDRAQVAS AESPEGLGGP LGIEVQPLEP ALARQLGLPD GASGVVVTAV DPSGPNADRL APGDVIQEAA GHPIETPRDL ASAMREARGK GVMLMKVLRQ GNPVYVGAEV ASS
|
| |
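The Gene sequence field is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as the GC content field. The following is a minimal sketch, assuming the sequence has been copied into a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the RSP_3242 gene sequence
fragment = clean_sequence("ATGCCGATGC CCACCCCGCG")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.75
```

Applying the same two functions to the complete Gene sequence field would give the value to compare against the reported GC content.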