Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3978 |
Symbol | |
ID | 4898815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 1118845 |
End bp | 1120296 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640114581 |
Product | protease Do |
Protein accession | YP_001045828 |
Protein GI | 126464715 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0104728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATGC CCACCCCGCG TTCCTCGCTG AAGGCCGTCC TGATCGCCAG CACCCTCATC ACCGCCGGTG TCGCCGGCAC GGCCCTGCCG CCGACGGCGG CCCGGGCCGA GGTGCCGATG CAGGGCTATG CCGATCTCGT CGCCCGCGTC TCGCCCGCCG TCGTCTTCAT CGAGGTGACG GCCAAGTCGA AGGAGTCCAC GCCGATGGCG GGCTCTCCCT TCGAGGAATT CCTCCGCCGC TTCGGCGAGA TCGACCCGCA GTTCCGCATG CCGCAGGCCC CCGAGGGCGG GCAGGTCATG CACGGGCTCG GGTCGGGCTT CCTGATCTCG CAGGACGGCA TCATCGTCAC CAACAACCAT GTGGTCGAAA ATGCGACCGA CATGAAGGTC AAGCTCGAGG ACGGCCGCGA GTTCAAGGCC GAAGTCGTGG GCACGGATCC GATGACCGAC ATCGCGGTGA TCCGGCTGAA GGATGCCAAG GACCTGCCCT TCGTCGAGCT CGGCGACAGC GAGAAGCTGC GCGTGGGGGA TGCGGTGGTG GCCGTCGGCA ACCCGTTCGG GCTGGGCGGC ACCGTGACCT CGGGCATCGT CTCGGCCATG GGGCGCAACA TCAACTCGGG CCCCTACGAC GACTACATCC AGACCGACGC CGCCATCAAC CGCGGCAACT CGGGCGGCCC GCTGTTCGAC ACCGAGGGCA AGGTCGTCGG CATGAACACC GCGATCTTCT CGCCCTCCGG CGGCTCGGTG GGCATCGGCT TCTCGATCCC CGCGAACACG GTCAAGGATG TCGTGGCCCA GCTTCAGGAC AAGGGTTCGG TCTCGCGCGG CTGGCTCGGG GTCACGGTTC AGGGCATGAC TCCCGAGATC GCTCAGGCCA TGGGGCTTGA GGGGCGCGAC GGGGCCCTCG TGGCCGAGGT GCAGCAGGGC AGCCCCGCCG ATGAGGGCGG TCTCGAGAGC GGCGATGTCA TCACGGCGGT GAACGGGCAG GAACTGACGG AGCGGGCGAG CCTGCCTCGG CTGATCGCGG CCATCCCGAA CGGCGAGAAA GCCCAACTCA CGGTCCAGCG CGATGGACGC CAGCAGGAGA TGACCGTGAC GATCGGAGAA CTGACCCCCG ACCGGGCGCA GGTCGCCTCG GCCGAGTCGC CCGAAGGGCT CGGCGGGCCG CTCGGCATCG AGGTCCAGCC GCTCGAGCCC GCGCTGGCAC GCCAGCTGGG CCTGCCGGAC GGCGCCTCGG GCATCGTGGT GACGGCGGTC GATCCGTCGG GGCCGAACGC CGACCGGCTC GCGCCGGGCG ACGTGATCCA GGAAGCGGCC GGCCACCCGA TCGAGACGCC GCGCGATCTG GCCTCGGCAA TGCGCGAGGC GCGCGGCAAG GGCGTGATGC TGATGAAGGT GCTGCGGCAG GGCAACCCGG TCTATGTGGG CGCCGAAGTG GCCTCGTCCT GA
|
Protein sequence | MPMPTPRSSL KAVLIASTLI TAGVAGTALP PTAARAEVPM QGYADLVARV SPAVVFIEVT AKSKESTPMA GSPFEEFLRR FGEIDPQFRM PQAPEGGQVM HGLGSGFLIS QDGIIVTNNH VVENATDMKV KLEDGREFKA EVVGTDPMTD IAVIRLKDAK DLPFVELGDS EKLRVGDAVV AVGNPFGLGG TVTSGIVSAM GRNINSGPYD DYIQTDAAIN RGNSGGPLFD TEGKVVGMNT AIFSPSGGSV GIGFSIPANT VKDVVAQLQD KGSVSRGWLG VTVQGMTPEI AQAMGLEGRD GALVAEVQQG SPADEGGLES GDVITAVNGQ ELTERASLPR LIAAIPNGEK AQLTVQRDGR QQEMTVTIGE LTPDRAQVAS AESPEGLGGP LGIEVQPLEP ALARQLGLPD GASGIVVTAV DPSGPNADRL APGDVIQEAA GHPIETPRDL ASAMREARGK GVMLMKVLRQ GNPVYVGAEV ASS
|
| |