Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4050 |
Symbol | |
ID | 5086223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 88116 |
End bp | 90005 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640485613 |
Product | hypothetical protein |
Protein accession | YP_001170207 |
Protein GI | 146280050 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.35522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.114627 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACC ATTTCCTCAC CAAGTCGCAG ATCGACCTGT CGCAGTGCCT GCCCTGGGGC GGCGGTCTGG CGCTCGAGGC CCATGGCGCG CTGCGGGCGG CGCTGGCGGC GCGGATCTCG CAGCGGGCGG CCGATCTCTT CGCCGAGCCG CTGATCAACC GCGGCAACGA CGCGGCCCCC GCCAGCATCT CCTGGTATTC GGCCCATGCG GGCGAGGGGC GCCCGCTGTC CGAGCTTGAC GAGGCCGAGC AGGCGCGGGT GGCGGCGCAG CTGTCGGATC TGCTCCGCCC GGTGCGCGAG CTTCTGGCCG ACAGCGAGGA CGGCACGCTG ATCGGGGCTG CGCTGCATCT GGCGGGCAGC GCGCGCGGCG ATGTCTGGGT GGTCGATGGC CAGCCGGTGC TGATCAACTG GGGCATGTTG CCGGCGGGGG CGGAGCGTTC CCAGGCCAGC CGCAGCGCCC ATTACAACCG CACGCTCGGC CGCTTCCTGC CGCTCTCCAA GGCGCCGCCG CTGACCGAGG ACGAGCGCCG GCAACGCGCC GATGCGGCGG GCCCTTCCCC GCTGGCGGGG GCCGCGGTCG GGGCGGGGGC CGGGATCGGG GCCGCCGCTG CGGGCGGGGA CGCGCCCCCG CCCGAGCCGC CCGCCCCCGT CCCTCCGCCG GACGAAGCGC CGCCGCCGCG CCGGCTGCGC GCCTGGGAAT GGGCCCCGCT GCTCGTGCTG CTGCTGCTGG TGGGGGGGGC GGTGATCTGG CTGCTGATCC CCGGCAACCG GCTGTTCCCG CCCCGGATGG CCGCGGTCGT CGAGGATGCG CGCGCCGCCG AGATCGCCGC CGAGATCAAT GCCGCGCTCG AGGCGCGGCG CGCGGCGCTG CAGGCCGCGC TCGACGGGGC GCAGTGCCGG GCCGACGGGA CGCTGATCAT GCCCGGCGGC CGGACGATCG AGGGGCTGCT GCCGCCCGTG CCGGGCAGCC CCGCCGATCG GCCCGGCCAG CGCGCCGAGG CCGATCCGAC CCCGGTCCTG CCGCCCGATC CCGCGCGGGT GCAGGTGCCC GACCTCGATC CCGGGGATCC CGGCAGCACC GCCGTCGCGG ACGCCTCGCT GCTCGAGGTG ATCGAGAGCC GCACGGTGAT GGTCGTGGCG CGCGGCCCCG ACGGGGTCGC CACCGGCTCG GGCTTCTTCG TGGCGCCGGA TCTGGTGATG ACCAACTTCC ATGTGGTCAG CGGCGCCGCG TCCCACAGCA TCTTCGTCAC CAACCGCAGC CTCGGCACGC TGCGCCCCAC GCAGCTTTTG CGCGCCGACG GGCCCTTCGA GCCGACGGGC TCGGATTTCG CGCTGCTGCG CGTGCCCGGC GTCTCGGCGC GCCACTTCAC GCTGCTTCGG CCGGCGGGCT CGCTGAAGCT GCAGAGCGTG ATCGCGGCCG GCTATCCCGG CGACGTGCTG GCGACCGACA CGGCCTTCGC CGCCCTGACC TCGGGCGACA TCTCGGCGGT GCCGGACCTG ACGGTCACCG ACGGCACGGT GAACACCGAG CAGGCGGTCT CGGCGGCGAT CCGGGCGGTG GTCCATTCCG CGCCGATCTC GCAGGGCAAC TCGGGCGGGC CGCTGGTGGA CATGTGCGGG CGGGTCGTGG GGATGAACAC CTTCGTGCGG CAGGGCGCGC TGCGGAACCT GAACTTCGCC CTCTCCGCGC CCGACGTGAT CGGGTTCCTG CGCGCGGCGG GGGCGAGCCC CTCGATCACC GGGACGGACT GCCGTCCCGA GGTGCTGCGC CCCGGCGTGC CGGCCGAGCA GGTCACGCCG GTCGAGGCCG GGCCGGAGCC GGGCGCCGCC CCCGGCGGGG ACGCGCCGCG GCTGCCGGAT TTCGGCGCCC TGCCGCCCCG TGCGGACTAG
|
Protein sequence | MADHFLTKSQ IDLSQCLPWG GGLALEAHGA LRAALAARIS QRAADLFAEP LINRGNDAAP ASISWYSAHA GEGRPLSELD EAEQARVAAQ LSDLLRPVRE LLADSEDGTL IGAALHLAGS ARGDVWVVDG QPVLINWGML PAGAERSQAS RSAHYNRTLG RFLPLSKAPP LTEDERRQRA DAAGPSPLAG AAVGAGAGIG AAAAGGDAPP PEPPAPVPPP DEAPPPRRLR AWEWAPLLVL LLLVGGAVIW LLIPGNRLFP PRMAAVVEDA RAAEIAAEIN AALEARRAAL QAALDGAQCR ADGTLIMPGG RTIEGLLPPV PGSPADRPGQ RAEADPTPVL PPDPARVQVP DLDPGDPGST AVADASLLEV IESRTVMVVA RGPDGVATGS GFFVAPDLVM TNFHVVSGAA SHSIFVTNRS LGTLRPTQLL RADGPFEPTG SDFALLRVPG VSARHFTLLR PAGSLKLQSV IAAGYPGDVL ATDTAFAALT SGDISAVPDL TVTDGTVNTE QAVSAAIRAV VHSAPISQGN SGGPLVDMCG RVVGMNTFVR QGALRNLNFA LSAPDVIGFL RAAGASPSIT GTDCRPEVLR PGVPAEQVTP VEAGPEPGAA PGGDAPRLPD FGALPPRAD
|
| |