Gene Rsph17025_4050 details


Gene Information

Locus tag: Rsph17025_4050
Symbol:
ID: 5086223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 88116
End bp: 90005
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 76%
IMG OID: 640485613
Product: hypothetical protein
Protein accession: YP_001170207
Protein GI: 146280050
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
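
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The following is a minimal Python sketch, not part of the record itself. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes one stop codon; neither convention is stated in the record. The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(88116, 90005)
print(length)                  # 1890
print(protein_length(length))  # 629
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.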


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.35522
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.114627
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACC ATTTCCTCAC CAAGTCGCAG ATCGACCTGT CGCAGTGCCT GCCCTGGGGC 
GGCGGTCTGG CGCTCGAGGC CCATGGCGCG CTGCGGGCGG CGCTGGCGGC GCGGATCTCG
CAGCGGGCGG CCGATCTCTT CGCCGAGCCG CTGATCAACC GCGGCAACGA CGCGGCCCCC
GCCAGCATCT CCTGGTATTC GGCCCATGCG GGCGAGGGGC GCCCGCTGTC CGAGCTTGAC
GAGGCCGAGC AGGCGCGGGT GGCGGCGCAG CTGTCGGATC TGCTCCGCCC GGTGCGCGAG
CTTCTGGCCG ACAGCGAGGA CGGCACGCTG ATCGGGGCTG CGCTGCATCT GGCGGGCAGC
GCGCGCGGCG ATGTCTGGGT GGTCGATGGC CAGCCGGTGC TGATCAACTG GGGCATGTTG
CCGGCGGGGG CGGAGCGTTC CCAGGCCAGC CGCAGCGCCC ATTACAACCG CACGCTCGGC
CGCTTCCTGC CGCTCTCCAA GGCGCCGCCG CTGACCGAGG ACGAGCGCCG GCAACGCGCC
GATGCGGCGG GCCCTTCCCC GCTGGCGGGG GCCGCGGTCG GGGCGGGGGC CGGGATCGGG
GCCGCCGCTG CGGGCGGGGA CGCGCCCCCG CCCGAGCCGC CCGCCCCCGT CCCTCCGCCG
GACGAAGCGC CGCCGCCGCG CCGGCTGCGC GCCTGGGAAT GGGCCCCGCT GCTCGTGCTG
CTGCTGCTGG TGGGGGGGGC GGTGATCTGG CTGCTGATCC CCGGCAACCG GCTGTTCCCG
CCCCGGATGG CCGCGGTCGT CGAGGATGCG CGCGCCGCCG AGATCGCCGC CGAGATCAAT
GCCGCGCTCG AGGCGCGGCG CGCGGCGCTG CAGGCCGCGC TCGACGGGGC GCAGTGCCGG
GCCGACGGGA CGCTGATCAT GCCCGGCGGC CGGACGATCG AGGGGCTGCT GCCGCCCGTG
CCGGGCAGCC CCGCCGATCG GCCCGGCCAG CGCGCCGAGG CCGATCCGAC CCCGGTCCTG
CCGCCCGATC CCGCGCGGGT GCAGGTGCCC GACCTCGATC CCGGGGATCC CGGCAGCACC
GCCGTCGCGG ACGCCTCGCT GCTCGAGGTG ATCGAGAGCC GCACGGTGAT GGTCGTGGCG
CGCGGCCCCG ACGGGGTCGC CACCGGCTCG GGCTTCTTCG TGGCGCCGGA TCTGGTGATG
ACCAACTTCC ATGTGGTCAG CGGCGCCGCG TCCCACAGCA TCTTCGTCAC CAACCGCAGC
CTCGGCACGC TGCGCCCCAC GCAGCTTTTG CGCGCCGACG GGCCCTTCGA GCCGACGGGC
TCGGATTTCG CGCTGCTGCG CGTGCCCGGC GTCTCGGCGC GCCACTTCAC GCTGCTTCGG
CCGGCGGGCT CGCTGAAGCT GCAGAGCGTG ATCGCGGCCG GCTATCCCGG CGACGTGCTG
GCGACCGACA CGGCCTTCGC CGCCCTGACC TCGGGCGACA TCTCGGCGGT GCCGGACCTG
ACGGTCACCG ACGGCACGGT GAACACCGAG CAGGCGGTCT CGGCGGCGAT CCGGGCGGTG
GTCCATTCCG CGCCGATCTC GCAGGGCAAC TCGGGCGGGC CGCTGGTGGA CATGTGCGGG
CGGGTCGTGG GGATGAACAC CTTCGTGCGG CAGGGCGCGC TGCGGAACCT GAACTTCGCC
CTCTCCGCGC CCGACGTGAT CGGGTTCCTG CGCGCGGCGG GGGCGAGCCC CTCGATCACC
GGGACGGACT GCCGTCCCGA GGTGCTGCGC CCCGGCGTGC CGGCCGAGCA GGTCACGCCG
GTCGAGGCCG GGCCGGAGCC GGGCGCCGCC CCCGGCGGGG ACGCGCCGCG GCTGCCGGAT
TTCGGCGCCC TGCCGCCCCG TGCGGACTAG
 
Protein sequence
MADHFLTKSQ IDLSQCLPWG GGLALEAHGA LRAALAARIS QRAADLFAEP LINRGNDAAP 
ASISWYSAHA GEGRPLSELD EAEQARVAAQ LSDLLRPVRE LLADSEDGTL IGAALHLAGS
ARGDVWVVDG QPVLINWGML PAGAERSQAS RSAHYNRTLG RFLPLSKAPP LTEDERRQRA
DAAGPSPLAG AAVGAGAGIG AAAAGGDAPP PEPPAPVPPP DEAPPPRRLR AWEWAPLLVL
LLLVGGAVIW LLIPGNRLFP PRMAAVVEDA RAAEIAAEIN AALEARRAAL QAALDGAQCR
ADGTLIMPGG RTIEGLLPPV PGSPADRPGQ RAEADPTPVL PPDPARVQVP DLDPGDPGST
AVADASLLEV IESRTVMVVA RGPDGVATGS GFFVAPDLVM TNFHVVSGAA SHSIFVTNRS
LGTLRPTQLL RADGPFEPTG SDFALLRVPG VSARHFTLLR PAGSLKLQSV IAAGYPGDVL
ATDTAFAALT SGDISAVPDL TVTDGTVNTE QAVSAAIRAV VHSAPISQGN SGGPLVDMCG
RVVGMNTFVR QGALRNLNFA LSAPDVIGFL RAAGASPSIT GTDCRPEVLR PGVPAEQVTP
VEAGPEPGAA PGGDAPRLPD FGALPPRAD
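
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal Python sketch of that step, with a simple GC-percentage helper. The function names clean_sequence and gc_percent are hypothetical, and only the first and last printed lines of the gene sequence are used as sample input, so the printed GC value describes that fragment only and is not the gene-wide figure.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGGCCGACC ATTTCCTCAC CAAGTCGCAG ATCGACCTGT CGCAGTGCCT GCCCTGGGGC"
last_line = "TTCGGCGCCC TGCCGCCCCG TGCGGACTAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head.startswith("ATG"))              # does the fragment begin with a start codon?
print(tail[-3:] in {"TAA", "TAG", "TGA"})  # does it end with a stop codon?
print(round(gc_percent(head), 1))          # GC percentage of the first line only
```

Passing the full gene sequence through clean_sequence and then gc_percent would give a value that can be compared with the GC content field of the record.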