Gene Rsph17025_0887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0887 
Symbol 
ID5083613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp904019 
End bp905500 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content68% 
IMG OID640482444 
Productprotease Do 
Protein accessionYP_001167095 
Protein GI146276936 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0626906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.172287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGTCTC ACGCCATTTC CATCGCCCGC CGGAAGGAAC CGGTGCCCAT CGCCGCATGG 
CGCCTTTTCC TCGCGCTGAT GCTGGGCCTG GGGCTGGCGC TTGCGCAGGC GGTGTCGGCC
CATGCGCAGG GGGCCCCGGC CAGCTTCGCC GGTCTCGCCG AAAAGATCAG CCCGGCGGTG
GTGAACATCA CCACCTCGAC CGTTGTCGCG GCACCCACGC AGAGTTCTCC GCTCGTGCCC
GAAGGCTCGC CCTTCGAGGA TTTCTTCCGC GACTTCATGG ACCCGCAGAA CCGCGAAGGT
GGACCGCGCC GCTCCGAGGC GCTGGGCTCG GGCTTCGTGA TCTCGGAAGA CGGCTTCATC
GTGACCAACA ACCATGTCAT CGAAGGGGCG GACGACATCC AGATCGAGTT CTTCTCGGGC
AACAAGCTCG AGGCGAAGCT CGTGGGCACC GATCCCAAGA CCGACATCGC CCTGCTCAAG
GTCTCGAGCA ACCAGCCGCT CCCGTTCGTG AGCTTCGGCA ACTCGGATCT CGCGCGGGTG
GGCGACTGGG TGGTGGCGAT GGGCAACCCT CTGGGGCAGG GCTTTTCGGT CTCGGCCGGG
ATCATCTCGG CGCGCAACCG GGCGCTCTCG GGCACCTACG ATGACTACAT CCAGACCGAC
GCCGCCATCA ACCGCGGCAA CTCGGGCGGG CCGCTGTTCA ATCTCGACGG TCAGGTGATC
GGCGTGAACA CGGCGATCCT CTCGCCCAAC GGCGGCTCGA TCGGGATCGG CTTCTCGATG
GCCTCGAACG TGGTGGTGAA GGTCGTCCAG CAACTTCGCG AGTTCGGCGA GACGCGGCGC
GGCTGGCTCG GCGTGCGGAT CCAGGACGTG ACCCCGGACG TGGCCGAGGC GATGGGGCTG
GCCGAGGCGA AGGGGGCGCT GGTGACGGAC GTGCCCGACG GGCCTGCGAA AGAGGCCGGA
ATGCAGTCGG GCGACGTGAT CGTGACCTTT GACAAGGCGC CGGTGGCCGA CACCCGCGAT
CTCGTGCGCC GCGTGGCGGA CGCCCCGATC GGTGAGGCCG TGCGCGTGGT CGTGATGCGT
GAAGGCAAGA CCCGCACGCT CTCCGTGGTG CTCGGGCGCC GGGAGGAAGC CGAGGGTGAG
GGCCCGGCGG CGTCCGTCGA GTCTGCCCCG ACGGAACCTT CGACCGCCAA CCTTCTGGGC
CTGACAGTGG CTCCGCTGAC GGCCGAGCAG GCCGCCGAGC TGGGTCTGCC GCCCGGCACC
GAGGGGCTCG CGGTGACGGA TGTGGACACG GCCTCCGAGG CCTATTCCAA GGGGCTGCGC
GAGGGGGATG TCATCACCGA GGCGGGTCAG CAGAAGGTCA TGACGATCAA GGATCTCCAG
GACCGCGTGG ACGAGGCCCG CGAGGCCGGG CGCAAGTCGC TCCTGCTTCT AATCCGACGG
GGCGGTGACC CACGCTTCGT GGCTCTCACG ATCACCGAAT GA
 
Protein sequence
MQSHAISIAR RKEPVPIAAW RLFLALMLGL GLALAQAVSA HAQGAPASFA GLAEKISPAV 
VNITTSTVVA APTQSSPLVP EGSPFEDFFR DFMDPQNREG GPRRSEALGS GFVISEDGFI
VTNNHVIEGA DDIQIEFFSG NKLEAKLVGT DPKTDIALLK VSSNQPLPFV SFGNSDLARV
GDWVVAMGNP LGQGFSVSAG IISARNRALS GTYDDYIQTD AAINRGNSGG PLFNLDGQVI
GVNTAILSPN GGSIGIGFSM ASNVVVKVVQ QLREFGETRR GWLGVRIQDV TPDVAEAMGL
AEAKGALVTD VPDGPAKEAG MQSGDVIVTF DKAPVADTRD LVRRVADAPI GEAVRVVVMR
EGKTRTLSVV LGRREEAEGE GPAASVESAP TEPSTANLLG LTVAPLTAEQ AAELGLPPGT
EGLAVTDVDT ASEAYSKGLR EGDVITEAGQ QKVMTIKDLQ DRVDEAREAG RKSLLLLIRR
GGDPRFVALT ITE