Gene Rsph17029_1999 details


Gene Information

Locus tag: Rsph17029_1999
Symbol:
ID: 4896200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2117282
End bp: 2118802
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 68%
IMG OID: 640112593
Product: protease Do
Protein accession: YP_001043875
Protein GI: 126462761
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
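The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2117282
END_BP = 2118802


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1521 506
```

Both results match the Gene Length (1521 bp) and Protein Length (506 aa) fields.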


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0697759
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAGTTGG CCGCTCAGCT ATCTCGAAAG GAGTCGGGTG TGCAGTCTCA CGCCATTACC 
ATCGCCCGCC GCATCCACCC GGTGCCCGCG TCGGTCTCGC GTCTCTTCCT CGCGCTGATG
CTCGGGCTCG CCCTGGCGCT GGCGCAGGCC GTGGCTGTCA AGGCGCAGAA CGCTCCCGCA
AGTTTCGCAG GCCTCGCCGA GAAGATCAGC CCGGCCGTCG TGAACATTAC GACCTCGACC
GTCGTGGCGG CACCCACGCA GAATTCGCCC CTCGTGCCCG AAGGCTCGCC CTTCGAGGAT
TTCTTCCGTG ACTTCATGGA CCCGCAGAAC CGCGGCGAGG GCCCGCGCCG CTCCGAGGCG
CTGGGCTCCG GCTTCGTGAT CTCGGAAGAC GGCTACATCG TCACCAACAA CCATGTCATC
GAAGGGGCCG ACGACATTCA GATCGAGTTC TTCTCGGGCA AGAAGCTCGA GGCGAAGCTC
GTCGGCACCG ATCCGAAGAC CGACATCGCG CTGCTGAAGG TCGATGGGAA CCAGCCGCTG
CCCTTCGTGA GCTTCGGCAA CTCGGACCTC GCCCGCGTCG GCGACTGGGT CGTGGCGATG
GGCAACCCGC TGGGGCAGGG CTTCTCGGTC TCGGCCGGCA TCGTGTCGGC GCGCAACCGG
GCCCTCTCCG GCACCTACGA CGATTACATC CAGACCGACG CCGCCATCAA CCGCGGCAAT
TCGGGCGGCC CGCTGTTCAA CATGGACGGG CAGGTGATCG GCGTAAACAC GGCGATCCTG
TCGCCGAACG GCGGCTCGAT CGGCATCGGC TTCTCGATGG CCTCGAACGT GGTGGTGAAG
GTCGTCCAGC AGCTGCGCGA GTTCGGCGAG ACCCGCCGCG GCTGGCTCGG CGTGCGGATC
CAGGACGTGA CCCCCGACGT GGCCGAGGCG ATGGGCCTCA CCGAGGCCAA AGGCGCCCTC
GTGACCGACG TGCCGGAAGG CCCTGCGAAA GAGGCCGGCA TGCAGTCGGG CGACGTGATC
GTGACCTTCG ATAGCGCGCC CGTGGCGGAC ACCCGCGATC TGGTGCGCCG GGTGGCCGAT
GCGCCCATTG GCGAAGCGGT GCGGGTCATC GTGATGCGCG AAGGCAAGAC CCGGACCCTG
TCGGTGACGC TGGGGCGTCG CGAGGAAGCC GAGAACGAAG GCCCCGAGGC ACCCGGCGCG
GCCGAGCCGA CCGAGCCGTC GACGGCCGAT CTTCTGGGCC TGACCGTGGC GCCGCTCACG
GCCGAGCAGG CCGGAGAGCT GGGCCTGCCC GGCGGCACCG AGGGGCTTGC GGTGACGGAC
GTCGATCCGG CCTCAGAGGC CTATTCCAAG GGCTTGCGCG AGGGGGACGT GATCACCGAA
GCCGGCCAGC AGAAAGTGGT CTCGATCAAG GATCTGCAGG ACCGTGTGAC CGAGGCGCGG
GAGGCGGGGC GGAAATCGCT GCTCCTGCTG ATCCGCCGCG GCGGCGATCC GCGTTTCGTG
GCCCTGACGG TCAGCGAGTA G
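
The GC content field reported for this gene could be recomputed from the gene sequence with a sketch like the one below. It assumes the sequence is pasted as plain text in the 10-base groups shown above, so whitespace is stripped first; `gc_fraction` is a hypothetical helper name, and the usage line covers only the first 20 bases, not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 20 bases of the gene sequence above: 12 of them are G or C.
print(gc_fraction("TTGCAGTTGG CCGCTCAGCT"))  # prints: 0.6
```

Passing the full gene sequence to the same function gives the gene-wide value to compare against the GC content field.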
 
Protein sequence
MQLAAQLSRK ESGVQSHAIT IARRIHPVPA SVSRLFLALM LGLALALAQA VAVKAQNAPA 
SFAGLAEKIS PAVVNITTST VVAAPTQNSP LVPEGSPFED FFRDFMDPQN RGEGPRRSEA
LGSGFVISED GYIVTNNHVI EGADDIQIEF FSGKKLEAKL VGTDPKTDIA LLKVDGNQPL
PFVSFGNSDL ARVGDWVVAM GNPLGQGFSV SAGIVSARNR ALSGTYDDYI QTDAAINRGN
SGGPLFNMDG QVIGVNTAIL SPNGGSIGIG FSMASNVVVK VVQQLREFGE TRRGWLGVRI
QDVTPDVAEA MGLTEAKGAL VTDVPEGPAK EAGMQSGDVI VTFDSAPVAD TRDLVRRVAD
APIGEAVRVI VMREGKTRTL SVTLGRREEA ENEGPEAPGA AEPTEPSTAD LLGLTVAPLT
AEQAGELGLP GGTEGLAVTD VDPASEAYSK GLREGDVITE AGQQKVVSIK DLQDRVTEAR
EAGRKSLLLL IRRGGDPRFV ALTVSE
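
The gene begins with TTG while the protein begins with M, which is expected under translation table 11, where several alternative start codons are read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes table 11 shares the standard codon assignments, and the start-codon set is written from memory of the NCBI table, so it is worth checking against the NCBI genetic code listing; `translate` is a hypothetical helper, not a library call.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a CDS; an initial start codon becomes M, stop ends it."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGCAGTTGG CCGCTCAGCT ATCTCGAAAG"))  # prints: MQLAAQLSRK
```

The output matches the first ten residues of the protein sequence above.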