Gene Rsph17025_2507 details


Gene Information

Locus tag: Rsph17025_2507
Symbol:
ID: 5084858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2556689
End bp: 2558074
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 70%
IMG OID: 640484069
Product: protease Do
Protein accession: YP_001168700
Protein GI: 146278541
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
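
The start, end, gene length, and protein length fields above are mutually consistent. A minimal sketch of that check follows; it assumes 1-based inclusive coordinates and a gene length that includes one stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2556689, 2558074)
print(length_bp, protein_length(length_bp))  # 1386 461
```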


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0254484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCAGG CCTTGCTTGC CCTCCTTTTG ACCGCCGCCC TGTTTCCCCT GCCGATTCTG 
GCCGAGACAC GGATCCCGCA AAGCGCGGCC GAGATCTCGC TGACCTTTGC CCCGGTTGTC
CGGTCGGCGG CCCCGGCGGT GGTGAACATC TACGCCACGC GCGTTGTGGC CCAGCGGGTG
AGCCCCTTTG CGGCCGACCC GTTCTTCGAC CAGTTGTTCC GTGACTTCGG ACGCCGCCAG
CCGCGGGTTC AGAACTCGCT CGGCTCGGGC GTGATTGTAG CCGCGGACGG CATCGTGGTG
TCGAACTACC ATGTCGTCGG GCAGGCCGAT GCGATCCGTG TCGTGCTGAA CGACCGCCGC
GAGTATGAGG CCGAGGTGAT GCTGTCCGAT CAGGACAGCG ATCTTGCCGT GCTGAAGCTG
AGGGATGCGC ACGATCTGCC CTATCTGTCG CTGCGCGATT CGGACGGGAT CGAGGTGGGC
GAACTGGTGC TCGCGATCGG CAACCCGTTC GGCGTGGGCC AGACGGTGTC GCAGGGAATC
GTCTCGGGGC TTGCGCGGTC GGGCCTTTCG ATCGACGGCG GGCGGGGCTA CTTCATCCAG
ACCGACGCGG CCATCAATCC CGGCAACTCG GGCGGGGCGC TGGTCGATAC GGCCGGAAGG
CTCGTAGGCA TCAACACCGC GATCCTCACC CAGTCGGGCG GGTCGAACGG GATCGGCTTC
GCCATCCCCG CCAACCTGGT GCGCAGCTTC CTGGCGCAGG CCGCGGCGGG CGAGAGCCGC
TTTCAGCGCC CGTGGGCGGG GGTGAACGGG CAGGTGGTCG ATGCGGCGAT GGCCGAGGCG
ATGGGGCTCG AGCGGCCCGA GGGAGTCGTC CTGACCGAAC TCGATCCCGA AAGCCCGTTC
CGCGCGGCGG GCCTGCGCTC GGGTGATGTG GTGGTGGCGC TTGGCGGCCA GCGGACGGAC
AGTCCGCAGG AGGTGATGTT CCGCCTGTCC TCCATGGGGA TCGGCTCGCG CACCACGGTC
ACCTATCTGC GCGACGGCCA GACGCGCGAG GCCGAGGTTG CGCTGATGGC GCCGCCCGAC
AATCCCCCGC GTGAGACGCT GACGCTGCGC GAGACGGCGC TTGCGGGCCT CACGGTCGAG
CGGCTGAACC CGGCCGTCCG GGCCGAGATG AACCTGCCGC TGACGCTGGA AGGTGTCGTG
GTGCGGGGCG CCGAGGCGCA CGCGGCGCGG GCCGGGCTCC GGCCGGGCGA TCTCCTTCTG
GAGATCAACG GCCGCCGCAT CGAGCGGTCG CGCGATGTGC TGGTGGCGGC GCGGGCACAG
TCGCGCTGGT GGCAGATCGA CGTGCTGCGC GACGGCCAGT CGCTGCGTCT CCGCTTCCGT
CTGTAG
 
Protein sequence
MRQALLALLL TAALFPLPIL AETRIPQSAA EISLTFAPVV RSAAPAVVNI YATRVVAQRV 
SPFAADPFFD QLFRDFGRRQ PRVQNSLGSG VIVAADGIVV SNYHVVGQAD AIRVVLNDRR
EYEAEVMLSD QDSDLAVLKL RDAHDLPYLS LRDSDGIEVG ELVLAIGNPF GVGQTVSQGI
VSGLARSGLS IDGGRGYFIQ TDAAINPGNS GGALVDTAGR LVGINTAILT QSGGSNGIGF
AIPANLVRSF LAQAAAGESR FQRPWAGVNG QVVDAAMAEA MGLERPEGVV LTELDPESPF
RAAGLRSGDV VVALGGQRTD SPQEVMFRLS SMGIGSRTTV TYLRDGQTRE AEVALMAPPD
NPPRETLTLR ETALAGLTVE RLNPAVRAEM NLPLTLEGVV VRGAEAHAAR AGLRPGDLLL
EINGRRIERS RDVLVAARAQ SRWWQIDVLR DGQSLRLRFR L
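
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for joining the blocks into one string and computing a GC fraction follows; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First displayed line of the gene sequence, copied from the record above.
first_line = "ATGCGCCAGG CCTTGCTTGC CCTCCTTTTG ACCGCCGCCC TGTTTCCCCT GCCGATTCTG"
dna = join_blocks(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Applying the same two helpers to the full gene sequence would give the whole-gene GC fraction for comparison with the GC content field.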