Gene Rsph17029_0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0388 
Symbol 
ID4897569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp400776 
End bp402161 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content70% 
IMG OID640110972 
Productprotease Do 
Protein accessionYP_001042276 
Protein GI126461162 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCACG CTCTGCTTGC CTTTTCCCTG ATCGCTCTCC TGTCGCCGCT TGCCGCGCCC 
GCCGAGACGC GCCTGCCCGA GAGCGCCGCC GAGATTTCGC TCTCCTTTGC GCCGGTGGTG
CGCTCGGCGG CACCTGCGGT CGTGAACATC TATGCCACGC GGGTGGTCGA GCAGCGCGTG
AGCCCCTTTG CCGCCGATCC GTTCTTCGAC CAGCTCTTCC GGGATTTCGG TCGCCGCCAG
CCGCGGGTGC AGAACTCGCT GGGCTCCGGC GTGATCGTCT CCGGCGACGG GATCGTGGTG
TCGAACTATC ATGTGGTGGG ACAGGCGGAT GCGATCCGGG TCGTGCTGAA CGACCGGCGC
GAATATGAGG CCGAGGTGAT GCTCGCCGAT CAGGACAGCG ACCTTGCGGT ACTGAAGTTG
AAGGAGGCTG CGGATCTGCC GCATCTGGGG CTGCGCGATT CCGACGGCGT CGAGGTGGGG
GAGCTGGTGC TGGCCATCGG CAACCCGTTC GGAGTGGGCC AGACCGTATC GCAGGGCATC
GTCTCGGGGC TCGCGCGCTC GGGTCTCTCG ATCGACGGCG GGCGCGGCTA TTTCATCCAG
ACCGACGCCG CCATCAACCC CGGCAACTCG GGCGGCGCAC TGGTCGATAC TGCGGGGCGG
CTCGTGGGGA TCAACACGGC GATCCTCACC CAGTCGGGCG GCTCGAACGG GATCGGCTTC
GCCATTCCCG CCAATCTCGT GCGCAGCTTC CTCGCGCAGG CGGAGGCGGG CGAGGCGCGC
TTCCAGCGTC CCTGGGCCGG GGTCAACGGG CAGGCGGTCG ATGCAAGCAT GGCCGAGGCG
ATGGGGCTCG AACGCCCGGA AGGGGTGGTG CTGACCGAGC TCGATCCCGA GAGCCCGTTC
CGCGCCGCGG GCCTGCGCGC GGGCGATGTG GTGGTGGCGC TGGAGGGGCA GCGCACCGAC
AGCCCGCAGG AGGTGATCTT CCGGCTCTCC TCCTTGGGCA TCGGCGCGCG CGCCACGGTG
AGCTATCTGC GCGACGGCGA GACGCGCGAG GCCGAGATCG CGCTGGTCGT GGCGCCCGAC
AAGCCGCCCC GCGAGACGGT GGCGCTGCGC GAGACGGTCC TTGCCGGGCT CACGGTCGAG
CGGCTCAATC CCGCGGTGCG GGCCGAGCTG AACCTGCCCC TGACCCTCGA AGGGGTGGTG
GTGCGCGCCT CCGAGGCGAC GGCGGCGCAG ACGGGCCTCC GGCCGGGCGA CATCCTGCTC
GAGATCAACG GCCGCCGGAT CGAGCGCCCG CGCGATGTGG AGCGCGCGGC GCAGGAGCGG
GTGCGCTGGT GGCAGATCGA CGTTCTCCGC GACGGCAAGC CGCTGCGACT GCGCTTCCGT
CTCTGA
 
Protein sequence
MRHALLAFSL IALLSPLAAP AETRLPESAA EISLSFAPVV RSAAPAVVNI YATRVVEQRV 
SPFAADPFFD QLFRDFGRRQ PRVQNSLGSG VIVSGDGIVV SNYHVVGQAD AIRVVLNDRR
EYEAEVMLAD QDSDLAVLKL KEAADLPHLG LRDSDGVEVG ELVLAIGNPF GVGQTVSQGI
VSGLARSGLS IDGGRGYFIQ TDAAINPGNS GGALVDTAGR LVGINTAILT QSGGSNGIGF
AIPANLVRSF LAQAEAGEAR FQRPWAGVNG QAVDASMAEA MGLERPEGVV LTELDPESPF
RAAGLRAGDV VVALEGQRTD SPQEVIFRLS SLGIGARATV SYLRDGETRE AEIALVVAPD
KPPRETVALR ETVLAGLTVE RLNPAVRAEL NLPLTLEGVV VRASEATAAQ TGLRPGDILL
EINGRRIERP RDVERAAQER VRWWQIDVLR DGKPLRLRFR L