Gene Rsph17025_3140 details

Gene Information

Locus tag: Rsph17025_3140
Symbol:
ID: 5085299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 230
End bp: 1681
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 71%
IMG OID: 640484712
Product: hypothetical protein
Protein accession: YP_001169329
Protein GI: 146279171
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
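
The start and end coordinates, gene length, and protein length listed above agree with one another, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes 1-based, inclusive coordinates and a single trailing stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(230, 1681)
print(length)                  # 1452
print(protein_length(length))  # 483
```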


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.673481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.234219
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTGC CCACCCCCCG ATCCTCGCTG AAGGCCGTCC TGATTGCCAG CACCCTCCTT 
GCCGGAGGTG CCGTCGGCAC CGCGCTGCCG GTGGCGCCTG CCCACGCCGA GGTGCCGATG
CAGGGCTACG CCGATCTCGT GGCCCGTGTC TCGCCGGCCG TCGTCTTCAT CGAGGTGACC
GCCAAGTCGC AGGAGCCGGC CCCGAGGGCG GCATCCCCGC TCGAGGAGTT CCTTCGCCGC
TTCGGCGAGA TCGACCCGCA ATTCCGCATG CCCGCCCCGC CGGAGCGGGA CCGCGTCATG
CACGGGCTCG GGTCGGGGTT CCTGATCTCG CAGGACGGCG TGATCGTGAC CAACAACCAC
GTTGTCGAGA ATGCCACCGA CATGACCGTC AAGCTCGAGG ACGGGCGCGA GTTCAAGGCC
GAGATGGTGG GCGCCGATCC CATGACCGAC ATCGCCGTGA TCCGGCTGCG GGATGCGAGT
GATCTGCCCT TCGTCGAGTT CGGGGACAGC GACCGGCTGC GCGTGGGCGA TGCGGTCGTG
GCGGTCGGCA ATCCGTTCGG CCTTGGCGGG ACGGTCACGT CGGGCATCGT CTCGGCCATG
GGGCGCAACA TCAACTCCGG CCCCTATGAC GACTACATCC AGACCGACGC CGCCATCAAC
CGGGGCAACT CGGGCGGACC GCTCTTCGAC ACGAGCGGCA CGGTGGTGGG CATGAACACG
GCGATCTTCT CGCCCACGGG AGGCTCGGTT GGCATCGGCT TCTCGATCCC GGCCAACACG
GTGCGGGATG TCGTGGCGCA ACTGCAGGAA ACGGGTTCGG TCTCGCGCGG ATGGCTGGGC
GTGACGATCC AGCCCCTGAC GCCCGAGATC GCGCAGGCGC TGGGTCTCGA GGGCAGCCGG
GGGGCGCTCG TGGCCGAGGT GCAGCCGGAC AGCCCGGCCG AGGCGGGAGG CGTCGAGAGC
GGCGATGTCA TCACCGCCGT CAACGGGCAG GAGATCGGCG AGCGGTCCAG CCTGCCCCGG
CTGATCGCGG CCATCCCGAA CGGCGAGGAG GCCCGGCTCA CCGTTCAGCG CGACGGGCGC
GAGCGTGAGA TGACGGTCAC GATCGGCGAG CTGTCGGCCG ACCGGCTGGA GCCCGCCGCG
GCCGCCGCGC CGGAGGGGCT GGGCGCGCCG CTCGGGCTCG AGGTTCAGCC GCTGGAGCCT
GCGCTGGCCC GGCAACTCGG ACTGCCCGAA GATGCCTCGG GCGTGGTGGT GACGGCGGTC
GATCCGGCTG GCCCGAACGC CGACCGGCTG GCGCCCGGAG ACGTGATCGA GGAGGCCGGT
GGGCGTGCGA TCGCGACGCC GCGGGATCTT GCCTCGGCCG TGGCGGAGGC GCGCGGCCGC
GGGGTCCTGT TGCTGAAGGT GCTGCGGCAG GGCAATCCCG TCTATGTGGG TGCCGAGGTC
GCCGCGTCCT GA
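
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence listed in this layout. The sketch below assumes GC content is simply the fraction of G and C bases over the full sequence; the function name `gc_content` is illustrative, and the database may round or compute the value differently.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGTCCCTGC CCACCCCCCG ATCCTCGCTG AAGGCCGTCC TGATTGCCAG CACCCTCCTT"
print(f"{gc_content(first_line):.0%}")
```

Passing the whole gene sequence, with its spaces and line breaks left in place, gives the figure for the full gene.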
 
Protein sequence
MSLPTPRSSL KAVLIASTLL AGGAVGTALP VAPAHAEVPM QGYADLVARV SPAVVFIEVT 
AKSQEPAPRA ASPLEEFLRR FGEIDPQFRM PAPPERDRVM HGLGSGFLIS QDGVIVTNNH
VVENATDMTV KLEDGREFKA EMVGADPMTD IAVIRLRDAS DLPFVEFGDS DRLRVGDAVV
AVGNPFGLGG TVTSGIVSAM GRNINSGPYD DYIQTDAAIN RGNSGGPLFD TSGTVVGMNT
AIFSPTGGSV GIGFSIPANT VRDVVAQLQE TGSVSRGWLG VTIQPLTPEI AQALGLEGSR
GALVAEVQPD SPAEAGGVES GDVITAVNGQ EIGERSSLPR LIAAIPNGEE ARLTVQRDGR
EREMTVTIGE LSADRLEPAA AAAPEGLGAP LGLEVQPLEP ALARQLGLPE DASGVVVTAV
DPAGPNADRL APGDVIEEAG GRAIATPRDL ASAVAEARGR GVLLLKVLRQ GNPVYVGAEV
AAS
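
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration: it builds the standard codon assignments, which translation table 11 is assumed to share for all non-start positions, and it stops at the first stop codon. It does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCCTGC CCACCCCCCG ATCCTCGCTG"))  # MSLPTPRSSL
```

The output matches the first ten residues of the protein sequence, and the final 12 bases (`GCCGCGTCCT GA`) translate to the closing `AAS` followed by a stop codon.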