Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3140 |
Symbol | |
ID | 5085299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | + |
Start bp | 230 |
End bp | 1681 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640484712 |
Product | hypothetical protein |
Protein accession | YP_001169329 |
Protein GI | 146279171 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.673481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.234219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTGC CCACCCCCCG ATCCTCGCTG AAGGCCGTCC TGATTGCCAG CACCCTCCTT GCCGGAGGTG CCGTCGGCAC CGCGCTGCCG GTGGCGCCTG CCCACGCCGA GGTGCCGATG CAGGGCTACG CCGATCTCGT GGCCCGTGTC TCGCCGGCCG TCGTCTTCAT CGAGGTGACC GCCAAGTCGC AGGAGCCGGC CCCGAGGGCG GCATCCCCGC TCGAGGAGTT CCTTCGCCGC TTCGGCGAGA TCGACCCGCA ATTCCGCATG CCCGCCCCGC CGGAGCGGGA CCGCGTCATG CACGGGCTCG GGTCGGGGTT CCTGATCTCG CAGGACGGCG TGATCGTGAC CAACAACCAC GTTGTCGAGA ATGCCACCGA CATGACCGTC AAGCTCGAGG ACGGGCGCGA GTTCAAGGCC GAGATGGTGG GCGCCGATCC CATGACCGAC ATCGCCGTGA TCCGGCTGCG GGATGCGAGT GATCTGCCCT TCGTCGAGTT CGGGGACAGC GACCGGCTGC GCGTGGGCGA TGCGGTCGTG GCGGTCGGCA ATCCGTTCGG CCTTGGCGGG ACGGTCACGT CGGGCATCGT CTCGGCCATG GGGCGCAACA TCAACTCCGG CCCCTATGAC GACTACATCC AGACCGACGC CGCCATCAAC CGGGGCAACT CGGGCGGACC GCTCTTCGAC ACGAGCGGCA CGGTGGTGGG CATGAACACG GCGATCTTCT CGCCCACGGG AGGCTCGGTT GGCATCGGCT TCTCGATCCC GGCCAACACG GTGCGGGATG TCGTGGCGCA ACTGCAGGAA ACGGGTTCGG TCTCGCGCGG ATGGCTGGGC GTGACGATCC AGCCCCTGAC GCCCGAGATC GCGCAGGCGC TGGGTCTCGA GGGCAGCCGG GGGGCGCTCG TGGCCGAGGT GCAGCCGGAC AGCCCGGCCG AGGCGGGAGG CGTCGAGAGC GGCGATGTCA TCACCGCCGT CAACGGGCAG GAGATCGGCG AGCGGTCCAG CCTGCCCCGG CTGATCGCGG CCATCCCGAA CGGCGAGGAG GCCCGGCTCA CCGTTCAGCG CGACGGGCGC GAGCGTGAGA TGACGGTCAC GATCGGCGAG CTGTCGGCCG ACCGGCTGGA GCCCGCCGCG GCCGCCGCGC CGGAGGGGCT GGGCGCGCCG CTCGGGCTCG AGGTTCAGCC GCTGGAGCCT GCGCTGGCCC GGCAACTCGG ACTGCCCGAA GATGCCTCGG GCGTGGTGGT GACGGCGGTC GATCCGGCTG GCCCGAACGC CGACCGGCTG GCGCCCGGAG ACGTGATCGA GGAGGCCGGT GGGCGTGCGA TCGCGACGCC GCGGGATCTT GCCTCGGCCG TGGCGGAGGC GCGCGGCCGC GGGGTCCTGT TGCTGAAGGT GCTGCGGCAG GGCAATCCCG TCTATGTGGG TGCCGAGGTC GCCGCGTCCT GA
|
Protein sequence | MSLPTPRSSL KAVLIASTLL AGGAVGTALP VAPAHAEVPM QGYADLVARV SPAVVFIEVT AKSQEPAPRA ASPLEEFLRR FGEIDPQFRM PAPPERDRVM HGLGSGFLIS QDGVIVTNNH VVENATDMTV KLEDGREFKA EMVGADPMTD IAVIRLRDAS DLPFVEFGDS DRLRVGDAVV AVGNPFGLGG TVTSGIVSAM GRNINSGPYD DYIQTDAAIN RGNSGGPLFD TSGTVVGMNT AIFSPTGGSV GIGFSIPANT VRDVVAQLQE TGSVSRGWLG VTIQPLTPEI AQALGLEGSR GALVAEVQPD SPAEAGGVES GDVITAVNGQ EIGERSSLPR LIAAIPNGEE ARLTVQRDGR EREMTVTIGE LSADRLEPAA AAAPEGLGAP LGLEVQPLEP ALARQLGLPE DASGVVVTAV DPAGPNADRL APGDVIEEAG GRAIATPRDL ASAVAEARGR GVLLLKVLRQ GNPVYVGAEV AAS
|
| |