Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1014 |
Symbol | |
ID | 5321859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1084606 |
End bp | 1086003 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640789956 |
Product | protease Do |
Protein accession | YP_001326702 |
Protein GI | 150396235 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.370386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.451123 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTTTG GCCACCGAGC CGCCGCGGCT ATCGTCCTCG CAATTGCCAT TTCGACGCCT GCGATGGCTC AGGATACCAG GACTGTGCCG CAGTCGCGGG CGGAGATGCA ACTCTCTTTC GCGCCGCTCG TCAAACAGAC GGCGAATGCG GTCGTGAACG TCTATGCCGA GCGCGCGGTG GAACGGCGGT CGATCTTTGC CGGGGACCCT TTCTTCGAGG AGTTTTTCGG TCAGCGGATG CCCAATCGCA CCGAAAAGCA GTCATCGCTC GGATCAGGCG TGATCGTCGG CCGCAACGGC CTGGTCGTCA CCAACAACCA TGTCATCGAT GGTGCCGACG ATATCAAGGT GGCACTGGCC GACGGGCGGG AGTTCCCTTG CAAGCTTATA TTGAAGGACG ATCGCCTGGA CCTCGCGGTC ATGAAAATCC AGTCGGACGG CCCGTTCGAC ATCATCCCGA TCGGCGATTC CGACGCGGTG GAAGTCGGGG ACCTTGTGCT GGCGATGGGT AATCCTTTCG GTGTCGGGCA GACGGTCACG AGCGGCATCG TGTCGGCACT CGCCCGTAAC CAGATTTCCA ACGGGGATTT CGGTTTTTTC ATCCAGACGG ATGCAGCCAT CAATCCCGGC AATTCCGGCG GGGGTCTGAT CGACATGAAG GGCGAGTTGA TCGGAATCAA CACCGCGATT TTCTCAAGAG GCGGCGGTTC CAACGGTGTC GGCTTTGCGA TCCCTGCCAA TCTGGTCAAG GTTTTCGTGG CCTCCGCTGA AGGAGGCAAT GGCTCATTCA TTCGGCCCTT CGTCGGAGCG ACCTTCGAAC CGGTGACGTC CGACGTGGCC GAGGCGCTTG GACTTGAACG GGCGCGTGGG GCGCTGGTGA CGGCGGTTGT CGCGGGCGGT CCGGCCGAGA GCGCCGGCAT GCGCCCCGGC CAGGTGGTCA CCGCCGTCAA CGATATACCG GTCGAACACC CCGATGCGCT CGGCTACCGC CTGACGACGG TCGGGATCGG GCATGAGGCG CGCGTGACGG TTTCGGAGAA CGGCGATTTG CGCGAAATCA CCCTCCGGCT GGAGCGGGCG CCGGAAACTC AGCCGCGTGA CGAACGGCTG ATCGAGGGTC GCAATCCCTT CGCCGGCGCC GTGGTGGCAA ATCTCTCACC CCGGCTTGCC GAGGAGTTGC GCATGCCGAC GTCGCTGCAG GGCGTGGTGG TCACCGAGAT CAATCGCGGC TCGCCGGCCG CTCGCATCGG CCTCGAACCG AAAGACATTG TTCGTTCTGT CAACGGCACC GCAATCGAGA GTTCGAAGAC ACTGGAAAGC GTCGTCGCCG AGGATGCTTC CTTCTGGCGT GTCGAGATCG AACGCAACGG CCAGATCATC CGTCAGTTCT TCCGATGA
|
Protein sequence | MIFGHRAAAA IVLAIAISTP AMAQDTRTVP QSRAEMQLSF APLVKQTANA VVNVYAERAV ERRSIFAGDP FFEEFFGQRM PNRTEKQSSL GSGVIVGRNG LVVTNNHVID GADDIKVALA DGREFPCKLI LKDDRLDLAV MKIQSDGPFD IIPIGDSDAV EVGDLVLAMG NPFGVGQTVT SGIVSALARN QISNGDFGFF IQTDAAINPG NSGGGLIDMK GELIGINTAI FSRGGGSNGV GFAIPANLVK VFVASAEGGN GSFIRPFVGA TFEPVTSDVA EALGLERARG ALVTAVVAGG PAESAGMRPG QVVTAVNDIP VEHPDALGYR LTTVGIGHEA RVTVSENGDL REITLRLERA PETQPRDERL IEGRNPFAGA VVANLSPRLA EELRMPTSLQ GVVVTEINRG SPAARIGLEP KDIVRSVNGT AIESSKTLES VVAEDASFWR VEIERNGQII RQFFR
|
| |