Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0637 |
Symbol | |
ID | 5321473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 682614 |
End bp | 684167 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640789573 |
Product | protease Do |
Protein accession | YP_001326328 |
Protein GI | 150395861 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.136267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATGA TCAAAACGTC CCGTCCCTCC TTGAAGACAG TTCTGAAGAC CACGACTGTT GCCGGCGTTG CCGCCGTACT GCTGACCACC GGGCTTCCGG CACAGATCAC TCAGTCCTTT GCCGAAGCGG TCAGCGTGCC GGCTCCCGCC GTTCCGAGCT TTGCCAACGT CGTCGAGGCG GTTTCGCCGG CGGTCGTTTC CGTTCGCGTG CAGGCACGTG AACAAGCCAG TGACGACGAA AGCAACTTCA CCTTTGATTT CGGCGGTCGC GGCTTCGACG ATCTGCCGGA AGACCATCCG CTTCGGCGCT TCTTCCGCGA ATTCGACCCG CGTGACAACG ACCGTGCCGA CCGGTGGCGC GACCGCCGCG GCCCGCGCGG TGAGGGCCGT CTGCGTCCGC GGGCGCAAGG CTCCGGCTTC TTCATCACCG AAGACGGCTA CCTCGTCACC AACAACCACG TCATTTCTGA CGGATCGGCC TTCACCGTCA TTATGGATGA CGGTACCGAG CTCGAAGCCA AGCTCGTCGG CAAAGACAGC CGGACAGATC TTGCAGTGCT CAAGGTGGAC GCCAAGCGAA AGTTCACACA TGTGAGCTTC GCCGATGACG AAAAGGTGCG TGTCGGCGAC TGGGTGGTCG CTGTCGGTAA TCCCTTCGGC CTTGGCGGCA CCGTGACAGC GGGGATCATC TCCGCTCGGG GCCGCGATAT CGGCTCCGGT CCTTACGACG ATTACCTGCA GGTCGACGCA GCGGTGAACC GTGGGAATTC CGGAGGCCCG ACCTTCAACC TCTCCGGAGA GGTGGTCGGA ATCAACACGG CCATATTCTC GCCTTCCGGC GGCAATGTCG GCATCGCCTT CGCAATTCCC GCCTCCGTCG CGAAGGACGT CGTTGACTCC TTGATCAAGG ACGGCACCGT TTCGCGTGGC TGGCTGGGTG TCCAGATCCA GCCGGTGACG AAGGATATTG CCGAGTCGCT CGGCCTTGCC GAGGCGAAAG GTGCTCTCGT CGTAGAGCCT CAAACGGGCT CGCCGGGCGA AAAGGCCGGC ATCAAGAACG GCGACGTCGT GACGGCCCTT AATGGCGAGC CGGTCAAGGA TCCGCGTGAT CTTGCCCGGC GAGTGGCGGC ACTGCGCCCC GGCTCCACTG CCGAGGTCAC TCTTTGGCGC TCCGGCAAGT CCGAAACGGT CAAGCTCGAG ATCGGCACGC TGCCGAGCGA TGCCAAGGAG ACTGCACCGA CAACCGGCGA AGCACAGCCG GACGAAGGTC AGGCAAGCGA CGAGGCACTG GCCGGGCTCG GCCTGACGGT GACCCCGTCG GAAGACGACA GGGGCGTCAC GATCACATCC GTCGACCCGG ACTCCGACGC TAGCGATCGC GGTCTGAAGC AAGGCGAGAA GATCGTCTCC GTCAACAATC AGGAAGTGAA ATCGGCGGAC GACATTCTCA AGGTGATCAA CAACGCCAGA AAGGACAATC GGACCAAGGC GCTGTTCCAG ATCGAAGCCC AGGAAGGCAG CCGCTTCGTC GCACTCCCGA TCGCTCAGGG CTGA
|
Protein sequence | MSMIKTSRPS LKTVLKTTTV AGVAAVLLTT GLPAQITQSF AEAVSVPAPA VPSFANVVEA VSPAVVSVRV QAREQASDDE SNFTFDFGGR GFDDLPEDHP LRRFFREFDP RDNDRADRWR DRRGPRGEGR LRPRAQGSGF FITEDGYLVT NNHVISDGSA FTVIMDDGTE LEAKLVGKDS RTDLAVLKVD AKRKFTHVSF ADDEKVRVGD WVVAVGNPFG LGGTVTAGII SARGRDIGSG PYDDYLQVDA AVNRGNSGGP TFNLSGEVVG INTAIFSPSG GNVGIAFAIP ASVAKDVVDS LIKDGTVSRG WLGVQIQPVT KDIAESLGLA EAKGALVVEP QTGSPGEKAG IKNGDVVTAL NGEPVKDPRD LARRVAALRP GSTAEVTLWR SGKSETVKLE IGTLPSDAKE TAPTTGEAQP DEGQASDEAL AGLGLTVTPS EDDRGVTITS VDPDSDASDR GLKQGEKIVS VNNQEVKSAD DILKVINNAR KDNRTKALFQ IEAQEGSRFV ALPIAQG
|
| |