Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0924 |
Symbol | |
ID | 6979642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 938095 |
End bp | 939678 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643395635 |
Product | protease Do |
Protein accession | YP_002280444 |
Protein GI | 209548527 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.010153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0275633 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAGA ATTTCAACGG ACGTCCGTCC CTCGCCACTG TGCTCAAGGC TTCTACCGTC GCCGGTATCG CAGCCGCTGT GCTCGCAACC GGCGTTCCGC TCGAAATCAC CCGGTCTTAT GCCGAAGCCG TCAAGGTTCA GGCGCCTGCC GTGCCGAGCT TCGCCAATGT CGTCGATGCC GTTTCGCCGG CCGTCGTTTC CGTCCGCGTC GAAAACCGCG TCAATCCCGT CTCCGACAAT GACGGCTTCT CCATCGAAGG CCGCGGCTTC GACGATCTTC CCGATGATCA TCCGCTGAAG CGCTTCTTCA AGCAGTTCGG TGGCCAGGAC CCAAGTGATC AGCAGGGCCA TCAGCGGCGC TTCGGCCAGA ACGGCCCGGG TGGCCAAAAT GGCCCCGGCG GCAAGGGCCG TCTGCGTCCG GTCGCTCAGG GGTCCGGCTT CTTCATCTCC GAGGATGGCT ACATCGTTAC CAACAACCAC GTCGTTTCCG ACGGCCAGGC CTTCGTCGCT GTCATGAATG ACGGCACCGA ACTCGATGCC AAGCTGATCG GCAAGGATCC GCGCACCGAT CTCGCCGTCC TCAAGGTCGA CGGCAAGGGC AAGAAGTTCA CCTACGTCAA CTGGGCCGAT GACAACAATG TCCGCGTCGG CGACTGGGTC GTCGCCGTCG GCAACCCCTT CGGCCTCGGC GGCACGGTCA CGGCCGGCAT CGTTTCGGCC CGCGGCCGCG ATATCGGCTC TGGCCCCTAT GACGATTACC TGCAGGTGGA TGCCGCCGTG AACCGCGGCA ACTCCGGTGG CCCAACCTTC AACCTCAGCG GCGAAGTCGT CGGCATCAAC ACCGCGATCT TCTCGCCGTC CGGCGGCAGC GTCGGTATCG CCTTCGCCAT TCCGGCCTCG ACAGCCAGGG ATGTCGTCGC CGATCTGATG AAGGACGGCC AGGTTTCGCG TGGCTGGCTG GGTGTCCAGA TCCAGCCGGT GACCAAGGAT ATCGCCGAAT CCATCGGCCT TTCCGAGCCG AGCGGCGCCC TTGTCGTCGC GCCCCAGGCC GGGTCGCCCG GCGACAAGGC CGGCATGAAG GCCGGCGACG TCGTTACCGC GCTGAACGGT GAAACGATCA AGGATGCGCG TGACCTCAGC CGCCGCATTG GCGCGATGCA GCCGGGCAGC AAGGTCGAGC TTTCGGTCTG GCGTGCCGGC AAGGCCCAGC CTCTGACCGT CGAACTCGGC ACGCTGCCGG CCGACCAGAA GGATGCGAAC GCCGATGACA ACAGCCAGCC GCAGCAGCCG GAGGCACCGG CGTCCGAAAA GGCGCTTGCC GATCTCGGCC TGACGGTCGG TCCTTCCGAT GACGGCAAGG GCCTGGCGAT CACCGGCATC GACCCGGACT CCGACGCCGC CGACAAGGGC ATCAAGGAAG GCGAGAAGAT CACCTCGGTC AACAACCAGG AAGTCTCCAG CCCCGCCGAT GTCGTCAAGG TGCTGAACCA GGCCAAGAAG GACGGCCGCA CCCGGGCGCT CTTCCAGATC CAGTCGAGCG AAGGAAGCCG TTTCGTCGCT CTTCCGATCA ACGGCCAGGG CTGA
|
Protein sequence | MLKNFNGRPS LATVLKASTV AGIAAAVLAT GVPLEITRSY AEAVKVQAPA VPSFANVVDA VSPAVVSVRV ENRVNPVSDN DGFSIEGRGF DDLPDDHPLK RFFKQFGGQD PSDQQGHQRR FGQNGPGGQN GPGGKGRLRP VAQGSGFFIS EDGYIVTNNH VVSDGQAFVA VMNDGTELDA KLIGKDPRTD LAVLKVDGKG KKFTYVNWAD DNNVRVGDWV VAVGNPFGLG GTVTAGIVSA RGRDIGSGPY DDYLQVDAAV NRGNSGGPTF NLSGEVVGIN TAIFSPSGGS VGIAFAIPAS TARDVVADLM KDGQVSRGWL GVQIQPVTKD IAESIGLSEP SGALVVAPQA GSPGDKAGMK AGDVVTALNG ETIKDARDLS RRIGAMQPGS KVELSVWRAG KAQPLTVELG TLPADQKDAN ADDNSQPQQP EAPASEKALA DLGLTVGPSD DGKGLAITGI DPDSDAADKG IKEGEKITSV NNQEVSSPAD VVKVLNQAKK DGRTRALFQI QSSEGSRFVA LPINGQG
|
| |