Gene Information |
Locus tag | Rleg_1073 |
Symbol | |
ID | 8012200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1046768 |
End bp | 1048351 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823656 |
Product | protease Do |
Protein accession | YP_002974907 |
Protein GI | 241203811 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.952362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAGA ATTTCAACGG ACGTCCGTCC CTCGCCACTG TGCTCAAGGC CTCTACCGTT GCCGGTATCG CAGCCGCTGT GCTCGCAACC GGCGTTCCGC TCGAAATCAC CCGGTCTTAT GCCGAAGCTG TCAAGGTTCA GGCGCCCGCC GTTCCGAGCT TCGCCAATGT CGTCGACGCC GTTTCGCCGG CCGTCGTTTC CGTCCGGGTC GAAAATCGCG TCAATCCCGT CTCCGACAAC AACAATGACG GCTTCTCCTT CGATTTCAAC GGCCGCGGCT TCGACGACCT ACCCGACGAT CATCCGCTGA AGCGGTTCTT CAAGCAGTTC GGCCAGGATC CGAATGATCA GCAGGGCCAT TCCAGGCGCT TCGGCCAGAA CGGCCCGAAT GGTCCGGGCG GCAAGGGTCG CCTCCGCCCC GTCGCCCAGG GCTCCGGCTT CTTCATCTCT GAGGACGGCT ACATCGTCAC CAACAATCAC GTCGTTTCCG ATGGTCAGGC CTTCGTCGCC GTCATGAAAG ACGGCACCGA ACTCGATGCC AAGCTGATCG GCAAGGATCC GCGCACCGAT CTCGCTGTGC TGAAGGTCGA CGGCAAGGGC AAGAAGTTCA CCTACGTCAA CTGGGCCGAC GACAACAATG TCCGCGTCGG TGACTGGGTC GTTGCCGTCG GCAATCCCTT CGGTCTCGGC GGCACGGTTA CAGCCGGCAT CGTCTCAGCT CGCGGCCGTG ATATCGGTTC CGGTCCTTAT GACGATTATC TGCAGGTGGA TGCCGCCGTG AACCGCGGCA ACTCCGGCGG TCCGACCTTC AACCTCAGCG GCGAAGTCGT CGGCATCAAC ACCGCGATCT TCTCGCCGTC GGGCGGCAGC GTCGGCATCG CCTTCGCCAT TCCCGCCTCG ACCGCCAAGG ACGTCGTCGC CGATCTGATG AAGGACGGCC AGGTTTCGCG CGGCTGGCTG GGTGTCCAGA TCCAGCCGGT AACCAAGGAC ATCGCCGAAT CCATCGGCCT TTCCGAGCCG AGCGGCGCCC TGGTCGTTGC CCCGCAGGCC GGGTCGCCGG GTGACAAGGC CGGCATGAAG GCCGGCGACG TCGTCACCGC GCTGAATGGG GAGACGATCA AGGATGCCCG TGATCTCAGC CGCCGTATCG GTGCGATGCA GCCGGGCAGC AAGGTTGAGC TTTCGGTCTG GCGCGCCGGC AAGGCCCAGC CTCTCACCGT CGAACTCGGC ACGTTGCCGA TCGACCAGAA GGATGCGTCT GCCGATGACA ACAGCCAGCC GCAGCAGCCT GAAGCACCGG CTTCCGAGAA GGCGCTTGCC GATCTCGGCC TGACGGTCGG CCCGTCTGAC GACGGCAAGG GCCTGGCGAT AACAGACATC GACCCGAACT CCGATGCCGC CGACAAGGGC ATTAAGGAAG GTGAGAAGAT CACCTCGGTC AACAACCAGG AGGTCTCCAG CGCCGACGAC ATCGTCAAGG TGCTGAACCA AGCCAAGAAG GACGGGCGCA CCCGCGCTCT CTTCCAGATC CAGTCCAGTG AGGGGAGCCG CTTCGTAGCG CTTCCGATCA ACGGCCAGGG CTGA
|
Protein sequence | MLKNFNGRPS LATVLKASTV AGIAAAVLAT GVPLEITRSY AEAVKVQAPA VPSFANVVDA VSPAVVSVRV ENRVNPVSDN NNDGFSFDFN GRGFDDLPDD HPLKRFFKQF GQDPNDQQGH SRRFGQNGPN GPGGKGRLRP VAQGSGFFIS EDGYIVTNNH VVSDGQAFVA VMKDGTELDA KLIGKDPRTD LAVLKVDGKG KKFTYVNWAD DNNVRVGDWV VAVGNPFGLG GTVTAGIVSA RGRDIGSGPY DDYLQVDAAV NRGNSGGPTF NLSGEVVGIN TAIFSPSGGS VGIAFAIPAS TAKDVVADLM KDGQVSRGWL GVQIQPVTKD IAESIGLSEP SGALVVAPQA GSPGDKAGMK AGDVVTALNG ETIKDARDLS RRIGAMQPGS KVELSVWRAG KAQPLTVELG TLPIDQKDAS ADDNSQPQQP EAPASEKALA DLGLTVGPSD DGKGLAITDI DPNSDAADKG IKEGEKITSV NNQEVSSADD IVKVLNQAKK DGRTRALFQI QSSEGSRFVA LPINGQG
|
| |
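The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and sequence. The following is a minimal sketch, not part of the source record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed above. The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from the record above.
print(gene_length(1046768, 1048351))  # 1584
print(protein_length(1584))           # 527

# First two 10-base blocks of the gene sequence, as a small worked case.
print(gc_percent("ATGCTCAAGA ATTTCAACGG"))  # 40.0
```

Passing the full Gene sequence field to `gc_percent` gives the GC content of the whole gene, which can be compared with the rounded value listed in the record.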