Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0871 |
Symbol | |
ID | 8012024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 861290 |
End bp | 862792 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644823456 |
Product | protease Do |
Protein accession | YP_002974707 |
Protein GI | 241203611 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACACA TCCTCCGCAA CCATCGCACC GCAGCCCTTG TCGGGGCCGC CATCATCGCC GGCGCAGCCT GCCTGCCCCT CGCCCTCAAC GCCTCCAATG CCGTTGCTGC GCCTTCGGAT AACGGCGGCA TTCTCGCCCC CAACGGCTCC TTCGCTTCCA TCGTCGAAGC CGACAAGCCT GCGGTCGTCA CCATCACCAC GACGATGAAG GCGACCGATG TCAGCGCCGA CCAGGAATCG CCGATGGACG AGCAGTTCCG CCAGTTCTTC GAGGATCAGG GCATCCCGCT GCCGCGCCAG GCACCGCAAA AGCGGCCTTC GCAGCAGGCG ATGGCGCTCG GTTCCGGCTT CATCATCAGC CGCGACGGGG TGATCGTCAC CAACAACCAT GTCATCGACA ATGCCGTCGA TATCAAGGTG ACGCTGGATG ACGGCACGGA ACTGCCGGCC AAGCTGATCG GCACCGATCC GAAATCCGAT GTCGCCGTGC TGAAGATAGA GGCGGGCAAG CCGCTGCAGA CTATCGCCTG GGGCGATTCC GACAGGCTGA AGCTCGGCGA CCAGATTCTG GCGATCGGCA ACCCCTTCGG CATCGGCACC ACGGTGACGG CAGGCATCGT CTCGGCGCGC GGCCGCGACC TGCACAGCGG GCCTTATGAC GATTTCATCC AGATCGACGC GCCGATCAAC CATGGCAATT CCGGAGGACC GCTGGTCGAC CGCAGCGGCA ATGTCGTCGG CATCAACACC GCCATCTATT CGCCGAACGG CGGCAGCGTC GGCGTCGGTT TCGCCATTCC CTCCGACGAG GCCAAGGCGA TCGTCGCCAA GCTGCAGAAG GACGGCTCGA TCGATCACGG CTATCTCGGC GTGCAGATCC AGCCGGTCAC CAAGGATGTC GCCGATGCCG TCGGCCTCGA TAAGACCGGC GGCGCGCTGG TTGCCGCCGT CACCGCCGAT ACGCCGGCCG CCCATGCCGG CCTGAAGCCC GGCGATATCG TCACGTCAGT CGGCGGCGAG AGCGTCAAGA CGCCGAAAGA CCTGTCGCGC CTGGTCGCCG ACCTTTCGCC GGGCGCGAAA AAATCCCTCA GCGTCTGGCG CGACGGCAAG ACGATCGATC TCAACGTCAC CGTCGGCACC AATGAGGAAG GCCAGAAACA GGCGGCGGCC GAAAGCCCCG ACGCTCAAGA TCAGAGCTCC GGCCAGCCGA GCCTCGGCAT CGGCCTCGCC GATCTGACGC CCGATGTGCG CCAGCAGCTC AACCTGCCGC GCTCGATCAA CGGTGCGGTG GTCGCCAAGG TCGCCCCGGA CAAGTCAGCG GCTGCCGCCG GCATCCAGTC CGGCGATGTC ATCGTCTCGG TGAATGACAG ACCTGTTCAT AACGCCCGCG ACGTCAAGAC CGCAATTGCC GATGCCGGCA AGGCCGGCCG CAAGTCGGTG CTGCTGCTCG TCGAACGCGA TGGCAACAAG ACCTTTGTCG CCGTGCCGTT TGGGGCGGCG TGA
|
Protein sequence | MSHILRNHRT AALVGAAIIA GAACLPLALN ASNAVAAPSD NGGILAPNGS FASIVEADKP AVVTITTTMK ATDVSADQES PMDEQFRQFF EDQGIPLPRQ APQKRPSQQA MALGSGFIIS RDGVIVTNNH VIDNAVDIKV TLDDGTELPA KLIGTDPKSD VAVLKIEAGK PLQTIAWGDS DRLKLGDQIL AIGNPFGIGT TVTAGIVSAR GRDLHSGPYD DFIQIDAPIN HGNSGGPLVD RSGNVVGINT AIYSPNGGSV GVGFAIPSDE AKAIVAKLQK DGSIDHGYLG VQIQPVTKDV ADAVGLDKTG GALVAAVTAD TPAAHAGLKP GDIVTSVGGE SVKTPKDLSR LVADLSPGAK KSLSVWRDGK TIDLNVTVGT NEEGQKQAAA ESPDAQDQSS GQPSLGIGLA DLTPDVRQQL NLPRSINGAV VAKVAPDKSA AAAGIQSGDV IVSVNDRPVH NARDVKTAIA DAGKAGRKSV LLLVERDGNK TFVAVPFGAA
|
| |