Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2539 |
Symbol | |
ID | 6981281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2570194 |
End bp | 2571930 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643397253 |
Product | protease Do |
Protein accession | YP_002282038 |
Protein GI | 209550121 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.874075 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCCA CGAATCGCTC GCCCTTCAGA CGAACGCTCG CGCTTATGGC CAGCGCTGCA ATTCTTGCGC ATGCTGGCAT GAACGGGGTC GCCTATGCGC AAACCGCGCC GCAGACGACG GCGCCCGGCG TTGCTACACC CGCTCCGGCT ACTCCGGAAA CGGCTGCTCC TGCGCCGACG CCGCCTGCAA CGGCTGCACC GCAGCCGACC CCGCAAATGC AGGCCGCAAC CCCGAACAAC GGTCCCGCTT CGGTCGCCGA TCTCGCCGAA GGGCTGCTCG ACGCCGTGGT CAACATCTCG ACCTCGCAGA ATGTGAAGGA TGACGAGGGC GTCGGTCCGG CGCCGCGCGC GCCCGACGGC TCGCCGTTCC AGGAGTTCTT CAACGACTTC TTCGACAAGA AGCAGGGCAA CAAGGGCCCG AACCACAATG TCAGCTCGCT CGGCTCCGGC TTCGTCATCG ACCCGGCCGG CTATATCGTC ACCAACAACC ATGTGATCGA GGGCGCCGAC GACATCGAGA TCAATTTCGC CAATGGTTCG AAGCTCAAGG CGAAACTGAT CGGCACGGAT ACGAAGACCG ATCTTTCGGT GCTGAAGGTC GAGCCGAAGG CACCGCTGAA ATCGGTGAAA TTCGGCGATT CCAGCACCAT GCGCATCGGC GACTGGGTGA TGGCGATCGG CAATCCGTTC GGCTTCGGCG GTTCGGTGAC GGTCGGCATC ATTTCCGGGC GCGGCCGCAA CATCAATGCT GGCCCCTATG ACAACTTCAT TCAGACGGAT GCCGCGATCA ACAAGGGCAA TTCCGGCGGA CCGCTCTTCA ACATGAAGGG TGAGGTGATC GGCATCAATA CGGCGATCAT TTCGCCGAGC GGCGGCTCGA TCGGCATCGG CTTCTCGGTG CCTTCAGAGC TTGCCTCCGG CGTCGTCGAT CAATTGCGCG AATATGGTGA GACGCGGCGC GGCTGGCTCG GTGTGCGCAT CCAGCCGGTC ACCGACGATA TCGCTGACAG TCTCGGGCTC GACACTGCCA AGGGTGCTCT GGTCGCCGGC GTCATCAAGG GCGGCCCGGT CGACGACGGT TCGATCAAGG CGGGTGACGT CATTTTGAAA TTCGACGGCA AGACCGTCAG CGAAATGCGC GATCTGCCGC GCGTCGTGGC GGAAAGCTCG GTTGGCAAGG AAGTCGACGT GGTGGTGCTG CGCGACGGCA AGGAGCAGAC CGTTAAGGTG AAACTCGGCC GGCTCGAAGA CAGCGACCAG GCGGCAGCAT CCGGCGATGC GGCGCCCGAC GGTTCGCAGG ATGACGGCGT GATCACCCCG GACCCCGGCG AGAACAACGA CATGGACGAG CCGGACTCCG GCGATCAGGC CCAGCCGGCA CCAGGCGCGC CCACGCCGGA CCAACACCAG GGCCAGGTGT CACCGGATGC ATCAACACCG AAGAACGTGC TCGGCCTGTC GCTGTCGCTT TTGAGCGCCG AGACGCGCAA GGCTTTCGGC ATTGCCGAGA GCGTCGACGG TGTCGTCGTG ACGGAGGTGA CACCCGGCTC CGCCTCGGCC GAAAAAGGGC TGAAGCCCGG CGACGTGATC GTGGAAGTGG CGCAGGAGTT TATGAAGTCG CCGGACGCGG TCGCTGCCAA GGTGAAGTCG CTGAAGCAGG AAGGCCGCCG CAACGCCCAA CTGATGATCG CATCGGCAAA TGGTGATCTG CGGTTTGTGG CGGTGCCAAT GGAGTAA
|
Protein sequence | MAPTNRSPFR RTLALMASAA ILAHAGMNGV AYAQTAPQTT APGVATPAPA TPETAAPAPT PPATAAPQPT PQMQAATPNN GPASVADLAE GLLDAVVNIS TSQNVKDDEG VGPAPRAPDG SPFQEFFNDF FDKKQGNKGP NHNVSSLGSG FVIDPAGYIV TNNHVIEGAD DIEINFANGS KLKAKLIGTD TKTDLSVLKV EPKAPLKSVK FGDSSTMRIG DWVMAIGNPF GFGGSVTVGI ISGRGRNINA GPYDNFIQTD AAINKGNSGG PLFNMKGEVI GINTAIISPS GGSIGIGFSV PSELASGVVD QLREYGETRR GWLGVRIQPV TDDIADSLGL DTAKGALVAG VIKGGPVDDG SIKAGDVILK FDGKTVSEMR DLPRVVAESS VGKEVDVVVL RDGKEQTVKV KLGRLEDSDQ AAASGDAAPD GSQDDGVITP DPGENNDMDE PDSGDQAQPA PGAPTPDQHQ GQVSPDASTP KNVLGLSLSL LSAETRKAFG IAESVDGVVV TEVTPGSASA EKGLKPGDVI VEVAQEFMKS PDAVAAKVKS LKQEGRRNAQ LMIASANGDL RFVAVPME
|
| |