Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5403 |
Symbol | |
ID | 8007361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 818954 |
End bp | 821071 |
Gene Length | 2118 bp |
Protein Length | 705 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644822307 |
Product | carboxyl-terminal protease |
Protein accession | YP_002973567 |
Protein GI | 241113732 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATAG CCTACGTTTT CCTTGGTGCA TTTCTTGCAA TTGCACAGTC CGCATATGCT GAAGTCGCAT CACCGCCTGT TTTGGCGCCG CTAAAGCGAC AAGCGCAGGC CGCTGAGTTG AGCGCACAAT TTCTTTCGCG GTATAGCTAC AAGCCCGTTC CACTCGACGA CGCCTTGTCG GCCAGGATCA TGGATCAGTT CATCAAGTCA CTTGATCCGG ACCGCATGCT CTTCCTGCAA GCGGACATCG ACAAGTTCAT GTCCGATCGC AGCGAGATCG ACGATGCGAT AGAACGGAAG GACTTGAAGA TCCCGTTTGC GATTTTCAAC GCCTATGAAC AGCGCGTTGT CGACCGCATG AACTATGCGC GCAGCCTGCT GAAGCAGGAC TTCGATTTCA GTACGCAGGA AAATTATTCG GTGCTGCGCG ATAAAGCGCC GTGGTCGCAG TCGAAAGCCG AGAGCAATGA GCTTTGGCGC AAGCGGGTCA AGAGCGACTG GTTGCGGTTG AAACTCGGCG GCAAAAACGA CGCGGCCATC CGCGAAACGC TCGACAAACG TTATGAGAAC ATACTCGAGC GCGCTTACAA GTTTAAAAGC GACGACGTTT TCCAGTCGTT CATGGATGCC TATTCAACGT CGATCGATCC GCACACGGAC TATTTCGGCG CGGCCGCTTC AGCCGACTTC AATGTCTCGA TGAAGCTTTC GCTGTTTGGT ATCGGTGCCG TGCTGCAGGA ACGCGACGAC TACACGACGA TCCGTGAGCT CGTGCCTGGC GGGCCGGCGC AGCTTTCCGG CAAGCTCGCG GTCGGAGACC GCATTACGGG CGTTGGTCAA GGCAAGGATG GGGCGATCAA AGAAGTGGTA GGCACGCGCC TTGATGAAGT CGTGCAGATG ATACGCGGAA AAAAAGACTC CGTCGTGCGG CTGGATATCC TGCCGGCAGA TGCCGGAGCA GATGGCACGC ATCGCGTCAT CAGCCTGGTG CGCGATAAAA TCAGTCTCGA CAAGCAGGCT GCCAGGAAGA CTGTGCTGTC CGTGAAGGCG GGCGACGCCA CGCGTAAAAT CGGGATCATC ACGCTGCCGG TATTCTATGA GGATTTTGAA GCCAAGCGCA AAGGTGACCA GGATTACAAA AGCGCAAGCC GCGATGTCGC CAAGCTTCTC GACGAACTGA AGGAAGAAAA GGTCGACAGC GTTCTGATTG ACCTGCGCAA CAATGGCGGC GGTTCATTGG ACGAGGCGAT TGATTTGACC GGTCTCTTCA TCGGCAATGG ACCGGTCGTT CAGCAACGCG GCAGCGACGG CAAGATCGAG GTCAAAAGCG CTGAGCTTGC AGCGCCTGTC TGGGCAGGCC CGATGGGTGT CCTGATCAAT CGCGGCTCGG CGTCGGCTTC AGAGATTTTT GCCGCGGCCA TCCAGGATTA TGGTCGAGGC GTGATCGTCG GCGAACCCAG TTTCGGGAAG GGCACCGTTC AGACCGTCGT CGACCTTGAC CAGATCGTCC GCAACAGCAA GCCTGAGTTC GGGGAGCTGA AAGTGACGAT TGCCCAGTTT TTCCGAGTCA ACGGCGGTAC GACGCAGCTG CGCGGCGTGA CGCCTGATAT CAGCTTACCG GGACTTTCCG ATCCCACGAG CTTCGGCGAG ACCAGTTATG ACAATGCGCT GCCGTGGGCG CAGATCAAGC CCGCGAATTA CACGCCCTCC GACACTGTCT CGACGTTGCT GCCGACATTG CAAAGCCGCC ATGATGCGCG GGTTGGAAGC GATCCGGATT TCCAACGCTT GTTAAAAGAC CTTGCCGACC TCAAGGCACA GCGCGAGAAA GGGGTTATCT CCCTCAATGA AGCGGAACGT CGCAAAGAAG CGACGGCTCG AGAGAAACGG TTCAAGGATC GAGCGCAAGT GAGCGATGGC GAGGATCCGG GTGGAGATGA TGGTCTGGAG GCAGGCGAGC GCAGCCTGAG CGCTGATATT GCCATCGAGA ATGCTCGCAA GAACGCAAAG GACGTCCTGC TCGATGAAGC CGCCGCCATT CTTGCCGACG AGGCGGATTT GCAGCAAGGC GGGCTAAAGG CAGCCACAAA ACAAACGGGA AACACGAACG GAAAATAA
|
Protein sequence | MRIAYVFLGA FLAIAQSAYA EVASPPVLAP LKRQAQAAEL SAQFLSRYSY KPVPLDDALS ARIMDQFIKS LDPDRMLFLQ ADIDKFMSDR SEIDDAIERK DLKIPFAIFN AYEQRVVDRM NYARSLLKQD FDFSTQENYS VLRDKAPWSQ SKAESNELWR KRVKSDWLRL KLGGKNDAAI RETLDKRYEN ILERAYKFKS DDVFQSFMDA YSTSIDPHTD YFGAAASADF NVSMKLSLFG IGAVLQERDD YTTIRELVPG GPAQLSGKLA VGDRITGVGQ GKDGAIKEVV GTRLDEVVQM IRGKKDSVVR LDILPADAGA DGTHRVISLV RDKISLDKQA ARKTVLSVKA GDATRKIGII TLPVFYEDFE AKRKGDQDYK SASRDVAKLL DELKEEKVDS VLIDLRNNGG GSLDEAIDLT GLFIGNGPVV QQRGSDGKIE VKSAELAAPV WAGPMGVLIN RGSASASEIF AAAIQDYGRG VIVGEPSFGK GTVQTVVDLD QIVRNSKPEF GELKVTIAQF FRVNGGTTQL RGVTPDISLP GLSDPTSFGE TSYDNALPWA QIKPANYTPS DTVSTLLPTL QSRHDARVGS DPDFQRLLKD LADLKAQREK GVISLNEAER RKEATAREKR FKDRAQVSDG EDPGGDDGLE AGERSLSADI AIENARKNAK DVLLDEAAAI LADEADLQQG GLKAATKQTG NTNGK
|
| |