Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0763 |
Symbol | |
ID | 6979481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 778524 |
End bp | 780029 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643395475 |
Product | protease Do |
Protein accession | YP_002280284 |
Protein GI | 209548367 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACACA TCCTCCGCAA ACATCGCACT GCTGCCCTCA TCGGGGCTGC CATCATCGCC GGCGCGGCAT GCCTGCCGTT TACCATTACC GCGGCGAACG CCGTTGCCTC GCCTGCGGAT GCCGGCGGCA TTCTCGCCGC CAGCGGCTCT TTCGCCTCTA TCGTCGATGC CGACAAACCT GCCGTCGTCA CCATCACCAC GACCATGAAG GCAACCGATG TCAGCGCCGA CCAGCAGCAG TCGCCGATGG ACGAGCAGTT CCGCCAGTTT TTCGAGGATC AGGGCATCCC GCTGCCGCGC CAGGCGCCGA AAAACCGGTC TTCGCAGCAG ACAATGGCGC TCGGCTCCGG CTTCATCATC AGCCCAGACG GCGTGATCGT TACCAACAAC CATGTCATCG ACAATGCCGT CGACATCAAG GTGACGCTGG ATGACGGCAC GGAACTGCCG GCCAAGCTGA TCGGCACCGA CCCGAAATCC GATGTCGCCG TCGTGAAGAT AGAGGCGGGA AAGCCGCTGC AGACCATTGC CTGGGGCGAT TCCGACAGGC TGAAGCTCGG CGACCAGATC CTGGCGATCG GCAACCCCTT CGGCATCGGC ACCACGGTGA CGGCGGGCAT CGTCTCGGCG CGCGGCCGCG ACCTGCACAG CGGGCCCTAT GACGATTTCA TCCAGATCGA CGCGCCGATC AACCATGGCA ACTCAGGCGG GCCGCTCGTC GACCGCAGCG GCAATGTCGT CGGCATCAAC ACTGCGATCT ATTCGCCGAA CGGCGGCAGT GTCGGCGTCG GCTTCGCCAT TCCGTCCGAC GAGGCCAAGG CGATCGTCGC CAAACTGCAG AAGGACGGCT CGATCGATCA CGGTTATCTC GGCGTGCAGA TCCAGCCTGT GACGAAAGAC GTCGCCGATG CCGTCGGCCT CGATAAAACA GGCGGCGCAC TGGTTGCCGC CGTCACCGCC GATACGCCGG CGGCCCATGC CGGCGTGAAG CCGGGCGATA TCATCACCTC GGTCGGCGGC GAGAGCGTCA AGACGCCGAA GGACCTGTCG CGGCTGGTCG CCGATCTTTC GCCAGGCGCC AAAAAATCTC TCGGCATCTG GCGTGACGGC AAGACGATCG ATCTCAACGT CACCGTCGGC GGCAATGAGG ACGGCCAGAA ACAGGCCGCC GCCGAAAGCT CGGACAGCAA GGGCGAGAGC AGCGGCCAGC CGAGCCTCGG CATCGGCCTC GCCGACCTGA CGCCAGACGT GCGCGAGCAG CTCACCCTGC CGCGCGCCGT CAGCGGCGCG GTGGTCGCCA GCGTCGATCC CGACAAGTCG GCCGCGGCCG CCGGCATCCA GTCGGGCGAT GTCATCGTCT CGGTCAACGA CAGACCGGTC CACAGCACCC GCGACGTCAA GACCGCGATT GCCGAGGCCG GCAAGGCTGG CCGCAAATCG GTGCTACTGC TCGTCGAACG CGATGGCGGC AAGACCTTCG TCGCCGTGCC GTTCGGTGCG GCCTGA
|
Protein sequence | MSHILRKHRT AALIGAAIIA GAACLPFTIT AANAVASPAD AGGILAASGS FASIVDADKP AVVTITTTMK ATDVSADQQQ SPMDEQFRQF FEDQGIPLPR QAPKNRSSQQ TMALGSGFII SPDGVIVTNN HVIDNAVDIK VTLDDGTELP AKLIGTDPKS DVAVVKIEAG KPLQTIAWGD SDRLKLGDQI LAIGNPFGIG TTVTAGIVSA RGRDLHSGPY DDFIQIDAPI NHGNSGGPLV DRSGNVVGIN TAIYSPNGGS VGVGFAIPSD EAKAIVAKLQ KDGSIDHGYL GVQIQPVTKD VADAVGLDKT GGALVAAVTA DTPAAHAGVK PGDIITSVGG ESVKTPKDLS RLVADLSPGA KKSLGIWRDG KTIDLNVTVG GNEDGQKQAA AESSDSKGES SGQPSLGIGL ADLTPDVREQ LTLPRAVSGA VVASVDPDKS AAAAGIQSGD VIVSVNDRPV HSTRDVKTAI AEAGKAGRKS VLLLVERDGG KTFVAVPFGA A
|
| |