Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1364 |
Symbol | |
ID | 6980092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1382683 |
End bp | 1384086 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396085 |
Product | protease Do |
Protein accession | YP_002280884 |
Protein GI | 209548967 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGGCC TGTTCAAACA CGCCTCCGTT TCGCTGCTCG CTCTGACGCT GCTTCTGCCG GCTGCCGCTT ACGCGCAGAC GGCAAAGACG GTGCCCGAGA GCCAGATGCA GATGCAGCTC TCCTTCGCGC CGCTCGTCAA GCAGACATCC GGCGCCGTCG TCAATGTCTA TGCGGAAAAG ACCGTCCAGC GGCAGTCGCC CTTTGCCGGC GATCCCTTCT TCGAGCAGTT TTTCGGCCAG CAGATGCCGA ATCGCTCGGA AAAGCAGTCC TCGCTCGGTT CCGGCGTCAT CGTCGAGGCG AACGGCACTG TCGTGACCAA CAATCACGTG GTCGAAGGCG CCGACGATAT CAAGGTTGCG CTGCCTGATG GACGCGAGTT CCCCTGCAAA GTGGTGCTGC GCGACGACCG CGTCGATCTT GCCGTGCTGA AGATCGACAC CAAGGAAAGC TTCCCGACAT TGCCGATCGG CAATTCCGAT GCCGTCGAGG TCGGTGATCT CGTGCTGGCG ATCGGCAACC CCTTCGGCGT CGGCCAGACG GTGACGAGCG GTATCGTCTC CGCGCTTGCC CGCAACCAGG TGGTCAGGAA CGAGTTCGGC TTCTTCATCC AGACCGACGC CTCGATCAAT CCCGGCAATT CCGGCGGCGC CCTGATGAAC ATGAAGGGCG AACTGATCGG CATCAATACG GCGATCTTCT CGCGCGGCGG CGGCTCGAAC GGTATCGGCT TTGCCATCCC CGCCAATCTG GTCAAGGTCT TCCTCGCTTC GGCCGATGCC GGCGTCAAAT CCTTCGAGCG GCCCTATGTC GGCGCGAGTT TCGATGCGGT GACCTCGGAA GTCGCCGAAG CGCTTGGACT GAATAAGGCG CGCGGCGCAC TGGTGGTCAA GGTTTCGGAA GGCGGCCCGG CCGCCAAGGC CGGGCTGAAG GCAGGTGAAA TCGTCACCGC TGTCGACGGT ATTTCGGTCG AGCATCCGGA TGCGCTGCTT TACCGGCTGA CGACGGCCGG TCTCGGAAAA TCGGTCAAGC TCACGGTCGT CGAGAACGGC CGCGAGGAGC AGCTGCCGCT GACGCTCGAT CGCGCCCCGG AAACCTCGCC GCGCGACCAG CGCACCATCG GCGGGCGTAC TCCCTTCAGC GGTGCTGTCG TCGAGAACCT GTCGCCGCGG GTCGCCGACG AGTTGCGCAT GCCGCCGGAA TCGGCAGGCG TCGTCGTATC TGAGGTGAAG GAGGATTCGC CTGCCGCCCG TCTCGGTTTC GAGCCGAAGG ATATCATCGT CTCGATCAAC GGCACCGATG TGAAGTCGAC CAGCGAGCTG TCCCAAATCG CCGATAGCGA CCCCGGCCTC TGGCGGGTGG AAATCGAGCG CGACGGCCAG CGCATCCGGC AGTTCTTCCG ATGA
|
Protein sequence | MQGLFKHASV SLLALTLLLP AAAYAQTAKT VPESQMQMQL SFAPLVKQTS GAVVNVYAEK TVQRQSPFAG DPFFEQFFGQ QMPNRSEKQS SLGSGVIVEA NGTVVTNNHV VEGADDIKVA LPDGREFPCK VVLRDDRVDL AVLKIDTKES FPTLPIGNSD AVEVGDLVLA IGNPFGVGQT VTSGIVSALA RNQVVRNEFG FFIQTDASIN PGNSGGALMN MKGELIGINT AIFSRGGGSN GIGFAIPANL VKVFLASADA GVKSFERPYV GASFDAVTSE VAEALGLNKA RGALVVKVSE GGPAAKAGLK AGEIVTAVDG ISVEHPDALL YRLTTAGLGK SVKLTVVENG REEQLPLTLD RAPETSPRDQ RTIGGRTPFS GAVVENLSPR VADELRMPPE SAGVVVSEVK EDSPAARLGF EPKDIIVSIN GTDVKSTSEL SQIADSDPGL WRVEIERDGQ RIRQFFR
|
| |