Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1463 |
Symbol | |
ID | 8012551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1449112 |
End bp | 1450515 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824052 |
Product | protease Do |
Protein accession | YP_002975294 |
Protein GI | 241204198 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.819915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.415171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGGCC TGTTCAAGCG CGCCTCCGTC TCGCTATTCG CTCTCATGCT CGTTCTGCCG GCTGCGGCCC ATGCGCAGAC GGCAAAGACC GTACCCGAGA GCCAGATGCA GATGCAGCTC TCCTTCGCGC CGCTCGTCAA ACAGACGTCA GGCGCCGTCG TCAACGTTTA TGCGGAAAAG ACCGTCCAGC GGCAGTCGCC CTTTGCTGGC GACCCTTTCT TCGAGCAATT TTTCGGCCAG CAGATGCCGA ACCGTTCGGA GAAGCAGTCT TCGCTCGGCT CCGGCGTCAT CGTCGAGGCG AACGGCACTG TCGTGACCAA CAATCACGTC ATCGAGGGCG CCGACGATAT CAAGGTGGCG CTCTCGGACG GCCGCGAATT CCCCTGCAAG GTGGTACTGC GCGACGACCG TGTCGACCTT GCCGTGCTGA AGATCGACGC CAAGGAAAGC TTTCCGACAT TGCCGATCGG CAATTCCGAT ACGGTCGAGG TCGGTGATCT CGTGCTGGCG ATCGGCAATC CCTTCGGTGT CGGCCAGACG GTGACGAGCG GTATCGTCTC GGCGCTTGCC CGCAACCAGG TGATCAAGAA CGAGTTCGGT TTCTTCATCC AGACCGATGC CTCGATCAAT CCCGGCAATT CCGGCGGTGC CTTGATGAAT ATGAAGGGCG AGCTGATCGG CATCAACACG GCGATCTTCT CGCGCGGCGG TGGATCGAAC GGCATTGGTT TCGCCATCCC CGCCAATCTG GTCAAGGTCT TCCTCACCTC TGCCGATGCC GGCGTCAAAT CCTTCGAGCG GCCCTATGTC GGCGCGAGCT TCGATGCCGT GACCTCGGAA GTGGCTGAGG CGCTGGGGCT GAACAAAGTC CGCGGCGCGC TCGTCGTCAA GGTTTCGGAA GGTGGGCCAG CCGCCAAGGC CGGGCTGAAG GCCGGCGAAA TCGTCACCGC CGTCGACGGA ATTTCCGTCG AGCATCCGGA TGCGCTGCTC TACCGGCTGA CGACGGCCGG TCTCGGCAAT TCAGTCAAGC TCACCGTCAT CGAGAATGGC CGCGAGGAGC AACTGCCGCT GACGCTTGCC CGCGCCCCGG AAACCTCGCC GCGCGACCAG CGCACCATCG GCGGACACAC GCCGTTTACC GGTGCCGTCG TCGAAAACCT GTCGCCGCGT GTCGCCGACG AGCTGCGCAT GCCGCCCGAA TCGGCAGGCG TCGTCGTATC CGAGGTGAAG GAGGATTCGC CTGCCGCCCG TCTCGGTTTC GAACCGAAGG ATATCATCGT CTCGATCAAC GGTACCGATG TGAAGTCGAC CAGCGAACTG TCCGAGATCG CCGATAGCGA CCCCGGCCTC TGGCGGGTCG AGATCGAGCG TGACGGCCAG CGCATCCGGC AATTCTTCCG ATGA
|
Protein sequence | MQGLFKRASV SLFALMLVLP AAAHAQTAKT VPESQMQMQL SFAPLVKQTS GAVVNVYAEK TVQRQSPFAG DPFFEQFFGQ QMPNRSEKQS SLGSGVIVEA NGTVVTNNHV IEGADDIKVA LSDGREFPCK VVLRDDRVDL AVLKIDAKES FPTLPIGNSD TVEVGDLVLA IGNPFGVGQT VTSGIVSALA RNQVIKNEFG FFIQTDASIN PGNSGGALMN MKGELIGINT AIFSRGGGSN GIGFAIPANL VKVFLTSADA GVKSFERPYV GASFDAVTSE VAEALGLNKV RGALVVKVSE GGPAAKAGLK AGEIVTAVDG ISVEHPDALL YRLTTAGLGN SVKLTVIENG REEQLPLTLA RAPETSPRDQ RTIGGHTPFT GAVVENLSPR VADELRMPPE SAGVVVSEVK EDSPAARLGF EPKDIIVSIN GTDVKSTSEL SEIADSDPGL WRVEIERDGQ RIRQFFR
|
| |