Gene Information |
Locus tag | Rleg_6197 |
Symbol | |
ID | 8016210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | + |
Start bp | 241800 |
End bp | 242702 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644827503 |
Product | proline-specific peptidase |
Protein accession | YP_002978703 |
Protein GI | 241258819 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily |
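The coordinate and length fields above are related by simple arithmetic, which the short Python sketch below checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 241800
end_bp = 242702

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 903 300
```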
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.941686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGGCGAAG TCACGACCAA AGAAGCTTAC CTGCCCTTTC GCGACTATCG CACCTGGTAT CGCGTCACCG GTTCGCTGGA GAGCGGCAAG CTGCCCCTCG TCGTCGCCCA TGGCGGGCCT GGCTGCACCC ATGATTATGT CGATTCCTTC AAGGATATCG CCGCCCTCGA CGGCCGTCCG GTCATCCATT ACGACCAGCT CGGCAATGGC AATTCCACCC GACTTCCGGA AAAAGGCCCG GATTTCTGGA CGGTCGGCCT GTTCCTCGAG GAGCTGGACA CGCTACTTTC CCATCTCGGC ATTCGGGATC GTTATGCCTT CCTCGGCCAG TCCTGGGGCG GCATGCTCGG CGCCGAACAT GCGGTGCGCC AGCCGCAAGG TCTGAAGGCG CTTGTCATCG CCAACTCGCC GGCCAACATG CACACCTGGG TTTCGGAGGC GAACCGGCTG AGGCAGGAAC TGCCGAAAGA GGTGCAGGAC ACGCTGCTGA AGCATGAGCT GGTGGGAAGC CTCACCGATC CGGACTATAT CGCCGCCTCA CGCGTCTTCT ATGACCGCCA TGTCTGCCGC GTGGTGCCGT GGCCGCCTGA AGTGGCGCGG ACCTTTGCAA TCATGGACGA GGACAACACC GTCTACCGCA ACATGAACGG CCCGACCGAA TTTCACGTCA TCGGTACGAT GAAAGACTGG ACGATCGAGA ACAGGCTGGA CCGCATCGAA GCCCCGACGC TGCTGATCTC GGGAAAATAC GACGAGGCGA CGCCCCTGGT GGTAAGGCCC TATCTCGAAC GCGTTCCGGG CTGCGAATGG GTGCTCTTCG AAAATTCCAG CCATATGCCG CATGTCGAGG AAAAGCAGCT TTGCCTGGCG ACCGTTTCCG GTTTCCTGTC CCGGCACGAC TGA
Protein sequence | MGEVTTKEAY LPFRDYRTWY RVTGSLESGK LPLVVAHGGP GCTHDYVDSF KDIAALDGRP VIHYDQLGNG NSTRLPEKGP DFWTVGLFLE ELDTLLSHLG IRDRYAFLGQ SWGGMLGAEH AVRQPQGLKA LVIANSPANM HTWVSEANRL RQELPKEVQD TLLKHELVGS LTDPDYIAAS RVFYDRHVCR VVPWPPEVAR TFAIMDEDNT VYRNMNGPTE FHVIGTMKDW TIENRLDRIE APTLLISGKY DEATPLVVRP YLERVPGCEW VLFENSSHMP HVEEKQLCLA TVSGFLSRHD
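The protein sequence follows from the gene sequence under translation table 11. Below is a minimal sketch in plain Python, assuming that table 11 uses the same codon assignments as the standard genetic code and that an alternative start codon such as GTG is read as methionine at initiation. The helper names (`clean`, `translate_cds`, `gc_content`) are hypothetical, and the example input is only the first ten codons of the gene followed by its stop codon, not the full sequence.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna):
    """Translate a complete CDS, dropping a trailing stop codon if present.

    Assumption: the first codon is a valid initiation codon, so it is
    reported as M regardless of its usual amino acid (GTG is normally V).
    """
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    if protein:
        protein[0] = "M"
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First ten codons of the gene sequence above, plus its final stop codon.
fragment = "GTGGGCGAAG TCACGACCAA AGAAGCTTAC TGA"
print(translate_cds(fragment))  # MGEVTTKEAY
```

Passing the full gene sequence to `translate_cds` and `gc_content` should allow the Protein sequence and GC content fields to be compared against the record in the same way.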