Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6377 |
Symbol | |
ID | 6983451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | + |
Start bp | 21324 |
End bp | 22226 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643399377 |
Product | proline-specific peptidase |
Protein accession | YP_002284133 |
Protein GI | 209552218 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.843826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.494551 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGAAA TCGTGACGAA TGAAGCCTAT TTGCCTTTTC ACGACTATCA CACCTGGTAT CGTGTCACCG GGTCGCTGGA AAGCGACAAG CTGCCGCTCG TCGTCGCCCA TGGAGGGCCC GGCTGCACCC ATGATTACGT CGATTCCTTC AAGGACATCG CCGCGCTCGA CGGCCGCCCG GTTATTCATT ACGACCAGCT CGGCAATGGC AACTCCACCC GCCTTCCCGA CAAAGGACCG GATTTCTGGA CGGTCGGCCT GTTTCTCGAA GAGCTGGACG CGCTGCTTGC CCATCTCGGC ATCCAGCATC GTTATGCCTT TCTCGGCCAG TCCTGGGGCG GCATGCTCGG CGCCGAACAT GCGGTGCGCC GGCCGCAAGG CCTGAAGGCG CTCGTCATCG CCAACTCGCC GGCCAACATG CACACCTGGG TTTCGGAAGC GAACCGGCTG AGGCGGGAAC TGCCGAAGGA GGTGCAGGAC ACGCTGCTGA AGCATGAGCT GGCGGGAAGC CTCACCGATC CGGATTATAT CGCCGCCTCG CGCGTCTTTT ATGACCGCCA TGTCTGCCGC GTGGTGCCGT GGCCAGCCGA AGTGGCGCGG ACCTTCGCGA TCATGGACGA GGACAACACT GTTTACCGCA ACATGAACGG ACCGACCGAA TTTCACGTCA TCGGCACGAT GAAGGACTGG ACGATCGAAA ACAGGCTCGA CCGCATCGAA GCCCCGACGT TGCTGATCTC CGGCCAACAT GACGAGGCGA CACCCCTGGT GGTAAGGCCC TATCTCGACC ATGTTCCCGG CTGCGAATGG GTGCTCTTCG AAAATTCCAG CCACATGCCG CATGTCGAGG AAAAGCAGCT TTGCCTGGCG ACCGTTTCCG CTTTCCTGTC ACGGCACGAT TGA
|
Protein sequence | MSEIVTNEAY LPFHDYHTWY RVTGSLESDK LPLVVAHGGP GCTHDYVDSF KDIAALDGRP VIHYDQLGNG NSTRLPDKGP DFWTVGLFLE ELDALLAHLG IQHRYAFLGQ SWGGMLGAEH AVRRPQGLKA LVIANSPANM HTWVSEANRL RRELPKEVQD TLLKHELAGS LTDPDYIAAS RVFYDRHVCR VVPWPAEVAR TFAIMDEDNT VYRNMNGPTE FHVIGTMKDW TIENRLDRIE APTLLISGQH DEATPLVVRP YLDHVPGCEW VLFENSSHMP HVEEKQLCLA TVSAFLSRHD
|
| |