Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2056 |
Symbol | |
ID | 8013085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2049305 |
End bp | 2050279 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824642 |
Product | proline iminopeptidase |
Protein accession | YP_002975873 |
Protein GI | 241204777 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.604828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCTC TCTATCCCGA AATCGAACCC TATGATCATG GCCTGCTCGA TACGGGCGAC GGCAATCTGA TCTATTGGGA GGCCTGCGGC AATCCGGCGG GCCGCCCGGC GCTGGTGCTT CATGGCGGCC CTGGTTCCGG CTGTACGACC GCGGCGCGCC GCTATTTCGA TCCCGACGCC CACCGAATCA TTCTGTTCGA TCAGCGCAAT TGCGGCCGCA GCCTGCCGAG CGCTGCCGAT CCCGAAACCG ATCTCTCCCT CAACACCACC TGGCATATCG TTGCCGATAT CGAGCGGCTG CGGGCCTGTC TCGGCATCGA TACCTGGCTC CTTTTCGGCA ATTCCTGGGG TTCGACGCTG GCGCTGGCCT ATGCTGAAAC CCATCCGGAG TGTGTCGCCG CGATCGTCCT GTCAGGCGTG ACCACCACCC GGCGCTCGGA AATCGACTGG CTCTATCGTG GCATGGCGCC GCTCTTTCCG GAAGAATGGC AACGTTTCCG CCAGGCTGTT CCTCCTGGCA GCCAGGGACG GGACGAGGAC ATGGTTGCAG CCTATCATCG TCTCCTCAAC GATGCGGACC CGGAAACGCG CCTCCAAGCG GCGCGCGACT GGCATGATTG GGAGGCGGCC TCGATCCTGC TCGCCGATCC CCAAGGCCGG CCGCGCCGCT GGGCCGATCC GGCCTGTTTG CTGACGCGCG CCCGCATCAT CACCCACTAC TTCACCAACG GCGCATGGCT GGAGGACGCC CAGCTTTTGA AGAACACCGC GCGGCTCATC GGCATTCCCG GTATCCTGCT GCAGGGAAGG CTCGACATCG AGGCGCCGCT CGTCACGGCC TGGGAACTCG CCCGCGCCTG GCCGCAAAGC GAGCTCAGCA TCCTTCCGCA TGCTGCCCAT TCCATCGCAA ATCCGGATAT GAGCGCGGCG ATTGTGACTG CCACCGATCG ATTTCGCGAT TTTCCTCCAA AATAA
|
Protein sequence | MSALYPEIEP YDHGLLDTGD GNLIYWEACG NPAGRPALVL HGGPGSGCTT AARRYFDPDA HRIILFDQRN CGRSLPSAAD PETDLSLNTT WHIVADIERL RACLGIDTWL LFGNSWGSTL ALAYAETHPE CVAAIVLSGV TTTRRSEIDW LYRGMAPLFP EEWQRFRQAV PPGSQGRDED MVAAYHRLLN DADPETRLQA ARDWHDWEAA SILLADPQGR PRRWADPACL LTRARIITHY FTNGAWLEDA QLLKNTARLI GIPGILLQGR LDIEAPLVTA WELARAWPQS ELSILPHAAH SIANPDMSAA IVTATDRFRD FPPK
|
| |