Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1856 |
Symbol | |
ID | 6980594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1903369 |
End bp | 1904337 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396578 |
Product | proline iminopeptidase |
Protein accession | YP_002281367 |
Protein GI | 209549450 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.455189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0201858 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCTC TCTATCCGGA AATCGAACCC TATGATCATG GCCTGCTCGA CACGGGCGAC GGCAACCTGA TCTATTGGGA GGCCTGCGGC AATCCGGCTG GCCGGCCGGC GCTGGTGCTG CATGGCGGGC CGGGCTCCGG CTGTTCGACG ACGGCGCGGC GCCATTTCGA TCCGGACGCC TACCGAATCA TTCTGTTCGA TCAGCGCAAT TGCGGCCGCA GTCTGCCGAG CGCTGCCGAT CCCGAAACCG ATCTCTCCCG CAACACCACC TGGCATCTCG TTGCCGATAT CGAACGGCTG CGGGTCTTCT TCGGCATCGA CACCTGGCTG GTTTTCGGCA ACTCCTGGGG ATCGACGCTG GCGCTCACGT ATGCCGAAAC CCATCCGCAG CGCGTCGCCG CGATCGTCAT ATCAGGCGTG ACCACCACAA GGCGCTCGGA AATCGACTGG CTCTATCGCG GCATGGCGCC GCTTTTCCCG GAAGAATGGC ACCGTTTCCG CCGGGCGGCG TGCTGCCAGG AGCAAGACCT GGACATGGTT GCCACCTATC ATCATCTCCT CAATCATCCC GACTCAGAGA CGCGCCTGAA GGCGGCGCGC GACTGGCATG ACTGGGAAGC GGCTTCCATC CTGCTCGCCG ATCCCAAAGG CCTGCCGCGC CGCTGGGCCG ATCCGGCTTA TTTGCTGACG CGCGCCCGCA TCATCACCCA CTATTTTGCC AATGGCGCCT GGCTGGAGGA CGGCCAGCTT TTGAACAATG CCGACCGGCT GGCAGGCATT CCGGCGATCC TGCTGCAGGG GCGGTTCGAT ATCGAGGCGC CGCTGGTCAC TGCCTGGGAA CTGGCCCGCG CCTGGCCGCA AAGCGAGCTG CAGATTCTTC CGCATGCTGC CCATTCCACC GCAAATCCCA ATATGAGCGC CGCGATCGTC GCCGCCACCG ATCGATTTCG CCATCTCCAT CAAAAATAA
|
Protein sequence | MSALYPEIEP YDHGLLDTGD GNLIYWEACG NPAGRPALVL HGGPGSGCST TARRHFDPDA YRIILFDQRN CGRSLPSAAD PETDLSRNTT WHLVADIERL RVFFGIDTWL VFGNSWGSTL ALTYAETHPQ RVAAIVISGV TTTRRSEIDW LYRGMAPLFP EEWHRFRRAA CCQEQDLDMV ATYHHLLNHP DSETRLKAAR DWHDWEAASI LLADPKGLPR RWADPAYLLT RARIITHYFA NGAWLEDGQL LNNADRLAGI PAILLQGRFD IEAPLVTAWE LARAWPQSEL QILPHAAHST ANPNMSAAIV AATDRFRHLH QK
|
| |