Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1180 |
Symbol | |
ID | 8012293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1157547 |
End bp | 1158509 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823764 |
Product | proline iminopeptidase |
Protein accession | YP_002975014 |
Protein GI | 241203918 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.270645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.812597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA TACTGCGCAC GCTCTATCCC GAAATCGAAC CCTATGTTTC CGGCCATCTC GATGTCGGCG ACGGCCATGT GATCTATTGG GAGCGCTCGG GCACGCCTGG CGCCAAGCCC GCCGTCTTCC TGCATGGCGG CCCGGGCGGC GGCATCTCGC CCGCCCATCG CCGGCTCTTC GATCCCGCCC TCTACGACGT CATGCTGTTC GACCAGCGCG GTTGCGGCAG GTCGACGCCG CATGCGGAAC TGCATGCCAA CACGACCTGG CACCTCGTCG CCGATATCGA GCGGCTGCGC GAGATGGCCG GCGTCGACAG CTGGCAGGTG TTCGGCGGCT CCTGGGGCTC GACCCTGGCG CTCGCCTATG CCGAGACGCA TCCGCAGCAC GTCTCCGAGC TCATCCTGCG CGGCATCTAT ACGCTGACCA AGGCCGAGCT CGACTGGTAT TATCAATTCG GCGTCTCGGA GATGTTCCCT GACAAGTGGG AGCGGTTCAT CGCGCCGATT CCGCCGGAAG AGCGGCATGA GATGATGCAT GCCTATCATC GCCGCCTGAC GCATGAGGAC AGGAATGTGC GCCTCGCGGC GGCGCAGGCC TGGAGCATCT GGGAAGGCGA AACGATCACG CTGCTGCCGG AACCTTCGAC GAGCTTCAAG TTCGAGGAGC CGGAATTCGC CTATGCATTT GCGCGCATCG AGAACCATTT CTTTGTCAAT GCCGGCTGGA TGGACGAGGG ACAGCTGATC CGCGATGCCG GCAGGCTCAA GCATATTCCC GGCGTCATCG TGCATGGCCG CTACGACATG CCCTGCCCGG CCAAATATGC CTGGCTGCTG CACAAGGCCT GGCCGAAGGC GGAGTTCCAC CTGATCGAGG GTGCGGGCCA TGCCTATTCG GAGCCGGGGA TTCTCGATCG GCTGATCCGG GCGACGGATA AGTTTGCCAG GAAGCAAGAC TGA
|
Protein sequence | MTEILRTLYP EIEPYVSGHL DVGDGHVIYW ERSGTPGAKP AVFLHGGPGG GISPAHRRLF DPALYDVMLF DQRGCGRSTP HAELHANTTW HLVADIERLR EMAGVDSWQV FGGSWGSTLA LAYAETHPQH VSELILRGIY TLTKAELDWY YQFGVSEMFP DKWERFIAPI PPEERHEMMH AYHRRLTHED RNVRLAAAQA WSIWEGETIT LLPEPSTSFK FEEPEFAYAF ARIENHFFVN AGWMDEGQLI RDAGRLKHIP GVIVHGRYDM PCPAKYAWLL HKAWPKAEFH LIEGAGHAYS EPGILDRLIR ATDKFARKQD
|
| |