Gene Information |
Locus tag | Rleg2_1033 |
Symbol | |
ID | 6979752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1053366 |
End bp | 1054328 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643395745 |
Product | proline iminopeptidase |
Protein accession | YP_002280553 |
Protein GI | 209548636 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
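The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1053366, 1054328)
print(length)                     # 963, matching "Gene Length"
print(protein_length_aa(length))  # 320, matching "Protein Length"
```

Under these assumptions the reported Gene Length (963 bp) and Protein Length (320 aa) agree with the listed coordinates.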
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.105269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA GACTGCGCAC GCTCTATCCT GAAATCGAAG CCTATGCTTC AGGCCATCTC GATGTCGGCG ATGGTCATGT GATCTATTGG GAGCGCTCGG GAACGCCTGG CGCCAAGCCC GCCGTTTTCC TGCATGGCGG CCCCGGTGGC GGCTTCTCGC CCGTCCATCG CCGGCTTTTC GATGCCGCCC TCTACGACGT CATGCTGTTC GACCAGCGCG GTTGCGGCAG GTCGACGCCG CATGCGGAAC TGAATGCCAA TACGACCTGG CACCTCGTCG CCGATATCGA GCGGCTGCGG GAAATGGCCG GCGTCGACAG CTGGCAGGTG TTCGGCGGCT CCTGGGGTTC GACGCTGGCG CTCGCCTATG CCGAGGCGCA TCCGGAGCGC GTCTCCGAAC TCATTCTGCG CGGCATCTAT ACGCTGACCA AGGCCGAGCT CGACTGGTAC TATCAGTTCG GCGTCTCGGA AATGTTCCCG GACAAATGGG AGCGCTTCAT CGCTCCCATT CCGCCGGAGG AGCGGCATGA GATGATGCAT GCCTATCATC GCCGCCTGAC ACATGAGGAT CGGTCCGTGC GCCTTGCCGC CGCGCAGACA TGGAGCATCT GGGAAGGCGA AACGATCACG CTGCTGCCTG AGCCTTCGAC CAGCGGCAAG TTCGAAGAAC CGGAATTCGC CTATGCATTT GCGCGCATCG AGAATCATTT CTTCGTCAAT GCCGGCTGGA TGGACGAGGG ACAGCTGATC CGCGATGCCG GCCGGCTGAA GGATATTCCC GGCGTCATCG TACATGGCCG CTACGACATG CCCTGCCCGG CCAAATACGC CTGGCTGCTG CACAAGGCCT GGCCGAAGGC GGAATTTCAC CTGATCGAGG GTGCCGGCCA TGCCTATTCG GAGCCCGGCA TTCTCGACCG GCTGATCCGG GCGACGGACA AGTTTGCCGG GAAACCCGAC TGA
Protein sequence | MTERLRTLYP EIEAYASGHL DVGDGHVIYW ERSGTPGAKP AVFLHGGPGG GFSPVHRRLF DAALYDVMLF DQRGCGRSTP HAELNANTTW HLVADIERLR EMAGVDSWQV FGGSWGSTLA LAYAEAHPER VSELILRGIY TLTKAELDWY YQFGVSEMFP DKWERFIAPI PPEERHEMMH AYHRRLTHED RSVRLAAAQT WSIWEGETIT LLPEPSTSGK FEEPEFAYAF ARIENHFFVN AGWMDEGQLI RDAGRLKDIP GVIVHGRYDM PCPAKYAWLL HKAWPKAEFH LIEGAGHAYS EPGILDRLIR ATDKFAGKPD
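The relationship between the two sequence fields can be checked with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TGA, even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above, as formatted in the record.
print(translate("ATGACCGAGA GACTGCGCAC GCTCTATCCT"))  # MTERLRTLYP
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once the spaces are removed from both, and `gc_percent` on the same input can be compared with the reported GC content.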