Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0717 |
Symbol | |
ID | 5321554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 768355 |
End bp | 769317 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640789654 |
Product | proline iminopeptidase |
Protein accession | YP_001326408 |
Protein GI | 150395941 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.922147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.435096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTCTC CATTGCGTAC TCTCTATCCG GAAATCGAGC CCTACGCCTC GGGTCGACTC GATGTGGGCG ACGGCCATTC GATCTACTGG GAGCGGGTGG GCACGCCCGG AGCGAAGCCG GCGGTCTTCC TGCACGGTGG CCCGGGCGGT ACGATTTCGC CGAACCACCG GCGGCTCTTC GATCCGGCGC TCTACGACGT GACGCTGTTC GACCAGCGCG GCTGCGGCAA GTCGGAGCCG CATGCCGGGA TCGAGGCGAA CACGACCTGG CATCTCGTCG CCGATATCGA GCGGCTGAGG GAAGCGGCCG GCGCGGACAA ATGGCTGGTT TTCGGCGGTT CCTGGGGTTC GACGCTGGCG CTTGCCTATA CCGAAACCCA TCCCGGGCGG GTCTCCGAAC TCGTCGTCAG GGGCATTTAC ACGCTGACCA GGGCCGAGCT CGACTGGTAC TATCAGTTCG GCGTTTCGGA ACTCTTCCCC GACAAGTGGG AACGCTTCAT CGCCCCGATC CCGCCGGAAG AGCGCCATGA GATGATGCGC GCCTACCATC GCCGCCTCAC GAGCGATGAC CGTGCGATAC GGCTTGCAGC GGCACGCGCC TGGAGCATAT GGGAGGGCGA GACGATAACG CTTCTGCCGG AGCCGGCCAC CAGCACGCCC TTCGAGGAAG ACGAATACGC GCTCGCCTTT GCCCGCATCG AGAACCATTT CTTCGTCAAT GCCGGATGGC TGGAAGAGGG CCAATTGCTG CGCGATGCGC ATAAGCTCCG CGGCATTCCG GGTGTGATCG TGCACGGCCG CTACGATATG CCGTGCCCGG CGAAATATGC ATGGCAATTG CACAAGGCTT GGCCGGAAGC GGAATTCCAT CTGATCGAGG GGGCCGGGCA CGCCTATTCG GAGCCCGGCA TTCTCGATCG GCTGATCCGA TCGACCGACA AATTCGCCGG CAAGGCCGAA TAA
|
Protein sequence | MSSPLRTLYP EIEPYASGRL DVGDGHSIYW ERVGTPGAKP AVFLHGGPGG TISPNHRRLF DPALYDVTLF DQRGCGKSEP HAGIEANTTW HLVADIERLR EAAGADKWLV FGGSWGSTLA LAYTETHPGR VSELVVRGIY TLTRAELDWY YQFGVSELFP DKWERFIAPI PPEERHEMMR AYHRRLTSDD RAIRLAAARA WSIWEGETIT LLPEPATSTP FEEDEYALAF ARIENHFFVN AGWLEEGQLL RDAHKLRGIP GVIVHGRYDM PCPAKYAWQL HKAWPEAEFH LIEGAGHAYS EPGILDRLIR STDKFAGKAE
|
| |