Gene Information |
Locus tag | Smed_2684 |
Symbol | |
ID | 5323553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2786895 |
End bp | 2788778 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640791628 |
Product | pepF/M3 family oligoendopeptidase |
Protein accession | YP_001328349 |
Protein GI | 150397882 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
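The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (which matches the values listed above) and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(2786895, 2788778)
print(length, protein_length_aa(length))  # prints "1884 627"
```

Both results agree with the Gene Length and Protein Length rows of the record.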
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.667149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCG CTTCGCCACG TCTCGCCGCC TCCGGGCCCA CTTTGCTGCG TGCCGCGCTG GCGAACTCCG GGGCTGATCA GGCAGCCTCG GCGCCGGCCG AACTCGGCGA CCTGCCGTCC TGGCGGCTAA CGGACCTCTA CCTTTCGCCA TCCTCCGAGG AATTCCGCTC CGACCTCGCC AGGGCCGAGG CCGACGCGGT CGCCTTTGAG GCTAAGTGGA AGGGCAAGTT GGACCAGGCC GCCGGCCGAA CGGGTGATGA GGGTCTCGGC GCCGCCGTGA ATGAGTTCGA GGCACTGGAC GACCTCATGG GCCGCATCGC TTCGTTTGCC GGGCTCACCT ATTTTTCCGA CACTTCGAAT CCGGCGAACG GCAAGCTCTA CGGCGACATC CAGTCGAAGC TTACCGATAT TTCGGCGCAT CTCCTGTTCT TCTCGCTGGA GCTCAACCGC ATCGACGACA AGCGCGTCGA TGCGGCACTC GCGGCCGATG GCCTTGCCGC CCACTATGCC CCGTGGATTC TCGATCTGCG CAAGGACAAG CCCTATCAGC TCGACGACAG GCTGGAGCAA CTCTTCCTTG AAAAAACGAT GACTGGCGCA AACGCCTTCA ACCGGCTCTT CGACGAAACG ATCGCATCTC TGACCTTTTC GGTGGACGGC AAGGAGCAGC CGCTGGAAGT GACGCTCAAC CTTCTGCAGG ATTCCTCCGT AGAGGTGCGC AGGAAGGCCG CGCTCGCGCT CGCAGAAACC TTCAAGGCAA ATATCCGCAC CTTCACCCTC GTCACCAATA CGCTCGCGAA GGACAAGGAA ATCTCCGACC GCTGGCGCGG ATTCGAGGAT ATAGCCGACA GCCGCCATCT CGCCAATCGC GTGGAGCGCG AGGTCGTGGA TGCTCTGGCC GCATCGGTGA AGGCCGCCTA TCCACGCCTC TCGCACCGGT ATTATGCGAT GAAGGCCAAG TGGCTCGGCA TGGAGCAGAT GGATTTCTGG GACCGGAACG CTCCTCTGCC CGAGACCCCG AACGCGCTCA TTCCCTGGAA CGATGCCAGG GATACGGTCC TTTCCGCCTA TCACGCCTTC GCCCCGGAAA TGGCGTCGAT CGCCCGGCGC TTCTTCGACG ATGGCTGGAT CGATGCGCCG GTCCGCCCCG GCAAGGCGCC AGGCGCCTTC GCGCATCCGA CCGTCCCTTC CGTGCACCCC TATGTTCTCG TCAATTATAT GGGCAAGCCG CGCGACGTCA TGACGCTTGC CCACGAGCTC GGACACGGCG TCCACCAGGT CCTTGCCGGC GCGCAGGGCG CGCTGATGGC CTCGACACCC CTGACACTTG CGGAAACCGC ATCCGTCTTC GGAGAGATGC TGACCTTCCG CGCCCTCCTC GACCGGACGA AGGACCGGCG CGAGCGAAAG GCGATGCTTG CGCAGAAGGT CGAGGATATG ATCAACACGG TCGTCCGGCA GATCGCCTTC TACGATTTCG AACGCAAAGT GCATACGGCC CGCAAGGAAG GCGAACTGAC GGCCGAGGAT CTCGGCCGCA TATGGATGTC GGTGCAGAGC GAAAGCCTCG GCCCGGCGAT CCGGCTTTCG GAGAGTTACG AGACCTACTG GGCCTATATT CCCCACTTCA TCCATTCGCC TTTCTACGTC TATGCCTATG CCTTCGGTGA TTGCCTCGTA AACTCCCTCT ATGCCGTCTA TCAGAACGCC GAGCGCGGTT TCCAGGAGAA GTATTTCGAT ATGCTGAAGG CGGGCGGAAC GAAGCATCAT TCCGAACTTC TGGCGCCCTT CGGGCTCGAC 
GCCGCCGATC CGTCGTTCTG GGCACAGGGC CTGTCGATGA TTGAGGGGCT CATCGACGAA CTGGAGGCGC TTGATAAGGC GTAA
|
Protein sequence | MKFASPRLAA SGPTLLRAAL ANSGADQAAS APAELGDLPS WRLTDLYLSP SSEEFRSDLA RAEADAVAFE AKWKGKLDQA AGRTGDEGLG AAVNEFEALD DLMGRIASFA GLTYFSDTSN PANGKLYGDI QSKLTDISAH LLFFSLELNR IDDKRVDAAL AADGLAAHYA PWILDLRKDK PYQLDDRLEQ LFLEKTMTGA NAFNRLFDET IASLTFSVDG KEQPLEVTLN LLQDSSVEVR RKAALALAET FKANIRTFTL VTNTLAKDKE ISDRWRGFED IADSRHLANR VEREVVDALA ASVKAAYPRL SHRYYAMKAK WLGMEQMDFW DRNAPLPETP NALIPWNDAR DTVLSAYHAF APEMASIARR FFDDGWIDAP VRPGKAPGAF AHPTVPSVHP YVLVNYMGKP RDVMTLAHEL GHGVHQVLAG AQGALMASTP LTLAETASVF GEMLTFRALL DRTKDRRERK AMLAQKVEDM INTVVRQIAF YDFERKVHTA RKEGELTAED LGRIWMSVQS ESLGPAIRLS ESYETYWAYI PHFIHSPFYV YAYAFGDCLV NSLYAVYQNA ERGFQEKYFD MLKAGGTKHH SELLAPFGLD AADPSFWAQG LSMIEGLIDE LEALDKA
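The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a rough sketch of how one might strip the spacing, compute GC content, and run a basic sanity check on the coding sequence. The helper names are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, and a table 11 stop codon at the end."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the full gene).
toy = clean_sequence("ATGAAATTCG CTTAA")
print(len(toy), looks_like_complete_cds(toy))
```

Applied to the full gene sequence from the record, `gc_percent` should land close to the GC content row and `len` should equal the Gene Length row, provided the sequence was copied intact.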
|
| |