Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2588 |
Symbol | |
ID | 8253695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3014527 |
End bp | 3017322 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644936238 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003092854 |
Protein GI | 255532482 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0841472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAGAT TTTTAAGCAT TACGCTACTT CTATTTTCAA GTTTAACTTA CGCTCAAAAA AAACCAATAA ACCATACTGT TTACGATAAC TGGGAGTCGG TTGGCACAAA ACAATTATCC AATAATGGGA TATGGGCTTC TTATTCGGTG CTAAAGCAGG AAGGAGATGG TACTTTGTAC CTCAACAATC TTTTGTCAAA TACCAGACTA AATGTATCAA GGGGTACGAA CCTACAATTT AGCACCGATT CTAAATATGC GGCCTTAGTG GTTAAGCCCA TGTTTAAAGA TGTTCGTGAA AGCAAGATAA AAAAGAAAAA ACCAGATGAG CTGACCAAAG ATTCCCTTTG CCTGGTTAAC CTGACCAATC AGGCTATCGA CAAAGTGCCA AGGGTCAAAT CTTATAAAAT GCCTGAGAAA GGCGCTTCAT TGGTTGCATA TTTACTTGAA AAACCTATAG ACACTTCAAA AAAAGCAAAA CCAGATGCAC CAGAAAGCAA GACCAAACAG GAAGGTACCG ATCTGATAAT AAAAAACTTA CTTACGGGCA CAACCCGTAC TTTTAAGTAT GTAAGTGATT ACAGCTTTAA TAAGAGTGGT AAACAACTGG TTTTTGCCTG TACCGGTTCG AAGAAAGATA AATTGGCCGA CGAAGGCGTA TTCCTCTTAA ATACAGAAAA AGGCAGTGTA AAAACCCTTG TAAAAGGTAA AGGAAACTTT AAAAATTTCA TCTTTAATGA AGAAGGTGAA TACCTTGTAT TTTTAGGTGA GAGAAGTCCT GAAAAAAAGG AGATCAAAGA TTTTAACATC TACTATAATT CTCCCAGTCT TGATACAGCA CAGATTCTGG TAGATAACGA AATTACTGGA ATGCCTGCTA AATGGGCTGT TAGCGGCGAT GGAAAACTGG GTTTCAGTAA AGATGGCAAT AAGCTGTACT TTGGTATTGC TCCGGTTAAA AAACCAAAAG ATACTACCCT GGTTGATTTT GAAAATGCTA AACTGGATAT ATGGGGCTAT AAAGACGATT ATTTGCAACC TATGCAGTTA AAAAACATGG AAAATGAGCT CAAAAGATCC TATTTAACTG TCATGGAAAT ATTCAACAGC AATCCAAAGA TTGTTCCATT AGCGGATATT AAACTTCCTG AAGTTATGCC GGTAGCTGAA GGCAATGCCA ATTTTGCATT GGCCTTTACA GATTACGGCA ATAGGATCCA GTCGCAATGG ACTGGCAGTT CTATAAGAGA TTATTATTTA GTAGATACCA AAACAGGAAG TAGAAAAAAA ATTATTTCAG ACCTTTCGGG ATATGCTATT GCGTCTCCGG CAGGAAAATA TGTTCTTTAT TTTGACAAAA AAACAGCCAA CTGGTACACT TATAATGTAC TTACTGCAAA AATCACCCAT TTAAATAATG GGCTAAATAT CAAATTTGTT GATGAGGAAA ATGATGTTCC GGAAGACCCT TCACCTTATG GTCTGGCTGC CTGGACCGCA GAGGACAAGG CAGCTCTGAT CTATGATCGT TACGATATAT GGGAATTTTC GCCGGAAGGA AAAAATGAGC CTAAAAATAT CACCAATGGA TTTGGGCGGC AAAACAAGAT CACCTTCCGT TATGAAAAAC TGGACACCGA AAGCAGGTTT TTAAATAAAA AAGAAACCAT CTGGTTGAAT GCCTTTAACA ATACCACTAA AGAAAGTGGC TTTTATAAAA AAGAAATTGA CAATGCTAAA AATCCGGAGC TGGTGGTGAT GGAAAAAATG AGATATTCAG GTATGGTTAA AGCCAAAGAT GCAGAACGCT TCATTTTTGA CAAAGGTAGT TTCAGCAATT CACCAAATGT TTATGTGTCG GCCGACATGA AAAACCAGCT AAAACTGAGC AATACCAACC CCCAGCAACA GGATTATAAC TGGGGTACTG CAGAACTGGT AAAATGGACT ACACCTAAGG GTTTTAATGC AGAGGGCATA TTATATAAGC CGGAAAACTT TGATCCGACA AAGAAATACC CGATGATTGC ATATTTCTAT GAAAAACTTT CTGACGGTTT ATATACTTAC CAGTCACCGG CACCTACCCC ATCCAGGTTG AATATTACTT ATTTTGTAAG TAATGGCTAC CTGGTATTTA CGCCGGATAT CAGTTATGAA AAAGGCTACC CTGGTCGTTC TGCTGAAGAG TTTATCAATT CAGGAGTTGA AGCGTTAAAG AAAAACAGTT GGGTAGACGG GACAAAAATA GGCATACAAG GGCAAAGCTG GGGTGGCTAC CAGGTAGCCC ATCTGATTAC CAGAACCAAT ATGTATGCTG CTGCATGGGC TGGTGCACCG GTAGTAAATA TGACTTCAGC TTACGGCGGT ATGCGTTGGG AAAGCGGTAT GAACCGACAG TTCCAGTACG AAAAAACCCA GAGCCGTATT GGAGCTACTT TATGGGAAAA ACCAGAACTG TACATCGAAA ACTCGCCATT ATTTAAATTG CCCAATGTAA CTACACCGGT TGTAATCATG TCCAATGATG CCGATGGAGC AGTGCCATGG TACCAGGGAA TTGAAATGTT TACGGGTTTG CGCAGATTGG GCAAACCAGC ATGGCTGCTA AACTACAATA ACGAAGCACA CAATTTAATG CAGCGTCAAA ATAGAAAAGA CATACAAATT CGTGAGCAGC AATTCTTCGA TCATTACTTA AAAGGTGCAA AAGCACCTGT TTGGATGGTA AATGGAATAC CTGCAACCGA AAAAGGCAAA ACATGGGGAT TTGAACTTAC AGATGAAAAA CCTTAA
|
Protein sequence | MHRFLSITLL LFSSLTYAQK KPINHTVYDN WESVGTKQLS NNGIWASYSV LKQEGDGTLY LNNLLSNTRL NVSRGTNLQF STDSKYAALV VKPMFKDVRE SKIKKKKPDE LTKDSLCLVN LTNQAIDKVP RVKSYKMPEK GASLVAYLLE KPIDTSKKAK PDAPESKTKQ EGTDLIIKNL LTGTTRTFKY VSDYSFNKSG KQLVFACTGS KKDKLADEGV FLLNTEKGSV KTLVKGKGNF KNFIFNEEGE YLVFLGERSP EKKEIKDFNI YYNSPSLDTA QILVDNEITG MPAKWAVSGD GKLGFSKDGN KLYFGIAPVK KPKDTTLVDF ENAKLDIWGY KDDYLQPMQL KNMENELKRS YLTVMEIFNS NPKIVPLADI KLPEVMPVAE GNANFALAFT DYGNRIQSQW TGSSIRDYYL VDTKTGSRKK IISDLSGYAI ASPAGKYVLY FDKKTANWYT YNVLTAKITH LNNGLNIKFV DEENDVPEDP SPYGLAAWTA EDKAALIYDR YDIWEFSPEG KNEPKNITNG FGRQNKITFR YEKLDTESRF LNKKETIWLN AFNNTTKESG FYKKEIDNAK NPELVVMEKM RYSGMVKAKD AERFIFDKGS FSNSPNVYVS ADMKNQLKLS NTNPQQQDYN WGTAELVKWT TPKGFNAEGI LYKPENFDPT KKYPMIAYFY EKLSDGLYTY QSPAPTPSRL NITYFVSNGY LVFTPDISYE KGYPGRSAEE FINSGVEALK KNSWVDGTKI GIQGQSWGGY QVAHLITRTN MYAAAWAGAP VVNMTSAYGG MRWESGMNRQ FQYEKTQSRI GATLWEKPEL YIENSPLFKL PNVTTPVVIM SNDADGAVPW YQGIEMFTGL RRLGKPAWLL NYNNEAHNLM QRQNRKDIQI REQQFFDHYL KGAKAPVWMV NGIPATEKGK TWGFELTDEK P
|
| |