Gene Phep_2588 details

Gene Information

Locus tag: Phep_2588
Symbol:
ID: 8253695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3014527
End bp: 3017322
Gene Length: 2796 bp
Protein Length: 931 aa
Translation table: 11
GC content: 39%
IMG OID: 644936238
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_003092854
Protein GI: 255532482
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
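
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3014527
END_BP = 3017322
GENE_LENGTH_BP = 2796
PROTEIN_LENGTH_AA = 931


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, the listed gene length (2796 bp) matches the coordinate span, and 2796 / 3 = 932 codons less one stop codon gives the listed 931 aa.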


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0841472
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAGAT TTTTAAGCAT TACGCTACTT CTATTTTCAA GTTTAACTTA CGCTCAAAAA 
AAACCAATAA ACCATACTGT TTACGATAAC TGGGAGTCGG TTGGCACAAA ACAATTATCC
AATAATGGGA TATGGGCTTC TTATTCGGTG CTAAAGCAGG AAGGAGATGG TACTTTGTAC
CTCAACAATC TTTTGTCAAA TACCAGACTA AATGTATCAA GGGGTACGAA CCTACAATTT
AGCACCGATT CTAAATATGC GGCCTTAGTG GTTAAGCCCA TGTTTAAAGA TGTTCGTGAA
AGCAAGATAA AAAAGAAAAA ACCAGATGAG CTGACCAAAG ATTCCCTTTG CCTGGTTAAC
CTGACCAATC AGGCTATCGA CAAAGTGCCA AGGGTCAAAT CTTATAAAAT GCCTGAGAAA
GGCGCTTCAT TGGTTGCATA TTTACTTGAA AAACCTATAG ACACTTCAAA AAAAGCAAAA
CCAGATGCAC CAGAAAGCAA GACCAAACAG GAAGGTACCG ATCTGATAAT AAAAAACTTA
CTTACGGGCA CAACCCGTAC TTTTAAGTAT GTAAGTGATT ACAGCTTTAA TAAGAGTGGT
AAACAACTGG TTTTTGCCTG TACCGGTTCG AAGAAAGATA AATTGGCCGA CGAAGGCGTA
TTCCTCTTAA ATACAGAAAA AGGCAGTGTA AAAACCCTTG TAAAAGGTAA AGGAAACTTT
AAAAATTTCA TCTTTAATGA AGAAGGTGAA TACCTTGTAT TTTTAGGTGA GAGAAGTCCT
GAAAAAAAGG AGATCAAAGA TTTTAACATC TACTATAATT CTCCCAGTCT TGATACAGCA
CAGATTCTGG TAGATAACGA AATTACTGGA ATGCCTGCTA AATGGGCTGT TAGCGGCGAT
GGAAAACTGG GTTTCAGTAA AGATGGCAAT AAGCTGTACT TTGGTATTGC TCCGGTTAAA
AAACCAAAAG ATACTACCCT GGTTGATTTT GAAAATGCTA AACTGGATAT ATGGGGCTAT
AAAGACGATT ATTTGCAACC TATGCAGTTA AAAAACATGG AAAATGAGCT CAAAAGATCC
TATTTAACTG TCATGGAAAT ATTCAACAGC AATCCAAAGA TTGTTCCATT AGCGGATATT
AAACTTCCTG AAGTTATGCC GGTAGCTGAA GGCAATGCCA ATTTTGCATT GGCCTTTACA
GATTACGGCA ATAGGATCCA GTCGCAATGG ACTGGCAGTT CTATAAGAGA TTATTATTTA
GTAGATACCA AAACAGGAAG TAGAAAAAAA ATTATTTCAG ACCTTTCGGG ATATGCTATT
GCGTCTCCGG CAGGAAAATA TGTTCTTTAT TTTGACAAAA AAACAGCCAA CTGGTACACT
TATAATGTAC TTACTGCAAA AATCACCCAT TTAAATAATG GGCTAAATAT CAAATTTGTT
GATGAGGAAA ATGATGTTCC GGAAGACCCT TCACCTTATG GTCTGGCTGC CTGGACCGCA
GAGGACAAGG CAGCTCTGAT CTATGATCGT TACGATATAT GGGAATTTTC GCCGGAAGGA
AAAAATGAGC CTAAAAATAT CACCAATGGA TTTGGGCGGC AAAACAAGAT CACCTTCCGT
TATGAAAAAC TGGACACCGA AAGCAGGTTT TTAAATAAAA AAGAAACCAT CTGGTTGAAT
GCCTTTAACA ATACCACTAA AGAAAGTGGC TTTTATAAAA AAGAAATTGA CAATGCTAAA
AATCCGGAGC TGGTGGTGAT GGAAAAAATG AGATATTCAG GTATGGTTAA AGCCAAAGAT
GCAGAACGCT TCATTTTTGA CAAAGGTAGT TTCAGCAATT CACCAAATGT TTATGTGTCG
GCCGACATGA AAAACCAGCT AAAACTGAGC AATACCAACC CCCAGCAACA GGATTATAAC
TGGGGTACTG CAGAACTGGT AAAATGGACT ACACCTAAGG GTTTTAATGC AGAGGGCATA
TTATATAAGC CGGAAAACTT TGATCCGACA AAGAAATACC CGATGATTGC ATATTTCTAT
GAAAAACTTT CTGACGGTTT ATATACTTAC CAGTCACCGG CACCTACCCC ATCCAGGTTG
AATATTACTT ATTTTGTAAG TAATGGCTAC CTGGTATTTA CGCCGGATAT CAGTTATGAA
AAAGGCTACC CTGGTCGTTC TGCTGAAGAG TTTATCAATT CAGGAGTTGA AGCGTTAAAG
AAAAACAGTT GGGTAGACGG GACAAAAATA GGCATACAAG GGCAAAGCTG GGGTGGCTAC
CAGGTAGCCC ATCTGATTAC CAGAACCAAT ATGTATGCTG CTGCATGGGC TGGTGCACCG
GTAGTAAATA TGACTTCAGC TTACGGCGGT ATGCGTTGGG AAAGCGGTAT GAACCGACAG
TTCCAGTACG AAAAAACCCA GAGCCGTATT GGAGCTACTT TATGGGAAAA ACCAGAACTG
TACATCGAAA ACTCGCCATT ATTTAAATTG CCCAATGTAA CTACACCGGT TGTAATCATG
TCCAATGATG CCGATGGAGC AGTGCCATGG TACCAGGGAA TTGAAATGTT TACGGGTTTG
CGCAGATTGG GCAAACCAGC ATGGCTGCTA AACTACAATA ACGAAGCACA CAATTTAATG
CAGCGTCAAA ATAGAAAAGA CATACAAATT CGTGAGCAGC AATTCTTCGA TCATTACTTA
AAAGGTGCAA AAGCACCTGT TTGGATGGTA AATGGAATAC CTGCAACCGA AAAAGGCAAA
ACATGGGGAT TTGAACTTAC AGATGAAAAA CCTTAA
 
Protein sequence
MHRFLSITLL LFSSLTYAQK KPINHTVYDN WESVGTKQLS NNGIWASYSV LKQEGDGTLY 
LNNLLSNTRL NVSRGTNLQF STDSKYAALV VKPMFKDVRE SKIKKKKPDE LTKDSLCLVN
LTNQAIDKVP RVKSYKMPEK GASLVAYLLE KPIDTSKKAK PDAPESKTKQ EGTDLIIKNL
LTGTTRTFKY VSDYSFNKSG KQLVFACTGS KKDKLADEGV FLLNTEKGSV KTLVKGKGNF
KNFIFNEEGE YLVFLGERSP EKKEIKDFNI YYNSPSLDTA QILVDNEITG MPAKWAVSGD
GKLGFSKDGN KLYFGIAPVK KPKDTTLVDF ENAKLDIWGY KDDYLQPMQL KNMENELKRS
YLTVMEIFNS NPKIVPLADI KLPEVMPVAE GNANFALAFT DYGNRIQSQW TGSSIRDYYL
VDTKTGSRKK IISDLSGYAI ASPAGKYVLY FDKKTANWYT YNVLTAKITH LNNGLNIKFV
DEENDVPEDP SPYGLAAWTA EDKAALIYDR YDIWEFSPEG KNEPKNITNG FGRQNKITFR
YEKLDTESRF LNKKETIWLN AFNNTTKESG FYKKEIDNAK NPELVVMEKM RYSGMVKAKD
AERFIFDKGS FSNSPNVYVS ADMKNQLKLS NTNPQQQDYN WGTAELVKWT TPKGFNAEGI
LYKPENFDPT KKYPMIAYFY EKLSDGLYTY QSPAPTPSRL NITYFVSNGY LVFTPDISYE
KGYPGRSAEE FINSGVEALK KNSWVDGTKI GIQGQSWGGY QVAHLITRTN MYAAAWAGAP
VVNMTSAYGG MRWESGMNRQ FQYEKTQSRI GATLWEKPEL YIENSPLFKL PNVTTPVVIM
SNDADGAVPW YQGIEMFTGL RRLGKPAWLL NYNNEAHNLM QRQNRKDIQI REQQFFDHYL
KGAKAPVWMV NGIPATEKGK TWGFELTDEK P