Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4144 |
Symbol | |
ID | 8255279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 5013315 |
End bp | 5016179 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644937809 |
Product | peptidase M16 domain protein |
Protein accession | YP_003094397 |
Protein GI | 255534025 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.324827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC CTTTTAATAT CCTTGCGATA TGCCTGGTGC TGAATTTATC ATTTTATGCA CCAGCGTATC CACAAAAGAA AACTGTAACC GCTAAAAAAA CAATAGCTGT TGCTACGCAA AAAAATGAGG GAACACCAAT TCCCAATGAT CCTGATGTAA AAATTGGTAA ACTGGCAAAC GGTCTTACCT ACTACATCAG GAAAAATACA GAACCTAAGA ACAGGGCAGA ATTATATCTG GCTACCCGGA TTGGTTCGCT GATGGAAAAT GACGACCAGC AAGGACTTGC CCACTTTACG GAACACATGG CCTTTAACGG CACAAAAGAT TTCCCTAAAA ACGAAATGAT CAATTATCTG CAAAAGGCAG GTGTACGTTT TGGGGCCGAT CTGAACGCCT ACACAAGTTT TGATCAGACC GTTTACCAAC TGCCTATTCC TACAGATAGT GTAGCTGTAT TTAAGAATGG TTTCAAAATT TTGGCCAACT GGGCGGGTAA AATTGTAATG GAAGGAGATG AAATTGATAA AGAACGTGGT GTGATTGTAG AAGAAGACCG TCAGCGTGGT AAAAATGCCA AAGAGCGCAT GAGTAAGCAA TTGCTTCCCC TGCTCTTAAA AGACTCCCGC TATGCCAACC GTTTACCTAT TGGTAAATTA GACATCTTGC ATTCTTTTAC ACATGATAAG ATCAGAAATT TTTATAAGGA TTGGTACCGG CCAAATCTGC AGGCTGTAAT TGCAGTAGGC GATTTCGACG TAAACGAAGT AGAACGGCTG ATCAAAGCAA ATTTTTCTGA ACTGACCAAT CCTGTAAACC CAAGACCTCG TGTGGCTTAT GATCTTCCTG ACAATATAGC TCCGCTGGTA AAAATAATCA CCGATCCGGA GCAGCAATAT AATGTTGCAC AAGTAATGTA CAAACAACGG GGCAGGATTA TGAAAACCAC TGCCGATTAC AAAAAAAGTC TGATGTACAA TATGATCAAC AGCATGTTGG GGGCAAGGCT GCAGGAGATC ATGCAAAAAG GCAATGCACC TTTTATCCAG GCACAAAGCG GATACGGTCC ATACCAGGGC GGCCTGGTCC CAGGTATCAA TGCTTTTCAA TCATTTGCCG TTTCAAGTTC GGGAGCTACA CTAGAAAAAG CACTGACTGC TGTACTTGCC GAGAATGAAA GGATGAGCAA ATACGGCTTT TTGCAGTCTG AACTGGATGT AGCCAGAAAA AACATCCTTG CCGGAAACGA AAAAAGGTTA AAAGAAAAAG ACAAAACAGC TTCCTCCTCA TTTGTACAGA AATACCTTAA CAATTTTCTG ACAGGCACCA GCATCCCTTC TACTGAATTC GCTTATGAAC TGACGAATAA GCTGGTTGGA GAAATTACAC TGGAACAGGT AAATGCACTT GCAAAAACAC TGATCACTAC AGAAAACCAA ATCATTATTG TACAGGCACC CGAAAAAGAA AAAGCAGGCC TGCCTACTGA GGCGCAATTG CTTGCTGCCT TAAAAAATGC TGGCAATGGC GTAACTGCTT ATGTAGACAA TTCCTTAAAC AAACCTTTGC TGGAGCAAAA ACCTGCCGCC GGAAAGATTG TTAATGAACA GAAAATAGAT CAGATCGGCG TTACAGAACT GACCTTAAGC AATGGCATCA AAGTGTTACT GAAACCAACA GATTTCAAAA ATGACCAGAT CATTTTCAGT TCTTTTTCTA AAGGGGGCAC CTCATTGGCT ACAGACGCCA ATTTCCAGTC AGCCGAAACA GTTGGCCTCA TTCCGCAAAG TGGTGTGGGC GACTTTAACC CTTCGCAACT GAACAAGCTT CTTGCTGGGA ATACAGGAAG GGGCGGTGCT TATATAGATG GTTTATACCA GGGCTATCGT GGCAGCGCAT CGCCAAAAGA TCTGGAAACA GCTTTTCAAA TGGTATATGC CTATGCTACA AATCCACGTA AAGATGCGGA GCTCTTTAAC AAGAGCATCA GCGATTATAA AGTAGTACTG GCCAATAAAA GCGCTGATCC CGGAAGCGTT TTTGCAGATA CTGTGCAGGC AGTATTATCT TCTTATCACA AACGGGGCAT GCCTACAAAT TTGTCTGACC TGGACAAAAT CTCGCTGGAT GAAAGTTTTA ATTTTTATAA GAACCTGTTT GCAGATAACA GCGGGCAAAC TTTTGTAATT GTGGGCGCTT TTAATATGGA AACCATTAAG CCCCTGATTG AAACTTATAT TGCCAGTTTA CCAGCCTCAG GGCAGGCACA TAACTTTGTC GACAACGGAA TATACCCTCC CCTTGGAAAA GTAAGTAAAA CTGTTTACAA AGGACTAGAA GACAAAGCTT CGGTTGAACT GTATCTGCAT GGGGATTATG AGTTTAATGC CCAAAACAAT GTACAGCTGG AGGCATTAAA AGCGGCACTT GAGATAAAGA TACTCGAACG CCTGCGTGAA AAGGAAAGCG GCGTTTATAG TCCCAGGGTT GGGCTGAGCA TCAAGAAATA CCCAAAAGCC CATTATTATT TTACCATTTC TTTTAGCTGT GCCACAGCCA ATGTTGAAAA ACTGATTGGA GCCGCATTAG ATGAGGTTAA ACAGATTAAA GATAGTGGCG CAACTGCAGA TGACATCAGC AAGTTTAAAT CGGAAGAACA GCGCCAGATA GAGCTGAGCC TGCGTGACAA TAGTTACTGG CTGAGCTACC TGACCAACCG TTTGAAGAAC GGAGAAGCTC TTACACAATT ACTGGACGCC CAGCAAAGAA TAAACAATGT TACTGTAGAA ACTACCAGGG CAACAGCGCA AAAGTATTTA AACGAAGACA ACTATATTCG TTTGGTATTA CTGCCACAGA AATAA
|
Protein sequence | MKKPFNILAI CLVLNLSFYA PAYPQKKTVT AKKTIAVATQ KNEGTPIPND PDVKIGKLAN GLTYYIRKNT EPKNRAELYL ATRIGSLMEN DDQQGLAHFT EHMAFNGTKD FPKNEMINYL QKAGVRFGAD LNAYTSFDQT VYQLPIPTDS VAVFKNGFKI LANWAGKIVM EGDEIDKERG VIVEEDRQRG KNAKERMSKQ LLPLLLKDSR YANRLPIGKL DILHSFTHDK IRNFYKDWYR PNLQAVIAVG DFDVNEVERL IKANFSELTN PVNPRPRVAY DLPDNIAPLV KIITDPEQQY NVAQVMYKQR GRIMKTTADY KKSLMYNMIN SMLGARLQEI MQKGNAPFIQ AQSGYGPYQG GLVPGINAFQ SFAVSSSGAT LEKALTAVLA ENERMSKYGF LQSELDVARK NILAGNEKRL KEKDKTASSS FVQKYLNNFL TGTSIPSTEF AYELTNKLVG EITLEQVNAL AKTLITTENQ IIIVQAPEKE KAGLPTEAQL LAALKNAGNG VTAYVDNSLN KPLLEQKPAA GKIVNEQKID QIGVTELTLS NGIKVLLKPT DFKNDQIIFS SFSKGGTSLA TDANFQSAET VGLIPQSGVG DFNPSQLNKL LAGNTGRGGA YIDGLYQGYR GSASPKDLET AFQMVYAYAT NPRKDAELFN KSISDYKVVL ANKSADPGSV FADTVQAVLS SYHKRGMPTN LSDLDKISLD ESFNFYKNLF ADNSGQTFVI VGAFNMETIK PLIETYIASL PASGQAHNFV DNGIYPPLGK VSKTVYKGLE DKASVELYLH GDYEFNAQNN VQLEALKAAL EIKILERLRE KESGVYSPRV GLSIKKYPKA HYYFTISFSC ATANVEKLIG AALDEVKQIK DSGATADDIS KFKSEEQRQI ELSLRDNSYW LSYLTNRLKN GEALTQLLDA QQRINNVTVE TTRATAQKYL NEDNYIRLVL LPQK
|
| |