Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_84390 |
Symbol | DAP2 |
ID | 4840040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 974207 |
End bp | 976980 |
Gene Length | 2774 bp |
Protein Length | 852 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391355 |
Product | dipeptidyl aminopeptidase B |
Protein accession | XP_001385880 |
Protein GI | 150866325 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.309155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.832688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAAAATTACC AGATACATTC ATTATCTGGA ATACTTTTTC GTGGTATTGA TACATAGTCA GTTCCTGCTC AGCAGCCTGA TTTCGTCCTC TTATCATATC CAGACTTCTC ATTCTTTGCT ATATTCAAAG TTCTAATCTC ACATGACCAA CAAGTACGAT AGTGACGCTC ACGTCCCGGA GGTCAAGAAG AAAGACTTTC GGCGATCCTA TGAGCTGGAG CTCGACAATG AGTTTAGTTC TCGGGTCAAC CATTACAACT ACAAAAATTT CTTCATTTCT GGAGTCCTTT TGTCTATTTT GATATGGGGG TCATCCTTCT TAATAACTGC CATCACAAAC TTGCGAACCA CTGAAATACT ACATCATGAA TACGAACAAT TGTTGAAGGC CAACTCGCCT CCATTATTGC AAAGCAGGCC GGCTGTAGCA TGGGATGGTT CCGGAAAGAT TCCCTTGAAC TTCTCTGCTG TAAGAGACGG AAAGTTCAGA CCAAATTATA AATCGTTACA GTGGATCCAC GAACCACTGT CCATCGAAAG CGATAAGGGT ACATATGTGT TAAAGGATGA CTCCAACGGA CATTTAGAAT ATACTATCAA GTCTATTGTT GACGATGACT ATCTGTTCAC TTTGTACAAT GGTTCCACAT TTAACTACAG CGATACAGAG TACAAAATTG AGTCCTTGAT AGCGTCTCCA GACTTAACAA AGGCAATCTT GAAAACAAAC GCTACACACA ACTGGAGACA TTCAAGTTTT GCATTGTATT GGTTGCTCGA TATCTCTGAA TCCTCGATAA CTCCGATATT AGACACCACA AGCAAGTTGG CCGTCACTTC GTGGTCGCCA AAGTCGACCG ACATTGCATT TATTTTTGAC AATAATGTCT ATGTGAAGAA TATTGCATCT GGAGAAGTCA AACAGGTCAC GTTTGATGGA AGTTCTCAGG TCTTCAATGG TAAGCCGGAC TGGGTTTACG AAGAAGAGGT TTTTGCTGGC GATATTGTTT TATGGTGGTC CCCAAGTGGA GATAAATTTA CTTTTTTGAA GTCAAATGAC ACCGAAGTTC CTGAGTTCAC TATTCCTTAT TACGTTCAAA ATGGCCATGA AGACTACCCG GAAGTTGTAC AAATCAAGTA TCCAAAGGCT GGGTACCCAA ACCCAAGTGT TGAGTTGGTT ATTTACGACT TGGATACCGA AAAAGATCAG CTTTTGGAAT TGAAATCTGA GAAGATTGTG TCGCTGGACA GATTAATTAC TGAGGTTGTT TGGGTGGGTT CATCTGTGTT GGTTAAGACT TCAAACAGAG CTAGTGATTT ATTGGAAATA TTTTTGGTTG ACAGTGTTCA AAATGAATCC AAGATAGTAA GAAGCTTAAC CGCTGATGAC AGCTGGTTTG AAGTAACTTC TAATACCTTG TTTGTTCCTA AAAATGAATC GTTGGGTCGT TTAGAAGATG GTTATGTTGA CACTGTTGTC TCAGGAGGTT TCAACCATTT GGCTTATTTC TCACCCCCCA GCGCATCTGA AGGTGTCTTG TTGACCAACG GCCAATGGGA AGTGGTAGGC GGAGTTGAGG CATTTGATTT CAACAAGAAT AAAGTTTACT TTGTCAGTAC CATGAAATCT TCTGTCGAAA GACACATCCA CTCAGTTAGT TTGTTTGACA AGTCCAATAA TGGACTTCCA AAAGTAGAAA ATATCACAAG TGGTGAAGGT TGGTTCGCTG GGTCATTTTC ATCTGGATCC AGATATTTAT TATTGACCTA TGAAGGTCCA GAAGTTCCAT ATCAAAAGTT GATTGATTTA TCCTCTCTTA AGGATGTCAA GACTATTGAG TCAAATCAAG AGGTCATCGA CAATTTAGAT GATTACCTTA TTCCTGAGGT AAAATATGAA GTTATTGAAT TACAAGACGA AGAAACTGGT GAAATTTTCA AAGCGAATGC CATTGAGACA TTGCCATTAC ACTTTGATAG TCGTCATAAG TACCCGGTGT TATTTTTTGT ATATGGTGGA CCAGGCTCTC AGCTTGTAAC AAAGAACTTT GCAGTTAGTT TCAGCTCGGT TGTTGCAGCT GAATTGAATG CTATTGTGGT TACAGTTGAC GGCAGAGGTA CTGGCTACAA TAACTACAAT GATGATTTGG GATCACAGTA CAAATTTATT GTCCGCGACA AATTAGGAAA GTATGAACCG TTGGATCAAA TAGCAGCTGC TAGATTATGG TCAGAGAAGA GTTACGTTGA TTCTGATAGA ATTGCTATTT GGGGATGGTC ATATGGAGGA TTTCTTACAT TGAAAACACT TGAAACTGAT GTTAAGCATA AAGTGTTTTC TTATGGAGTT TCCATCGCAC CAGTTACCAA ATGGAAGTTG TATGACTCCA TTTACACAGA AAGATACATG AGAACCCCTC AAGAAAATCC AGCGGGATAC GAAATTGCTT CCATTCATAA TATAACCAAT TTTGAACACG TCAAAAAGTT TTTCATTGGC CATGGAAGTG GGGATGATAA CGTGCATGTT CAGAATACAT TGAAATTAAT TGACGAGTTC AACTTGGGAA ACATTGAAAA CTTCGACTTC ATGATATTTC CAGACAGCGA CCATTCTATT CGTTACCACA ACGGTAATAA GGTTGTGTAC GATCGTATTT TGACTTTCTT AAGAAGAGCA TTCAATGGTG AGTTTGCATA AGCAATTCAC ATGTAACAGT ACATCATTTA TAGTAATAAC AGTAATAATA ACAATAATAA CGAAAATGAG AAGT
|
Protein sequence | MTNKYDSDAH VPEVKKKDFR RSYESELDNE FSSRVNHYNY KNFFISGVLL SILIWGSSFL ITAITNLRTT EILHHEYEQL LKANSPPLLQ SRPAVAWDGS GKIPLNFSAV RDGKFRPNYK SLQWIHEPSS IESDKGTYVL KDDSNGHLEY TIKSIVDDDY SFTLYNGSTF NYSDTEYKIE SLIASPDLTK AILKTNATHN WRHSSFALYW LLDISESSIT PILDTTSKLA VTSWSPKSTD IAFIFDNNVY VKNIASGEVK QVTFDGSSQV FNGKPDWVYE EEVFAGDIVL WWSPSGDKFT FLKSNDTEVP EFTIPYYVQN GHEDYPEVVQ IKYPKAGYPN PSVELVIYDL DTEKDQLLEL KSEKIVSSDR LITEVVWVGS SVLVKTSNRA SDLLEIFLVD SVQNESKIVR SLTADDSWFE VTSNTLFVPK NESLGRLEDG YVDTVVSGGF NHLAYFSPPS ASEGVLLTNG QWEVVGGVEA FDFNKNKVYF VSTMKSSVER HIHSVSLFDK SNNGLPKVEN ITSGEGWFAG SFSSGSRYLL LTYEGPEVPY QKLIDLSSLK DVKTIESNQE VIDNLDDYLI PEVKYEVIEL QDEETGEIFK ANAIETLPLH FDSRHKYPVL FFVYGGPGSQ LVTKNFAVSF SSVVAAELNA IVVTVDGRGT GYNNYNDDLG SQYKFIVRDK LGKYEPLDQI AAARLWSEKS YVDSDRIAIW GWSYGGFLTL KTLETDVKHK VFSYGVSIAP VTKWKLYDSI YTERYMRTPQ ENPAGYEIAS IHNITNFEHV KKFFIGHGSG DDNVHVQNTL KLIDEFNLGN IENFDFMIFP DSDHSIRYHN GNKVVYDRIL TFLRRAFNGE FA
|
| |