Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_51091 |
Symbol | PRD1.1 |
ID | 4850851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 226796 |
End bp | 228772 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | |
GC content | 40% |
IMG OID | 640392559 |
Product | saccharolysin (oligopeptidase) |
Protein accession | XP_001387284 |
Protein GI | 126273711 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCATGGAACC ATACCCCACA ACAGATCGCT GATCTTACGG AAGAATTGAT AGAAACAACA AAAGCATTCA ATGATCACAT CGCTTCTTTA AGTAGTAATT TAACTGTAGA GGGAGTCTTG CTTCCGTATA TCGATTTTGA AAACGAGAGC CAGCTACTCA TCAATCAATT GTTCTTCTAC CAGTACGTTT CCAGTGATAA GGATATCAGG GATGCCTCTA CTGCTGCCGA AGAACTCTTT CTGGAGAAGA TGATTGAGCA GTCATTGAGA ACTGATGTCT ACGAGGTATT TAAAAAACTA CAAGAAAAAG TAGATTCTGG AATGTTGTCT ATTGCTAGCA AAGAACACCA ATTGTTCTTG AGCAAAACCA TGCTTGGGTT TAGAAAAAAC GGATTACATT TACCAGAGGA CCAGAGACAA GTGGTCAAGT CGTATCTCCT GAAATTGAAA GAGCTCTGTA TACATTTTTC CAAGAATGCC AACGAAGAAA ATGGGTACAT TCTTTTCAGC AAAGAAGAGC TTGAAGGTGT CCCTAAACTG ACGGTAGATT CATTCGAGCA AGTAGACAAG GATGGCGTTC AATTGTATAA GATGACATTC AAGTATCCAG ACATTTTTCC CGTTCTGGGA TTTGCTAATA ATGAAACGAC AAGAAAAACA GTATATCTAG GCAATGGTGA CAAATGCAAG GCAAACAATG TAATATTGGA GGAAATAATA GCGTTGAGAT ATAAGCTCGC AAAGCTTCTT GGGTTCAACA GTTTTTCAGA CTACGTTCTT GATGAGACTT TGGCTCAGAA CGTAACTACT GCTGTATCAT TTTTGACGGA TTTGAGAAGA AAATTAACTC CATTGGCTCA AATAGAACTT GAGAAACTTT CTGAGTTCAA GGGTGCAGAA GTATTCAAGT GGGACTTCAA ATACCTCGAG AACAAGATGT TATCGAAACA ATACCAGGTC AACGAGACAG AAATAGCCGA ATACTTCCCC ATGGAATCAA CCATAGAGAA GATGCTTGCT ATTTATGAGA AGCTCTTTGA TTTGGAGTTC CAACCGGTTC TAACCAATAC CTCGGTCTGG CATGAGGATG TCAGACAGTA TTTGGTGTTG ATTGGTGAAG GTTCAAACAA AAAGTTTCTT GGTGTCATTT ATTTTGATTT GCATCCAAGA GAGGGCAAGT ATGGTCATGC TGCCAACTTT GGAATTGCTC CAGGGTACGC AAAGAGAGAT GGAAAATCTC GTGCTTATCC GATCACTGCT TTAGTTTGCA ACTTTAGCAA GAAAACGGAA TCAAAGCCTT CTCTTCTTAA GCATTACGAA GTGAAGACTT TCTTCCATGA GTTAGGACAT GGTATTCATG ATCTTTTGGG CAGGACAGAG GTGGCTCGGT TCCATGGAAC AAATGTACCA CGGGACTTTG TAGAGACGCC TTCGCAGTCG TTTGAGTTCT GGACGTGGGA AAAGTCGATA TTGAAGAACT TATCGTCTCA TTATCTTACC AACGAGTCAT TGAGTGATAC GTTGATCGAC AACCTAGTAT CTACAAAGCA TGTCAACGGA GCATTGCATG CTTTAAGACA GTTGCATTTC GGACTCTTTG ATCTTGCTGT TCATCAACTT GAAGACGATG AGAGCCTCGA GCTATTGAAT ATCAGCCGGT TATGGAACAA CTTGAGCAAT GAAGTTTCTT TGATTTCACT GGGTAATTAC ACAGTTGATT CGTACGGGTC TTTTGGACAT ATTGCCGGAG GTTACGAATC AGGATACTAC AGCTACTTCT TCAGCGAAGT GTTTGGTGAT GATATTTATT ACACATTGTT CAAAGACGAT CCCATGAGTG TAGAAAATGG AAGAAAGTAT AGAGATATCG TTTTATCTAA GGGAAATTCA GAGGATATAA TGGACAACTT GAAGTTGTTG CTAGGAAGAG AACCTACCTC AGATGCTTTC TTAAAGGAAT ATGGATTGGA CAAGTGA
|
Protein sequence | SWNHTPQQIA DLTEELIETT KAFNDHIASL SSNLTVEGVL LPYIDFENES QLLINQLFFY QYVSSDKDIR DASTAAEELF LEKMIEQSLR TDVYEVFKKL QEKVDSGMLS IASKEHQLFL SKTMLGFRKN GLHLPEDQRQ VVKSYLLKLK ELCIHFSKNA NEENGYILFS KEELEGVPKL TVDSFEQVDK DGVQLYKMTF KYPDIFPVLG FANNETTRKT VYLGNGDKCK ANNVILEEII ALRYKLAKLL GFNSFSDYVL DETLAQNVTT AVSFLTDLRR KLTPLAQIEL EKLSEFKGAE VFKWDFKYLE NKMLSKQYQV NETEIAEYFP MESTIEKMLA IYEKLFDLEF QPVLTNTSVW HEDVRQYLVL IGEGSNKKFL GVIYFDLHPR EGKYGHAANF GIAPGYAKRD GKSRAYPITA LVCNFSKKTE SKPSLLKHYE VKTFFHELGH GIHDLLGRTE VARFHGTNVP RDFVETPSQS FEFWTWEKSI LKNLSSHYLT NESLSDTLID NLVSTKHVNG ALHALRQLHF GLFDLAVHQL EDDESLELLN ISRLWNNLSN EVSLISLGNY TVDSYGSFGH IAGGYESGYY SYFFSEVFGD DIYYTLFKDD PMSVENGRKY RDIVLSKGNS EDIMDNLKLL LGREPTSDAF LKEYGLDK
|
| |