Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29736 |
Symbol | HYU1.2 |
ID | 4837633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1000679 |
End bp | 1004638 |
Gene Length | 3960 bp |
Protein Length | 1319 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388948 |
Product | 5-oxoprolinase |
Protein accession | XP_001382968 |
Protein GI | 150864232 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.434949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTGT CTGGCCATGG AAAAGTAGAA ATCGCCATAG ATAGAGGCGG AACCTTCACC GATGTCATCT ACAAATGTAA TAACCAAGAA GAACATTCGT TCAAGTTGCT ATCAGAGGAT CCGGCCAACT ATCAGGACGC AAATATCGAG GGCATACGTC GGGTTCTAGA AAAACTCACA GACTCTACTA TCCCTAGAGG CACACCTCTT GATACATCGA TTATATCTTC TATACGATTG GGAACCACAG TGGCTACCAA TGCTTTGCTA GAGAGAAAAG GTGCTCGCAT TGCCCTTGTT ACTACCAAAG GGTTCAAAGA TTTGCTCCAT ATAGGAGACC AATCCAGGCC AGACTTGTTT GCATTAAATA TCGTAAAACC TGGGGTACTC TATGATAAGG TGGTAGAAGC CGACGAAAGA GTTACCATGC CTGCTTTTTC CGTGGATCCA ACCGGTTATG ATGCCCAAGA TCTTGTAGAC GATGTTCGAT ATGTTTATGG AGAGACAAAT GAAGTCTTTG AAATATTGCA ACCTTTAGAC ACAGATAAAC TAACTGAGGA TTTGATCCAG TTGAAGAACG AAGGAATAAA TTCCATCGCT ATTGTATTTA TCCATGGCTT CAACTTCCAG AAGCACGAGA AGCTTGCTGG AGCAATTGCC AGAGAATTGG GATTTTCCAA CGTGTCTCTT TCTCACGAAA TCTTGCCTGT TATAAAAACG GTCTCGCGTG GCCAGTCAAC AACATTAGAT GCCTATCTAA CTCCAGTGGT CAAGCGGTAT ATCACCAATT TTGTTAGCGG TTTTAAGCCT GGTTTTGAAT CCCACACCAG AATCGAGTTC ATGCAATCAG ATGGTGGTTT ATGTACATGG AAAAATTTTA CAGGCTTACG ATCTTTGCTT TCAGGTCCTG CTGGTGGAGT CGTGGGGGCT GCAAAGACAT GCTATGATGC CGATGTTAAA ATCCCAGTCA TAGGGTTTGA TATGGGAGGC ACATCAACGG ATGTCTCCAG ATTCGCAGGT GATTATGAAT TTTCATTTGA AAGTGTAACT GCTGGCATAA AAATAGCTGC TCCTCAATTA GACATCAACA CTGTGGCTGC TGGTGGAGGA TCGATTTTGT TTTATAGAAA TGGGGCATTT GCAGTTGGTC CAGAATCCGC AGGGGCCCAT CCAGGTCCAG CTTGTTATAG AAAAGGCGGT CCTTTGACTA TTACTGATGC CAACTTATTC ACTGGCAGAA TCATTCCAGA GTTCTTTCCT AAGATATTTG GATATTCTGA AGATCAACCG TTGGACTACG AAATTACAAA GAAAAAATTC GAAGAATTGG CGAATGTGAT AAACAAAGAT AATCCAGATG TGCCAAAAAC TCCATTGGAA ATTGCCTTGG GGTTCTTAAA AGTTGCAGAT TTTCAAATGG CAAGACCAAT TCGAGATTTA ACAGAATCGA AAGGTCATGA TGTTAGCAAG CATTCTCTTG CCTCTTTTGG TGGAGCTGGT GGTCAGCATG CAACTTCAAT TGCCAAAATA TTAAAAATAA AGAGAGTCTT GATACACAAA CACTCATCCA TTTTATCAGC ATATGGTATT TATTTATCTT CTGTTGTTAA TGAGCAACAA GAACCGGTCT CAGAGGTTTA CTCCAAGGAG ATTGCACTTA TGTTGCTTGA TAAATGTAGT CAATTGAAGG AAAAATGCCA GAAGGAGTTA TTAAACCAAG GAGTACTACC CAACACCATC AACTACACTG TCTATTTTAA TATGGGCTAC AAAGGCTCTG ATACAAGAAT AATGATTAAA CAAAGAAAGA AGCAAGATTT TTTGGACTCA TTTTATAGAA GACATGACCT TGAATTTGCA TTCAATGATT ACGAAAAGCC TGTTTTAGTA TCTAATATAA GGGTAAGAGC ATGTGGGAGT GCTTCTGAAC TGATCCAAGA ACGTTCACCA TATAAAGATT TTGAATCAGT CACAAAATTT CCAGTGTCAT CTGATCTTAT CAAAAAGGTC ACAGAAGTAC ATTTCGAACA AGGTACTTTG GCAACTGCGG TATTCTTCTT GGATACATTG CCTGTGGGGG CAGTAATACC AGGACCAGCT CTAATTTTGG ATGAAACTCA AACCATCGTT GTTTCTCCCG ATTCTACAGC TACAATTCTT CCAAGACATG TGGTGCTTGA TTTGAAAACT GACGAGAAAC ATTCCATTTC CACCAACTTT GTCGATCCTA TACAATTATC TATATTTGCA AACAGATTCA TGTCAATCGC TGATGACATG TCAAGAACTC TTCAAAAGAT ATCTGTAAGT GCTAATATCA AGGAACGGTT GGATTATTCT TGTGCTCTAT TTGACAATCA AGGAAATCTC GCAGCTAATG CTCCAAATGT TCCTGTTCAT CTAGGTTCTA TGTCCACTGC AATTAAATAC CAGTTGGATT ATTGGAAAGA CAATTTACGT GAAGGTGACA TCTTGTGCTC AAATTCCCCA AGCGTTGGCG GTACACACTT ACCGGATGTA ACTGTTATTT CTCCCGTATT TATAGATGGA AAGATTCAGT TTGTTGTTGC TGCTCGTGCA CACCACTCGG AAATTGGCGG ATCTGCGCCT GGATCTTCTT CTTCATATGC CAGAGACATT TTTGAAGAAG GTGCCAATAT TGAAGCCTGG AAAATTGTTT CAAACGGCAA GTTTGATTAT GAGGGATTAC AAAAATATTT TGTTGAAGTT CCAAGGAGTC ATGGTGTTTC GGGAACCAGA AATATAGATG ACAATATTTC TGATCTCAAA GCCCAAATCG CGTCTAATCA ACGAGGAATT AATTTGTTGA AGGATTTATT TGAGGAGTAT GGAAGTGAAA CAGTGTTATT TTATATGCGA AATGTCAAGA AGTCTGCCGA ATTAGCCGTT CGAAACTTTT TCAAAGATTA CGCTACGAAG AACAAGAACA AATTGCCGCT AACTGCAGAA GATTTTATGG ACGATGGATG CAAAGTTCAA GTCAAAATAG AAATTGATGA AAATGATGGA TCTGCTGTCT TTGATTTCAC GGGTACTTCA TTGGAATCAT ATTCCAACCT TAATGCACCA AAATCGATTA CATATTCAAC GGTTATTTAC GTCTTGAGGT GTTTGGTAAA CCTAGAAATT CCTTTGAACC AAGGCTGTTT GGACCCCTGC ACCTTGATAA TTCCAGACAA CAGTTTGATT AATCCTAGTC GTTATGCGGC CGTTTGTGCT GGAAACGGTA TGACGTCTCA AAGATTGACT GATTGCTTAT TCAGAGCCTT TGGTTTGACT TCAGCTACTG GTGGATGTAT GAACGGTATT AATTTTGGCA CTGGTGGTGA AGACGCAAAC GGTAAAATGA TTAAGGGTTT TGGTTATACC GAGACTATTG GGCAAGGAAG CTGCGCTGGA ATTCTTGAGA AGAATGGAGA GAAATATGGT TTTGATGGTT TTTCAGGAAC GCAAACTAAT ATGACCAATA CGAAGATAAC TGATCCTGAA GTTTTAGAGC ACAGGTATCC CTGTTTGATA CTTCATTATG GAATAAGACA CAATTCTGGT GGTAAAGGGA AATGGAGAGG AGGTGATGGC TTGATTCGAG AAATAAGATT TAACTCACCT GTTCATGTTT CGTTGGTAAC TCAAAGACGA GTTTTCCAGC CTTGGGGAAT TTATGGAGGC GATCCGGGAG CTCGAGGTGA GAATTTTTTG GGAAGAGATA GAGGGAACGG TGTTATTCAG TGGATACAAT TGCCATCTTT AGCTGAAATC GAGATTGGCA AGGGAGACAT TCTCAAAATT CTCACTCCAG GCGGAGGGGG ATTTGGTCGA CCTGAAGATA CTGACGAGTT TTGGGGTGTA ACCAGGAAAC AAAAGGAAAA TTTTAAAACT ACTGTGAACT GTGAGGGCAC TCTTGGTAAA ATAAGAGATG CATGCAACTC ATCCCAGTAG
|
Protein sequence | MTLSGHGKVE IAIDRGGTFT DVIYKCNNQE EHSFKLLSED PANYQDANIE GIRRVLEKLT DSTIPRGTPL DTSIISSIRL GTTVATNALL ERKGARIALV TTKGFKDLLH IGDQSRPDLF ALNIVKPGVL YDKVVEADER VTMPAFSVDP TGYDAQDLVD DVRYVYGETN EVFEILQPLD TDKLTEDLIQ LKNEGINSIA IVFIHGFNFQ KHEKLAGAIA RELGFSNVSL SHEILPVIKT VSRGQSTTLD AYLTPVVKRY ITNFVSGFKP GFESHTRIEF MQSDGGLCTW KNFTGLRSLL SGPAGGVVGA AKTCYDADVK IPVIGFDMGG TSTDVSRFAG DYEFSFESVT AGIKIAAPQL DINTVAAGGG SILFYRNGAF AVGPESAGAH PGPACYRKGG PLTITDANLF TGRIIPEFFP KIFGYSEDQP LDYEITKKKF EELANVINKD NPDVPKTPLE IALGFLKVAD FQMARPIRDL TESKGHDVSK HSLASFGGAG GQHATSIAKI LKIKRVLIHK HSSILSAYGI YLSSVVNEQQ EPVSEVYSKE IALMLLDKCS QLKEKCQKEL LNQGVLPNTI NYTVYFNMGY KGSDTRIMIK QRKKQDFLDS FYRRHDLEFA FNDYEKPVLV SNIRVRACGS ASESIQERSP YKDFESVTKF PVSSDLIKKV TEVHFEQGTL ATAVFFLDTL PVGAVIPGPA LILDETQTIV VSPDSTATIL PRHVVLDLKT DEKHSISTNF VDPIQLSIFA NRFMSIADDM SRTLQKISVS ANIKERLDYS CALFDNQGNL AANAPNVPVH LGSMSTAIKY QLDYWKDNLR EGDILCSNSP SVGGTHLPDV TVISPVFIDG KIQFVVAARA HHSEIGGSAP GSSSSYARDI FEEGANIEAW KIVSNGKFDY EGLQKYFVEV PRSHGVSGTR NIDDNISDLK AQIASNQRGI NLLKDLFEEY GSETVLFYMR NVKKSAELAV RNFFKDYATK NKNKLPLTAE DFMDDGCKVQ VKIEIDENDG SAVFDFTGTS LESYSNLNAP KSITYSTVIY VLRCLVNLEI PLNQGCLDPC TLIIPDNSLI NPSRYAAVCA GNGMTSQRLT DCLFRAFGLT SATGGCMNGI NFGTGGEDAN GKMIKGFGYT ETIGQGSCAG ILEKNGEKYG FDGFSGTQTN MTNTKITDPE VLEHRYPCLI LHYGIRHNSG GKGKWRGGDG LIREIRFNSP VHVSLVTQRR VFQPWGIYGG DPGARGENFL GRDRGNGVIQ WIQLPSLAEI EIGKGDILKI LTPGGGGFGR PEDTDEFWGV TRKQKENFKT TVNCEGTLGK IRDACNSSQ
|
| |