Gene PICST_29736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29736 
SymbolHYU1.2 
ID4837633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1000679 
End bp1004638 
Gene Length3960 bp 
Protein Length1319 aa 
Translation table12 
GC content40% 
IMG OID640388948 
Product5-oxoprolinase 
Protein accessionXP_001382968 
Protein GI150864232 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.434949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTGT CTGGCCATGG AAAAGTAGAA ATCGCCATAG ATAGAGGCGG AACCTTCACC 
GATGTCATCT ACAAATGTAA TAACCAAGAA GAACATTCGT TCAAGTTGCT ATCAGAGGAT
CCGGCCAACT ATCAGGACGC AAATATCGAG GGCATACGTC GGGTTCTAGA AAAACTCACA
GACTCTACTA TCCCTAGAGG CACACCTCTT GATACATCGA TTATATCTTC TATACGATTG
GGAACCACAG TGGCTACCAA TGCTTTGCTA GAGAGAAAAG GTGCTCGCAT TGCCCTTGTT
ACTACCAAAG GGTTCAAAGA TTTGCTCCAT ATAGGAGACC AATCCAGGCC AGACTTGTTT
GCATTAAATA TCGTAAAACC TGGGGTACTC TATGATAAGG TGGTAGAAGC CGACGAAAGA
GTTACCATGC CTGCTTTTTC CGTGGATCCA ACCGGTTATG ATGCCCAAGA TCTTGTAGAC
GATGTTCGAT ATGTTTATGG AGAGACAAAT GAAGTCTTTG AAATATTGCA ACCTTTAGAC
ACAGATAAAC TAACTGAGGA TTTGATCCAG TTGAAGAACG AAGGAATAAA TTCCATCGCT
ATTGTATTTA TCCATGGCTT CAACTTCCAG AAGCACGAGA AGCTTGCTGG AGCAATTGCC
AGAGAATTGG GATTTTCCAA CGTGTCTCTT TCTCACGAAA TCTTGCCTGT TATAAAAACG
GTCTCGCGTG GCCAGTCAAC AACATTAGAT GCCTATCTAA CTCCAGTGGT CAAGCGGTAT
ATCACCAATT TTGTTAGCGG TTTTAAGCCT GGTTTTGAAT CCCACACCAG AATCGAGTTC
ATGCAATCAG ATGGTGGTTT ATGTACATGG AAAAATTTTA CAGGCTTACG ATCTTTGCTT
TCAGGTCCTG CTGGTGGAGT CGTGGGGGCT GCAAAGACAT GCTATGATGC CGATGTTAAA
ATCCCAGTCA TAGGGTTTGA TATGGGAGGC ACATCAACGG ATGTCTCCAG ATTCGCAGGT
GATTATGAAT TTTCATTTGA AAGTGTAACT GCTGGCATAA AAATAGCTGC TCCTCAATTA
GACATCAACA CTGTGGCTGC TGGTGGAGGA TCGATTTTGT TTTATAGAAA TGGGGCATTT
GCAGTTGGTC CAGAATCCGC AGGGGCCCAT CCAGGTCCAG CTTGTTATAG AAAAGGCGGT
CCTTTGACTA TTACTGATGC CAACTTATTC ACTGGCAGAA TCATTCCAGA GTTCTTTCCT
AAGATATTTG GATATTCTGA AGATCAACCG TTGGACTACG AAATTACAAA GAAAAAATTC
GAAGAATTGG CGAATGTGAT AAACAAAGAT AATCCAGATG TGCCAAAAAC TCCATTGGAA
ATTGCCTTGG GGTTCTTAAA AGTTGCAGAT TTTCAAATGG CAAGACCAAT TCGAGATTTA
ACAGAATCGA AAGGTCATGA TGTTAGCAAG CATTCTCTTG CCTCTTTTGG TGGAGCTGGT
GGTCAGCATG CAACTTCAAT TGCCAAAATA TTAAAAATAA AGAGAGTCTT GATACACAAA
CACTCATCCA TTTTATCAGC ATATGGTATT TATTTATCTT CTGTTGTTAA TGAGCAACAA
GAACCGGTCT CAGAGGTTTA CTCCAAGGAG ATTGCACTTA TGTTGCTTGA TAAATGTAGT
CAATTGAAGG AAAAATGCCA GAAGGAGTTA TTAAACCAAG GAGTACTACC CAACACCATC
AACTACACTG TCTATTTTAA TATGGGCTAC AAAGGCTCTG ATACAAGAAT AATGATTAAA
CAAAGAAAGA AGCAAGATTT TTTGGACTCA TTTTATAGAA GACATGACCT TGAATTTGCA
TTCAATGATT ACGAAAAGCC TGTTTTAGTA TCTAATATAA GGGTAAGAGC ATGTGGGAGT
GCTTCTGAAC TGATCCAAGA ACGTTCACCA TATAAAGATT TTGAATCAGT CACAAAATTT
CCAGTGTCAT CTGATCTTAT CAAAAAGGTC ACAGAAGTAC ATTTCGAACA AGGTACTTTG
GCAACTGCGG TATTCTTCTT GGATACATTG CCTGTGGGGG CAGTAATACC AGGACCAGCT
CTAATTTTGG ATGAAACTCA AACCATCGTT GTTTCTCCCG ATTCTACAGC TACAATTCTT
CCAAGACATG TGGTGCTTGA TTTGAAAACT GACGAGAAAC ATTCCATTTC CACCAACTTT
GTCGATCCTA TACAATTATC TATATTTGCA AACAGATTCA TGTCAATCGC TGATGACATG
TCAAGAACTC TTCAAAAGAT ATCTGTAAGT GCTAATATCA AGGAACGGTT GGATTATTCT
TGTGCTCTAT TTGACAATCA AGGAAATCTC GCAGCTAATG CTCCAAATGT TCCTGTTCAT
CTAGGTTCTA TGTCCACTGC AATTAAATAC CAGTTGGATT ATTGGAAAGA CAATTTACGT
GAAGGTGACA TCTTGTGCTC AAATTCCCCA AGCGTTGGCG GTACACACTT ACCGGATGTA
ACTGTTATTT CTCCCGTATT TATAGATGGA AAGATTCAGT TTGTTGTTGC TGCTCGTGCA
CACCACTCGG AAATTGGCGG ATCTGCGCCT GGATCTTCTT CTTCATATGC CAGAGACATT
TTTGAAGAAG GTGCCAATAT TGAAGCCTGG AAAATTGTTT CAAACGGCAA GTTTGATTAT
GAGGGATTAC AAAAATATTT TGTTGAAGTT CCAAGGAGTC ATGGTGTTTC GGGAACCAGA
AATATAGATG ACAATATTTC TGATCTCAAA GCCCAAATCG CGTCTAATCA ACGAGGAATT
AATTTGTTGA AGGATTTATT TGAGGAGTAT GGAAGTGAAA CAGTGTTATT TTATATGCGA
AATGTCAAGA AGTCTGCCGA ATTAGCCGTT CGAAACTTTT TCAAAGATTA CGCTACGAAG
AACAAGAACA AATTGCCGCT AACTGCAGAA GATTTTATGG ACGATGGATG CAAAGTTCAA
GTCAAAATAG AAATTGATGA AAATGATGGA TCTGCTGTCT TTGATTTCAC GGGTACTTCA
TTGGAATCAT ATTCCAACCT TAATGCACCA AAATCGATTA CATATTCAAC GGTTATTTAC
GTCTTGAGGT GTTTGGTAAA CCTAGAAATT CCTTTGAACC AAGGCTGTTT GGACCCCTGC
ACCTTGATAA TTCCAGACAA CAGTTTGATT AATCCTAGTC GTTATGCGGC CGTTTGTGCT
GGAAACGGTA TGACGTCTCA AAGATTGACT GATTGCTTAT TCAGAGCCTT TGGTTTGACT
TCAGCTACTG GTGGATGTAT GAACGGTATT AATTTTGGCA CTGGTGGTGA AGACGCAAAC
GGTAAAATGA TTAAGGGTTT TGGTTATACC GAGACTATTG GGCAAGGAAG CTGCGCTGGA
ATTCTTGAGA AGAATGGAGA GAAATATGGT TTTGATGGTT TTTCAGGAAC GCAAACTAAT
ATGACCAATA CGAAGATAAC TGATCCTGAA GTTTTAGAGC ACAGGTATCC CTGTTTGATA
CTTCATTATG GAATAAGACA CAATTCTGGT GGTAAAGGGA AATGGAGAGG AGGTGATGGC
TTGATTCGAG AAATAAGATT TAACTCACCT GTTCATGTTT CGTTGGTAAC TCAAAGACGA
GTTTTCCAGC CTTGGGGAAT TTATGGAGGC GATCCGGGAG CTCGAGGTGA GAATTTTTTG
GGAAGAGATA GAGGGAACGG TGTTATTCAG TGGATACAAT TGCCATCTTT AGCTGAAATC
GAGATTGGCA AGGGAGACAT TCTCAAAATT CTCACTCCAG GCGGAGGGGG ATTTGGTCGA
CCTGAAGATA CTGACGAGTT TTGGGGTGTA ACCAGGAAAC AAAAGGAAAA TTTTAAAACT
ACTGTGAACT GTGAGGGCAC TCTTGGTAAA ATAAGAGATG CATGCAACTC ATCCCAGTAG
 
Protein sequence
MTLSGHGKVE IAIDRGGTFT DVIYKCNNQE EHSFKLLSED PANYQDANIE GIRRVLEKLT 
DSTIPRGTPL DTSIISSIRL GTTVATNALL ERKGARIALV TTKGFKDLLH IGDQSRPDLF
ALNIVKPGVL YDKVVEADER VTMPAFSVDP TGYDAQDLVD DVRYVYGETN EVFEILQPLD
TDKLTEDLIQ LKNEGINSIA IVFIHGFNFQ KHEKLAGAIA RELGFSNVSL SHEILPVIKT
VSRGQSTTLD AYLTPVVKRY ITNFVSGFKP GFESHTRIEF MQSDGGLCTW KNFTGLRSLL
SGPAGGVVGA AKTCYDADVK IPVIGFDMGG TSTDVSRFAG DYEFSFESVT AGIKIAAPQL
DINTVAAGGG SILFYRNGAF AVGPESAGAH PGPACYRKGG PLTITDANLF TGRIIPEFFP
KIFGYSEDQP LDYEITKKKF EELANVINKD NPDVPKTPLE IALGFLKVAD FQMARPIRDL
TESKGHDVSK HSLASFGGAG GQHATSIAKI LKIKRVLIHK HSSILSAYGI YLSSVVNEQQ
EPVSEVYSKE IALMLLDKCS QLKEKCQKEL LNQGVLPNTI NYTVYFNMGY KGSDTRIMIK
QRKKQDFLDS FYRRHDLEFA FNDYEKPVLV SNIRVRACGS ASESIQERSP YKDFESVTKF
PVSSDLIKKV TEVHFEQGTL ATAVFFLDTL PVGAVIPGPA LILDETQTIV VSPDSTATIL
PRHVVLDLKT DEKHSISTNF VDPIQLSIFA NRFMSIADDM SRTLQKISVS ANIKERLDYS
CALFDNQGNL AANAPNVPVH LGSMSTAIKY QLDYWKDNLR EGDILCSNSP SVGGTHLPDV
TVISPVFIDG KIQFVVAARA HHSEIGGSAP GSSSSYARDI FEEGANIEAW KIVSNGKFDY
EGLQKYFVEV PRSHGVSGTR NIDDNISDLK AQIASNQRGI NLLKDLFEEY GSETVLFYMR
NVKKSAELAV RNFFKDYATK NKNKLPLTAE DFMDDGCKVQ VKIEIDENDG SAVFDFTGTS
LESYSNLNAP KSITYSTVIY VLRCLVNLEI PLNQGCLDPC TLIIPDNSLI NPSRYAAVCA
GNGMTSQRLT DCLFRAFGLT SATGGCMNGI NFGTGGEDAN GKMIKGFGYT ETIGQGSCAG
ILEKNGEKYG FDGFSGTQTN MTNTKITDPE VLEHRYPCLI LHYGIRHNSG GKGKWRGGDG
LIREIRFNSP VHVSLVTQRR VFQPWGIYGG DPGARGENFL GRDRGNGVIQ WIQLPSLAEI
EIGKGDILKI LTPGGGGFGR PEDTDEFWGV TRKQKENFKT TVNCEGTLGK IRDACNSSQ