Gene PICST_81320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81320 
SymbolHYU1.1 
ID4836839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp526325 
End bp530835 
Gene Length4511 bp 
Protein Length1309 aa 
Translation table12 
GC content41% 
IMG OID640388154 
Product5-oxoprolinase 
Protein accessionXP_001382336 
Protein GI150863758 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAATCCTGGT CTATTCAGGT TATTGGAGAA ATAACTGTGA CTGCAATAGA AATATATTGG 
TACTACCGAA ATTTCAATTC ATTTTATTCA TTTGTCGCCG TCCTAAAGTC AATACTGCTA
AGACACTAAT TGAATTTTTC ATTTTTCAAT TTTTCACTTT TACATTTTCA GAATTTCTAT
TGCATTTTAC ATTTTGCAAT TTTCAATTTC CAATCGCTTG CTCTCTCAAT GACTCACAAA
GGAATCCAGA TTGCCATTGA CAGAGGTGGA ACCTTCTGTG ATGTCATAGC CAAGATCCCT
GGGCAGCCAG ATCATGTGTT CAAGTTACTT TCGGTAGATC CCAAGAACTA TCCAGATGCT
CCTACCGAAG GAATCAGAAG AGTGCTTGAA AAGGTGCATG GAATTGAAAT TCCAAAAGGA
GCCAAGTTGA AGTTGGACAG TATCCAGTCA ATCCGCATGG GTACGACTGT AGCTACCAAT
GCTTTGTTAG AAAGGAAAGG TGCCGGTGTC TTGCTAGTGA CTACCAAAGG TTTCAAAGAC
GTCTTGGTAA TTGGAAACCA GACCAGACCC AGCATTTTCG ATTTGACAGC CAAGAAGCTC
AGCTACTTAT ATGACCAAGT CTTGGAAATC GACGAAAGAG TCACCGTGCA CGGTTTTTCT
GAAGGTGGAG GTGATAAGCT TGAATTGGAT GAAACTTCGG ATGAGTTGGT ACAAGGTGTA
ACTGGTGACA CTATTAGAAT CTTAAAGAAA CCAGACTACG AAAAAGTGAC ACAAGACTTG
CAAGCAATCT TTGATCAAGG AAAGATAAAG ACTATTGCTT TGTCGTTATT GCATTCATAC
GCCTATCCTG AACATGAAGC ACAGGTAGCT AAGATCGCCA AAGATATTGG CTACACTGTG
TCCGTTTCCC ATGAATTGCA GCCAATGATC GGTATGGTAA ATCGTACTTC CTCAACTGTT
GCTGATGCCT ACTTAAGTCC TATCATCAAT GACTACATCC ACAACTTCGG AAATGGTTTT
GAAGGTGGAC TTGAAGCTTT TGGTAACAAG TTGCTCTTCA TGCAATCAAA TGGTGGTCTT
TGTCCCTGGT ACAAGTTCAC CGGGTTGAAA GCCATCTTAT CTGGACCTGC CGGCGGTATG
GTTGGATATG GTGAAACTTG TTATGATGAC GTTACCAAGA AGGCCACCAT AGGTTTTGAT
GCTGGTGGGA CCTCCACCGA TGTCTCTAGA TACTCAGGTA TTTTGGAACA TATATACGAA
ACTGTAGTCA GCGAGGTCAG CCTTCAAACC CCACAGTTAG ATATCTCTAC GGTGGCTGCA
GGAGGTGGTT CCATTTTGTT TTGGAAGAAT GGCATGTTTG TAGTTGGTCC TGAATCCGCA
GGATCTGATC CAGGACCCGC TGCCTATAGA AAGGGTGGCC CTTTGACAGT CACTGATGCC
AATTTATATT TAGGCAGATT ATTGCCAGAC TTCTTCCCGA AGATTTTCGG TCCTAATCAA
GACCAGCCTT TGGATTACGA GTTAACTAGA AAGAAGTTTA AAGAATTGAC AGAAGAAATC
AACAAGGACA AAGCTAAAGA AGGAATCAGT TTGACTCCAG AGGAAGTAGC CAGCGGATTC
TTGAAGGTGG CTGTAGAAGC AATGGCTAGG CCAATCAGAA ACTTGACTGA AGCTAAGGGT
TTCAACACTT CGGACCATAA TTTGGCATGT TTTGGCGGAT CTGGTGGTCA ATTCTCCGTT
TCTTTGGCTA AAAATTTGGG AATATCGCAT GTCGCTATTC ACAAGTATTC CTCTTTGTTG
TCTGCGTACG GTATCCAATT AGCTGATATT GTAATCGAAA AACAATCGCC AGCCTCATTT
GTCTATTCTG AGCAGAACTT CAACAGTATT GACACCAAAG TTAATCTGTT GATTGATTTA
GCTTACAAGG ATTACAAGGA TCAACACTTG TCAGAATTTA AGACCAAACT TGAGGTCTAC
TTGAACATGA GATATGTGGG CTCAGATACT CATCTCCTTA TACCTAGAAT TGAAGGAGAG
TACGATGCAG ACCAAAGGTT TATTCAAAGA CATCAGAGTG AATTCGGATT CACTTTGGAT
AGAAAGTTGT TAGTGGACGA TGTCCAAGTC TTATTAATTG TTGAAAGTGA AGACAAGCAG
AGCCATAATC CATATGAAGA ATTCAACAAA TTGACCAAGA CAATTATCGC TCAGAAATCT
GAAACCATCA GACCAATTTA TTTCGAAGGT GAAGGATGGT TGGACACTTC GGTGTACTTG
TTGCCCGAAT TGAAGTTTGG TACAATTATA GAAGGTCCAT CAATTATCAT CGATAACACA
CAAACTATCT TAGTAGAACC GAAATCAAAG GCTGCCATCT TATCAGATCA TATTTTGATC
TTAGTTGAAC AAGAAGAACG TCAAAATTTA TCTAGCAAGA TTGTTGATCC AATTCAGTTG
TCGGTTTTTG GTCATAGATT CATGTCCATA GCTGAGCAGA TGGGAAGAAC TTTACAACAA
ACAGCCATTT CAACCAACAT CAAAGAAAGG TTAGATTTCT CCTGTGCTTT ATTTGATGGA
AATGGTGATT TAGTTGCTAA TGCTCCGCAT GTTCCAATCC ATTTGGGTGC AATGTCATTT
GCTGTAAAGG CTCAGAGGAG TCTTTGGGAT GGAAAGTTGG AGCAAGGTGA TGTTCTTGTA
TCCAACCACC CGCTGGCGGG AGGCTCTCAT TTACCAGATA TTACTGTTAT AACTCCCGTG
TTAGATGAGA ACAATAATCC TATATTCTGG ACTGCTTCCA GAGGTCACCA TGCTGACATT
GGTTCAATTT CTGCTGGTTC CATGCCTCCT AATTCCAAGA CCATCTATGA CGAAGGTGCT
GCTATTGTGA CTCATAAGTT GTGTTCAAGA GGCAAGTTTG ATGAAGTTGG TATCACCAGA
ATATTGTTGG AAGAACCAGC AAAGCATCCA GGTGGCTCAG GAACCAGAAC CTTGAACGAC
AACATTTCCG ACTTGAAGGC TCAGGTTTCT GCAAACTACA AAGGGATAAC TTTATTGCAA
AGATTAGTTG ACGAATTCAG TCTCGATGTT ATCAACTTGT ATATGGGAGC CATTCAGTCT
ACAGCCGAAA TTGCTGTGCG TAATCTTTTA CGATTGGCTT ACGAAAAGTT TGGTGGTGAT
GATTTGAAGG CTATTGACTA TTTGGATGAT GGTACACCCA TTGCTTTGAC AGTGAAAATC
AATAATGACA CGGGCAGTGC TGTGTTTGAT TTCACAGAAT CAGGTGATGA AATCTACGGA
AATTTGAATG CACCAAAGGC AATCTTGTAT TCTGCTGTGT TGTATGTTTT GAGATCGTTG
ATTAGCAGTG ATATCCCTTT AAACAATGGC TGTTTGAGGC CGATAATCTT CAAAACAAGA
CCTGGTTCTG TTGTGGACCC TTCATTTGAA GCTGCTGTTG TTGGTGGTAA CGTTGAAACT
ACTCAACGTA TAGTTGATGT CATGTTAAAG GCATTTGAGG CTGCCGCAGC TTCCCAAGGA
ACTTGTAACA ACTTCACTTT TGGTATAACC GACAAGAAGA ACAATGTTTC CTTCGGCTAC
TATGAGACGA TCTGTGGCGG TTCTGGAGCT GGCCCTACTT GGGATGGTCA GTCTGTAGTC
CAATGTCATA CAACAAACAC CAGAATTACC GACACTGAGT TATTTGAGAA ACGTTACCCT
GTAATATTGC ACGAGTATTC TGTTCGCCAA GGATCTGGCG GTGATGGTTT TCATAAGGGT
GGCAATGGTG TTGTTAGAGA TATTGAGTTT ACCTATCCGA ACTTGCAGGT GTCGTGTTTG
ATGGAAAGAA GATCATTGGC TCCATTTGGA TTGTTGGGTG GTAAGTCTGG TTCTAGAGGC
AGAAACTATT GGTACAGGCA CAATGAAGAA GAGCCCGGTA CGTTCAGACG AATTTATTTG
GGTGGAAAAT GTACTGTTTC TATTTCTAAG GGTGATAGAG TTGTAATTAT GACTCCTGGC
GGTGGTGGTT TTGGAGAAGC AAGAGTTGAT GGGGCTGTGA ACGATTCCAA TGCAGTTAGC
TACCCGGTGA CCTCTAGCGT CCCAAGTATC CTCACGGGCT CAGTAGGAAT GAGATCAATT
ACCCAGGAGA CGAACTGATT TTCTATATAT GAACATTTCG TCAATACATG TACAGACTTT
AGAAGCATCA CAATATGTGT GAGAGTGCCA AGTTGAATCA AAAGGGTGCA AAAAAGTACT
GAAAAAAGAT CTACCCAGAA TGGTAAGAGT TGCAACTGGT GACAAATATT TATGATTGTT
TTGTCTAACT ATTCTAAATT CAAATAATCT CATGTATAAT TGAATACTTC AGTTTCTCAC
ACGCCAACTT TGTATAATAT TGTACCTCTA GTCGATGTAG ACTACAACGT ATAAGTCATC
TGTTAGGATA GCTCCATAGT TAGTATTCAG TCAATGTCTC CCCACAAAAT GCGACATCAC
ACTCACCTGA T
 
Protein sequence
MTHKGIQIAI DRGGTFCDVI AKIPGQPDHV FKLLSVDPKN YPDAPTEGIR RVLEKVHGIE 
IPKGAKLKLD SIQSIRMGTT VATNALLERK GAGVLLVTTK GFKDVLVIGN QTRPSIFDLT
AKKLSYLYDQ VLEIDERVTV HGFSEGGGDK LELDETSDEL VQGVTGDTIR ILKKPDYEKV
TQDLQAIFDQ GKIKTIALSL LHSYAYPEHE AQVAKIAKDI GYTVSVSHEL QPMIGMVNRT
SSTVADAYLS PIINDYIHNF GNGFEGGLEA FGNKLLFMQS NGGLCPWYKF TGLKAILSGP
AGGMVGYGET CYDDVTKKAT IGFDAGGTST DVSRYSGILE HIYETVVSEV SLQTPQLDIS
TVAAGGGSIL FWKNGMFVVG PESAGSDPGP AAYRKGGPLT VTDANLYLGR LLPDFFPKIF
GPNQDQPLDY ELTRKKFKEL TEEINKDKAK EGISLTPEEV ASGFLKVAVE AMARPIRNLT
EAKGFNTSDH NLACFGGSGG QFSVSLAKNL GISHVAIHKY SSLLSAYGIQ LADIVIEKQS
PASFVYSEQN FNSIDTKVNS LIDLAYKDYK DQHLSEFKTK LEVYLNMRYV GSDTHLLIPR
IEGEYDADQR FIQRHQSEFG FTLDRKLLVD DVQVLLIVES EDKQSHNPYE EFNKLTKTII
AQKSETIRPI YFEGEGWLDT SVYLLPELKF GTIIEGPSII IDNTQTILVE PKSKAAILSD
HILILVEQEE RQNLSSKIVD PIQLSVFGHR FMSIAEQMGR TLQQTAISTN IKERLDFSCA
LFDGNGDLVA NAPHVPIHLG AMSFAVKAQR SLWDGKLEQG DVLVSNHPSA GGSHLPDITV
ITPVLDENNN PIFWTASRGH HADIGSISAG SMPPNSKTIY DEGAAIVTHK LCSRGKFDEV
GITRILLEEP AKHPGGSGTR TLNDNISDLK AQVSANYKGI TLLQRLVDEF SLDVINLYMG
AIQSTAEIAV RNLLRLAYEK FGGDDLKAID YLDDGTPIAL TVKINNDTGS AVFDFTESGD
EIYGNLNAPK AILYSAVLYV LRSLISSDIP LNNGCLRPII FKTRPGSVVD PSFEAAVVGG
NVETTQRIVD VMLKAFEAAA ASQGTCNNFT FGITDKKNNV SFGYYETICG GSGAGPTWDG
QSVVQCHTTN TRITDTELFE KRYPVILHEY SVRQGSGGDG FHKGGNGVVR DIEFTYPNLQ
VSCLMERRSL APFGLLGGKS GSRGRNYWYR HNEEEPGTFR RIYLGGKCTV SISKGDRVVI
MTPGGGGFGE ARVDGAVNDS NAVSYPVTSS VPSILTGSVG MRSITQETN