Gene Information |
Locus tag | PICST_81320 |
Symbol | HYU1.1 |
ID | 4836839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 526325 |
End bp | 530835 |
Gene Length | 4511 bp |
Protein Length | 1309 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388154 |
Product | 5-oxoprolinase |
Protein accession | XP_001382336 |
Protein GI | 150863758 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit; [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
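The Start bp, End bp and Gene Length values above are consistent with inclusive coordinates, so the length is End - Start + 1 rather than End - Start. The following is a minimal sketch of that arithmetic using the values from this record. The function name `span_length` is hypothetical and is not part of any database API. The coding length is simply what a 1309 aa protein plus one stop codon would need; it is shorter than the gene length, which suggests the listed gene span also covers non-coding flanking sequence.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table of this record.
gene_length = span_length(526325, 530835)

# Nucleotides needed to encode 1309 amino acids plus one stop codon.
coding_length = (1309 + 1) * 3

print(gene_length)    # 4511
print(coding_length)  # 3930
```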
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAATCCTGGT CTATTCAGGT TATTGGAGAA ATAACTGTGA CTGCAATAGA AATATATTGG TACTACCGAA ATTTCAATTC ATTTTATTCA TTTGTCGCCG TCCTAAAGTC AATACTGCTA AGACACTAAT TGAATTTTTC ATTTTTCAAT TTTTCACTTT TACATTTTCA GAATTTCTAT TGCATTTTAC ATTTTGCAAT TTTCAATTTC CAATCGCTTG CTCTCTCAAT GACTCACAAA GGAATCCAGA TTGCCATTGA CAGAGGTGGA ACCTTCTGTG ATGTCATAGC CAAGATCCCT GGGCAGCCAG ATCATGTGTT CAAGTTACTT TCGGTAGATC CCAAGAACTA TCCAGATGCT CCTACCGAAG GAATCAGAAG AGTGCTTGAA AAGGTGCATG GAATTGAAAT TCCAAAAGGA GCCAAGTTGA AGTTGGACAG TATCCAGTCA ATCCGCATGG GTACGACTGT AGCTACCAAT GCTTTGTTAG AAAGGAAAGG TGCCGGTGTC TTGCTAGTGA CTACCAAAGG TTTCAAAGAC GTCTTGGTAA TTGGAAACCA GACCAGACCC AGCATTTTCG ATTTGACAGC CAAGAAGCTC AGCTACTTAT ATGACCAAGT CTTGGAAATC GACGAAAGAG TCACCGTGCA CGGTTTTTCT GAAGGTGGAG GTGATAAGCT TGAATTGGAT GAAACTTCGG ATGAGTTGGT ACAAGGTGTA ACTGGTGACA CTATTAGAAT CTTAAAGAAA CCAGACTACG AAAAAGTGAC ACAAGACTTG CAAGCAATCT TTGATCAAGG AAAGATAAAG ACTATTGCTT TGTCGTTATT GCATTCATAC GCCTATCCTG AACATGAAGC ACAGGTAGCT AAGATCGCCA AAGATATTGG CTACACTGTG TCCGTTTCCC ATGAATTGCA GCCAATGATC GGTATGGTAA ATCGTACTTC CTCAACTGTT GCTGATGCCT ACTTAAGTCC TATCATCAAT GACTACATCC ACAACTTCGG AAATGGTTTT GAAGGTGGAC TTGAAGCTTT TGGTAACAAG TTGCTCTTCA TGCAATCAAA TGGTGGTCTT TGTCCCTGGT ACAAGTTCAC CGGGTTGAAA GCCATCTTAT CTGGACCTGC CGGCGGTATG GTTGGATATG GTGAAACTTG TTATGATGAC GTTACCAAGA AGGCCACCAT AGGTTTTGAT GCTGGTGGGA CCTCCACCGA TGTCTCTAGA TACTCAGGTA TTTTGGAACA TATATACGAA ACTGTAGTCA GCGAGGTCAG CCTTCAAACC CCACAGTTAG ATATCTCTAC GGTGGCTGCA GGAGGTGGTT CCATTTTGTT TTGGAAGAAT GGCATGTTTG TAGTTGGTCC TGAATCCGCA GGATCTGATC CAGGACCCGC TGCCTATAGA AAGGGTGGCC CTTTGACAGT CACTGATGCC AATTTATATT TAGGCAGATT ATTGCCAGAC TTCTTCCCGA AGATTTTCGG TCCTAATCAA GACCAGCCTT TGGATTACGA GTTAACTAGA AAGAAGTTTA AAGAATTGAC AGAAGAAATC AACAAGGACA AAGCTAAAGA AGGAATCAGT TTGACTCCAG AGGAAGTAGC CAGCGGATTC TTGAAGGTGG CTGTAGAAGC AATGGCTAGG CCAATCAGAA ACTTGACTGA AGCTAAGGGT TTCAACACTT CGGACCATAA TTTGGCATGT TTTGGCGGAT CTGGTGGTCA ATTCTCCGTT TCTTTGGCTA AAAATTTGGG AATATCGCAT GTCGCTATTC ACAAGTATTC CTCTTTGTTG 
TCTGCGTACG GTATCCAATT AGCTGATATT GTAATCGAAA AACAATCGCC AGCCTCATTT GTCTATTCTG AGCAGAACTT CAACAGTATT GACACCAAAG TTAATCTGTT GATTGATTTA GCTTACAAGG ATTACAAGGA TCAACACTTG TCAGAATTTA AGACCAAACT TGAGGTCTAC TTGAACATGA GATATGTGGG CTCAGATACT CATCTCCTTA TACCTAGAAT TGAAGGAGAG TACGATGCAG ACCAAAGGTT TATTCAAAGA CATCAGAGTG AATTCGGATT CACTTTGGAT AGAAAGTTGT TAGTGGACGA TGTCCAAGTC TTATTAATTG TTGAAAGTGA AGACAAGCAG AGCCATAATC CATATGAAGA ATTCAACAAA TTGACCAAGA CAATTATCGC TCAGAAATCT GAAACCATCA GACCAATTTA TTTCGAAGGT GAAGGATGGT TGGACACTTC GGTGTACTTG TTGCCCGAAT TGAAGTTTGG TACAATTATA GAAGGTCCAT CAATTATCAT CGATAACACA CAAACTATCT TAGTAGAACC GAAATCAAAG GCTGCCATCT TATCAGATCA TATTTTGATC TTAGTTGAAC AAGAAGAACG TCAAAATTTA TCTAGCAAGA TTGTTGATCC AATTCAGTTG TCGGTTTTTG GTCATAGATT CATGTCCATA GCTGAGCAGA TGGGAAGAAC TTTACAACAA ACAGCCATTT CAACCAACAT CAAAGAAAGG TTAGATTTCT CCTGTGCTTT ATTTGATGGA AATGGTGATT TAGTTGCTAA TGCTCCGCAT GTTCCAATCC ATTTGGGTGC AATGTCATTT GCTGTAAAGG CTCAGAGGAG TCTTTGGGAT GGAAAGTTGG AGCAAGGTGA TGTTCTTGTA TCCAACCACC CGCTGGCGGG AGGCTCTCAT TTACCAGATA TTACTGTTAT AACTCCCGTG TTAGATGAGA ACAATAATCC TATATTCTGG ACTGCTTCCA GAGGTCACCA TGCTGACATT GGTTCAATTT CTGCTGGTTC CATGCCTCCT AATTCCAAGA CCATCTATGA CGAAGGTGCT GCTATTGTGA CTCATAAGTT GTGTTCAAGA GGCAAGTTTG ATGAAGTTGG TATCACCAGA ATATTGTTGG AAGAACCAGC AAAGCATCCA GGTGGCTCAG GAACCAGAAC CTTGAACGAC AACATTTCCG ACTTGAAGGC TCAGGTTTCT GCAAACTACA AAGGGATAAC TTTATTGCAA AGATTAGTTG ACGAATTCAG TCTCGATGTT ATCAACTTGT ATATGGGAGC CATTCAGTCT ACAGCCGAAA TTGCTGTGCG TAATCTTTTA CGATTGGCTT ACGAAAAGTT TGGTGGTGAT GATTTGAAGG CTATTGACTA TTTGGATGAT GGTACACCCA TTGCTTTGAC AGTGAAAATC AATAATGACA CGGGCAGTGC TGTGTTTGAT TTCACAGAAT CAGGTGATGA AATCTACGGA AATTTGAATG CACCAAAGGC AATCTTGTAT TCTGCTGTGT TGTATGTTTT GAGATCGTTG ATTAGCAGTG ATATCCCTTT AAACAATGGC TGTTTGAGGC CGATAATCTT CAAAACAAGA CCTGGTTCTG TTGTGGACCC TTCATTTGAA GCTGCTGTTG TTGGTGGTAA CGTTGAAACT ACTCAACGTA TAGTTGATGT CATGTTAAAG GCATTTGAGG CTGCCGCAGC TTCCCAAGGA ACTTGTAACA ACTTCACTTT TGGTATAACC GACAAGAAGA ACAATGTTTC CTTCGGCTAC TATGAGACGA 
TCTGTGGCGG TTCTGGAGCT GGCCCTACTT GGGATGGTCA GTCTGTAGTC CAATGTCATA CAACAAACAC CAGAATTACC GACACTGAGT TATTTGAGAA ACGTTACCCT GTAATATTGC ACGAGTATTC TGTTCGCCAA GGATCTGGCG GTGATGGTTT TCATAAGGGT GGCAATGGTG TTGTTAGAGA TATTGAGTTT ACCTATCCGA ACTTGCAGGT GTCGTGTTTG ATGGAAAGAA GATCATTGGC TCCATTTGGA TTGTTGGGTG GTAAGTCTGG TTCTAGAGGC AGAAACTATT GGTACAGGCA CAATGAAGAA GAGCCCGGTA CGTTCAGACG AATTTATTTG GGTGGAAAAT GTACTGTTTC TATTTCTAAG GGTGATAGAG TTGTAATTAT GACTCCTGGC GGTGGTGGTT TTGGAGAAGC AAGAGTTGAT GGGGCTGTGA ACGATTCCAA TGCAGTTAGC TACCCGGTGA CCTCTAGCGT CCCAAGTATC CTCACGGGCT CAGTAGGAAT GAGATCAATT ACCCAGGAGA CGAACTGATT TTCTATATAT GAACATTTCG TCAATACATG TACAGACTTT AGAAGCATCA CAATATGTGT GAGAGTGCCA AGTTGAATCA AAAGGGTGCA AAAAAGTACT GAAAAAAGAT CTACCCAGAA TGGTAAGAGT TGCAACTGGT GACAAATATT TATGATTGTT TTGTCTAACT ATTCTAAATT CAAATAATCT CATGTATAAT TGAATACTTC AGTTTCTCAC ACGCCAACTT TGTATAATAT TGTACCTCTA GTCGATGTAG ACTACAACGT ATAAGTCATC TGTTAGGATA GCTCCATAGT TAGTATTCAG TCAATGTCTC CCCACAAAAT GCGACATCAC ACTCACCTGA T
|
Protein sequence | MTHKGIQIAI DRGGTFCDVI AKIPGQPDHV FKLLSVDPKN YPDAPTEGIR RVLEKVHGIE IPKGAKLKLD SIQSIRMGTT VATNALLERK GAGVLLVTTK GFKDVLVIGN QTRPSIFDLT AKKLSYLYDQ VLEIDERVTV HGFSEGGGDK LELDETSDEL VQGVTGDTIR ILKKPDYEKV TQDLQAIFDQ GKIKTIALSL LHSYAYPEHE AQVAKIAKDI GYTVSVSHEL QPMIGMVNRT SSTVADAYLS PIINDYIHNF GNGFEGGLEA FGNKLLFMQS NGGLCPWYKF TGLKAILSGP AGGMVGYGET CYDDVTKKAT IGFDAGGTST DVSRYSGILE HIYETVVSEV SLQTPQLDIS TVAAGGGSIL FWKNGMFVVG PESAGSDPGP AAYRKGGPLT VTDANLYLGR LLPDFFPKIF GPNQDQPLDY ELTRKKFKEL TEEINKDKAK EGISLTPEEV ASGFLKVAVE AMARPIRNLT EAKGFNTSDH NLACFGGSGG QFSVSLAKNL GISHVAIHKY SSLLSAYGIQ LADIVIEKQS PASFVYSEQN FNSIDTKVNS LIDLAYKDYK DQHLSEFKTK LEVYLNMRYV GSDTHLLIPR IEGEYDADQR FIQRHQSEFG FTLDRKLLVD DVQVLLIVES EDKQSHNPYE EFNKLTKTII AQKSETIRPI YFEGEGWLDT SVYLLPELKF GTIIEGPSII IDNTQTILVE PKSKAAILSD HILILVEQEE RQNLSSKIVD PIQLSVFGHR FMSIAEQMGR TLQQTAISTN IKERLDFSCA LFDGNGDLVA NAPHVPIHLG AMSFAVKAQR SLWDGKLEQG DVLVSNHPSA GGSHLPDITV ITPVLDENNN PIFWTASRGH HADIGSISAG SMPPNSKTIY DEGAAIVTHK LCSRGKFDEV GITRILLEEP AKHPGGSGTR TLNDNISDLK AQVSANYKGI TLLQRLVDEF SLDVINLYMG AIQSTAEIAV RNLLRLAYEK FGGDDLKAID YLDDGTPIAL TVKINNDTGS AVFDFTESGD EIYGNLNAPK AILYSAVLYV LRSLISSDIP LNNGCLRPII FKTRPGSVVD PSFEAAVVGG NVETTQRIVD VMLKAFEAAA ASQGTCNNFT FGITDKKNNV SFGYYETICG GSGAGPTWDG QSVVQCHTTN TRITDTELFE KRYPVILHEY SVRQGSGGDG FHKGGNGVVR DIEFTYPNLQ VSCLMERRSL APFGLLGGKS GSRGRNYWYR HNEEEPGTFR RIYLGGKCTV SISKGDRVVI MTPGGGGFGE ARVDGAVNDS NAVSYPVTSS VPSILTGSVG MRSITQETN
|
| |
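The record lists translation table 12, the NCBI "Alternative Yeast Nuclear Code", in which the codon CTG is read as serine instead of leucine. The sketch below illustrates how that choice affects translation of two short fragments copied from the gene sequence above. It is a hand-rolled illustration under stated assumptions, not the database's own pipeline: the helper names `build_table` and `translate` are hypothetical, and only the single CTG reassignment is modeled (alternative start codons are ignored).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order
# (the layout NCBI uses for its genetic code tables). '*' marks a stop codon.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(amino_acids: str) -> dict:
    """Map each codon to its one-letter amino acid code."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


STANDARD = build_table(STANDARD_AA)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the coding region, copied from the gene sequence in this record.
cds_start = "ATGACTCACAAAGGAATCCAGATTGCC"
# An in-frame fragment from the same gene that contains a CTG codon.
ctg_fragment = "AACCACCCGCTGGCGGGAGGC"

print(translate(cds_start, TABLE_12))     # MTHKGIQIA
print(translate(ctg_fragment, TABLE_12))  # NHPSAGG
print(translate(ctg_fragment, STANDARD))  # NHPLAGG
```

The table 12 result for the second fragment matches the listed protein sequence (…SNHPSAGGSH…), whereas the standard code would place a leucine at that position.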