Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_89141 |
Symbol | |
ID | 4838646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 239973 |
End bp | 244735 |
Gene Length | 4763 bp |
Protein Length | 1571 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640389961 |
Product | predicted protein |
Protein accession | XP_001384008 |
Protein GI | 150864974 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase [COG0169] Shikimate 5-dehydrogenase [COG0337] 3-dehydroquinate synthetase [COG0703] Shikimate kinase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase [TIGR01093] 3-dehydroquinate dehydratase, type I [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase [TIGR01357] 3-dehydroquinate synthase [TIGR01809] shikimate-5-dehydrogenase, fungal AROM-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCTG TCGAGAAGGT ATCGATTCTC GGAGCCGAGA CCATCCATGT GGGATACGGC ATCCAGGACC ACATTGTCCA GGAGGTAATT TCACATCTAG CCTCATCTAC CTATGTGATT GTCACAGACA CCAACATGGC CAGAACTACT CCGTTCACCA AGTTGCGTAA CAAATTCGAA AGCAAACTTA AAGAATTACG TCCGGAGTCC CGTTTGCTTT TCTATTCAGT TTCACCAGGT GAAAACAACA AAAACAGAGA AACCAAGGCT GCTGTAGAAG ACTTCTTATT GCAACAGGGA TGTACCAGAG ACACCGTAAT TCTAGCTGTA GGAGGCGGAG TGATCGGGGA CATGATTGGT TTTGTAGCTG CAACCTTTAT GAGAGGGGTC AGAGTCGTTC AGGTTCCCAC TAGTTTGTTA GCCATGGTTG ACTCTTCTGT AGGTGGTAAG ACTGCCATAG ATACTCCATT GGGGAAGAAC TTTGTAGGAG CATTCCACCA GCCGGAGTAC GTCTTCGCAG ATGTTTCGTT CTTGGAGACC TTACCAACTA GACAGTTCAT CAACGGAATG GCTGAAGTCG TTAAGACGGC TGCGATCTGG AACGAAGAGG AATTCACTCG CTTGGAGAAG TTCTCCAAGA AGTTCCTCGC TGTAGTTTCT GCCAAAACCC CAGATTTAAT CTCAATTAAG GAAGAGTTAG TCAAAACGGT ACTTGAATCT ATTCGTGTAA AGGCTTTTGT CGTTTCGTCC GATGAGAAGG AAACTGGCTT GAGAAACTTG CTTAACTTTG GTCATACAAT TGGCCATGCA ATTGAAGCCG TGTTGACACC CCAAGCTCTC CACGGTGAAT GTGTTTCCAT CGGTATGATC AAAGAAGCTG AGTTGGCACG TTATTTGGGT GTTTTGTCAC CTGTAGCAGT CGCTAGATTG TCGAAGTGTC TTGTAGCTTA TGGTTTACCT GTTTCTATCG ACGAGAAAGA CTTTTTGAAG AAGGTGGGAA ACAAACGTCA CAATGTGGAA ATTGACATCT TGTTGAAGAA GATGGCTATT GACAAGAAGA ATGACGGTAG TAAGATTAGG TGTGTCATCC TCGAAGCCAT CGGAAAGTGC TACCAGTTGA AGGCTCACCA AGTCTCCAAA CAAGACTTGA GCTTTGTTCT CACAGATGAA GTATTGGTAC ATCCATTTGA TGACAAATTG ATTCCTAAGA CTAACGTCGT CATTCCTCCA GGCTCCAAGT CCATCTCTAA CAGAGCGTTG GTGTTAGCTG CCTTGGGTAC TGGTACTGTC AGAATCAAGA ACTTGTTGCA CTCTGACGAT ACCAAACACA TGTTGGAGGC AGTTGCTTCT TTGAAGGGAG CTTCTATCTC TACCGAAGAC AATGGTGAAA CCATTGTCGT CACTGGAAAT GGTGGAAAGC TCGTATCTTG CGACGAGCAA TTGTACTTAG GTAACGCCGG TACAGCCTCC AGATTCTTGA CTTCTGTTGC ACCTCTCGTA GGTATTAACC CTCAATCTGG TGAACATGTA GTTTTAACAG GTAATGCCAG AATGCAGGAA AGGCCAATTG GACCTTTGGT GGACGCTTTG AGAGCCAATG GCTCGGAAAT CGACTACCTC AACAAGGAAG GATCGTTACC ATTGAAGGTT AAGGCAGGTA AGGGCTTGAA CGGTGGAAGA ATAGAGTTAG CTGCTACTAT TTCGTCACAA TACGTTTCTT CCATCTTGAT GTGTGCTCCT TACGCCAATG AGCCGGTTAC TTTGTCGCTC GTAGGAGGTA AGCCAATCTC GCAGTTGTAC ATCAACATGA CGATTGCCAT GATGAAGACG TTTGGGATTG TCGTGACCAA GTCCGAAACT GAAGAACACA CTTACCACAT TCCCCGTGGA TCTTACGTAA ACCCTAAGGA ATACGTTATC GAATCAGATG CTTCTTCAGC TACTTACCCA TTGGCTTTTG CTGCCTTGAC AGGAACATCT TGTACGATTC CAAACATCGG TTCTTCTTCT TTACAGGGAG ACGCAAGGTT TGCTGTTGAT GTTTTGAGAC CTATGGGCTG TGAAGTGGTT CAGACTGCTA CTTCCACTAC TGTCACTGGT CCATCTGTAG GAAACTTGAA GCCTTTGCCC CATGTAGATA TGGAGCCAAT GACGGATGCT TTCCTTACTG CTTCTGTAGT CGCCGCTGTA GCCAAGAACG GCACGCAGTC TACCTCTATT ACTGGTATCG CCAACCAGAG AGTGAAGGAA TGTAATCGTA TCGCCGCTAT GGTATCTGAA TTGGCCAAAT TTGGTGTTGT AGCCAACGAG TTGCCAGACG GAATTGAAAT CCATGGAATT TCACCAAATG ACTTGGTTAC CCCATCCACA GAAAAGCGTG GAATCAAAAC TTTTGACGAT CACAGAGTGG CCATGTCGTT TTCGCTTTTG GCTGGTTTGT GTAAGGATAA GGTACTCATT CAAGAAAGAT CTTGTACTGG TAAGACCTGG CCGGGATGGT GGGACATCTT ACACACCAAG TTCAAGGTTG CTATCGATGG CTACGAGCTT CCATTACAAC ACGAAGACAG TACTGCCTTG GTTGAAAAAC ATGGTAATGG TAAGAGAAGT ATCATCGTTA TTGGAATGAG AGGCGCTGGG AAGTCCACCT TGTCGAAGTG GATGGCTTCC TTCTTGGGCT TCAAGCTTGT AGACTTGGAC GATGTTCTTG AAGAAAAAAT TGGCACTGAT ATCAGGTCGT TTGTACAACA GCAAGGCTGG GAAGAATTCC GTAAGCAAGA AGCAATTGTA GCTAAAGAAT CTTTCATTAA GTTCTCTGAA GGCTGTGTAT TGTCTACTGG GGGTGGAATT GTAGAAGGCG AAGAGGCTAG AGAATCCTTA AAGAGCTATG TAAAGTCTGG TGGAATTGTC TTACACTTAC ATCGTGATTT GGACGAGACC GTTGTACTTC TTTCTGCGGA CACAACTAGA CCAGCCTACG TGGATGAAAT CAAACAGGTT TGGTTACGTA GAGAAAATTG GTACCGTGAA TGTTCTAACT ATCACTTCTA TTCAGCACAT TGTTCCAGTG ATGCCGAATT CAAGCACTTG CGAAACTCCT TCACTACTTA CATCAAGACC ATCACCGGCT TCCATGTGGC TCAGATTCCT AAGAAGAGGT CCTTCTACAC TAGTTTGACG TTCTCTGACT TGACTGAAGT TGCTTCATCC TTGGAAGACA TCTCCACAGG TTCTGATGCC ATCGAATTAA GAGTTGACCT TTTGAAGGAA ACGACTCATA CTTTCGTTGC TGACCAAACT GCCATCTTGA GAAAGTCGAC TAATCTTCCG ATCATCTACA CTATCAGAAC AGAATCACAA GGAGGAAAGT TCCCCGACAA CAAATTCGAA GAGCTCGAAG AGTTGTTGGC TTTGGGTATT AAGTTAGGTG TGCAATACTT GGACCTTCAG TTAGACTTGC CTAATGACTT GCTTGAAAGA ATTTTGGAAT CGAAGAAGTT CACCAAGATC ATTGCTTCCT ACGTTGATGT TTCTGGCTCA TTAAGATGGG ATAACGTCGA ATGGAAAAAC AGATACAATC AGGGTGTTTC TCTTGGTGCT GACCTTGTCA AGTTGGTTGG AAGAGCCAAT CTGTTCCAAG ATAACTTGAG CTTGGAAGTA TTCAGAGGCA CCAGCACATT GAAGCCTTTA ATTGCCTACA ATGTTGGTGA AAAAGGTAAG TTGTCTAGAG TGTTGAACCC AAGATTGACT CCAGTTACTC ACGCAAAGAT TCCTGCCGAA TCTGGTAACG AAGGAGCATT GGATGTCGCT CAGATCAACA AAGCGTACAC TGACATTGGT GGCTTGTCAG AGAAGCACTT CTGGATTGTC GGCAACCCTG TTGGCCATAG TCGTTCTCCC AACTTGCACA ATGCTGGCTA CAAGAAGTTG AACTTGCCAT ACGTGTTTGA CAGATTTGAG ACATCTGATG CTGGAGAAGC ATTTCAGAAA TTGATCAAAG AAGACAAGAA CTTCGGCGGT TTGGCTGTGA CCATGCCCTT GAAGGTTGAT ATCATGAAAT ACACTGATAA GTTGTCTGAT TCAGCTCAAG TTATTGGCGC TGTTAATACT GTGATTGAAT TGGAGGGCGA ACAAGGAAAG TATTTGGGTG AGAACACCGA CTGGGTTGGT ATTTCAGAGT CTTTTGTTAG GGATGGAATT CCCAACCTTG AAAACGTCAA TGTCAACGGT TTGGTTGTCG GTGGTGGAGG CACTTCTCGT GCTGCTGTCT ACGCTTTGCA CCAATTAGGT TGCAAAAAGA TCTACATGCT CAATCGTACG GTTTCCAAGA TTCAAGAGAT TCAGAAGAAC TTCCCTGCTG AGTATAACAT TGAAATTTTG GACAGTGTTG AAGCTGTTGA AGCTGCACAA CCTATCTCGT TGATTGTTTC TTGTATTCCA GCAGACAAGC CAATTGACGA GCAATTGTTG AACAAGCTTG AGAGAGTATT GTACGTTGGA GGTGAAGCCA AGATTGGTGG ATTCACCCCT TCCTTATTAG AAGCTTCCTA CAAGCCCAGA GTTACTCCTA TCATGAAGAT CGCTCTGGAG AAGTATGAGT GGAACGTGAT TCCTGGTGTG GAAATGTTGG TGAACCAAGG CATTACGCAA TTTCAGTTGC ACACTGGTTT TGTTGCCCCC TACGATGTGG TTCACGATGC TGTTGTGAAC CAATGAAGTA CTAGTATATA CAAAAAGTTG ATAAGACCAT ATACAATTAT TCT
|
Protein sequence | MTSVEKVSIL GAETIHVGYG IQDHIVQEVI SHLASSTYVI VTDTNMARTT PFTKLRNKFE SKLKELRPES RLLFYSVSPG ENNKNRETKA AVEDFLLQQG CTRDTVILAV GGGVIGDMIG FVAATFMRGV RVVQVPTSLL AMVDSSVGGK TAIDTPLGKN FVGAFHQPEY VFADVSFLET LPTRQFINGM AEVVKTAAIW NEEEFTRLEK FSKKFLAVVS AKTPDLISIK EELVKTVLES IRVKAFVVSS DEKETGLRNL LNFGHTIGHA IEAVLTPQAL HGECVSIGMI KEAELARYLG VLSPVAVARL SKCLVAYGLP VSIDEKDFLK KVGNKRHNVE IDILLKKMAI DKKNDGSKIR CVILEAIGKC YQLKAHQVSK QDLSFVLTDE VLVHPFDDKL IPKTNVVIPP GSKSISNRAL VLAALGTGTV RIKNLLHSDD TKHMLEAVAS LKGASISTED NGETIVVTGN GGKLVSCDEQ LYLGNAGTAS RFLTSVAPLV GINPQSGEHV VLTGNARMQE RPIGPLVDAL RANGSEIDYL NKEGSLPLKV KAGKGLNGGR IELAATISSQ YVSSILMCAP YANEPVTLSL VGGKPISQLY INMTIAMMKT FGIVVTKSET EEHTYHIPRG SYVNPKEYVI ESDASSATYP LAFAALTGTS CTIPNIGSSS LQGDARFAVD VLRPMGCEVV QTATSTTVTG PSVGNLKPLP HVDMEPMTDA FLTASVVAAV AKNGTQSTSI TGIANQRVKE CNRIAAMVSE LAKFGVVANE LPDGIEIHGI SPNDLVTPST EKRGIKTFDD HRVAMSFSLL AGLCKDKVLI QERSCTGKTW PGWWDILHTK FKVAIDGYEL PLQHEDSTAL VEKHGNGKRS IIVIGMRGAG KSTLSKWMAS FLGFKLVDLD DVLEEKIGTD IRSFVQQQGW EEFRKQEAIV AKESFIKFSE GCVLSTGGGI VEGEEARESL KSYVKSGGIV LHLHRDLDET VVLLSADTTR PAYVDEIKQV WLRRENWYRE CSNYHFYSAH CSSDAEFKHL RNSFTTYIKT ITGFHVAQIP KKRSFYTSLT FSDLTEVASS LEDISTGSDA IELRVDLLKE TTHTFVADQT AILRKSTNLP IIYTIRTESQ GGKFPDNKFE ELEELLALGI KLGVQYLDLQ LDLPNDLLER ILESKKFTKI IASYVDVSGS LRWDNVEWKN RYNQGVSLGA DLVKLVGRAN SFQDNLSLEV FRGTSTLKPL IAYNVGEKGK LSRVLNPRLT PVTHAKIPAE SGNEGALDVA QINKAYTDIG GLSEKHFWIV GNPVGHSRSP NLHNAGYKKL NLPYVFDRFE TSDAGEAFQK LIKEDKNFGG LAVTMPLKVD IMKYTDKLSD SAQVIGAVNT VIELEGEQGK YLGENTDWVG ISESFVRDGI PNLENVNVNG LVVGGGGTSR AAVYALHQLG CKKIYMLNRT VSKIQEIQKN FPAEYNIEIL DSVEAVEAAQ PISLIVSCIP ADKPIDEQLL NKLERVLYVG GEAKIGGFTP SLLEASYKPR VTPIMKIASE KYEWNVIPGV EMLVNQGITQ FQLHTGFVAP YDVVHDAVVN Q
|
| |