Gene PICST_89141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_89141 
Symbol 
ID4838646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp239973 
End bp244735 
Gene Length4763 bp 
Protein Length1571 aa 
Translation table12 
GC content44% 
IMG OID640389961 
Productpredicted protein 
Protein accessionXP_001384008 
Protein GI150864974 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
[COG0169] Shikimate 5-dehydrogenase
[COG0337] 3-dehydroquinate synthetase
[COG0703] Shikimate kinase 
TIGRFAM ID[TIGR00507] shikimate 5-dehydrogenase
[TIGR01093] 3-dehydroquinate dehydratase, type I
[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
[TIGR01357] 3-dehydroquinate synthase
[TIGR01809] shikimate-5-dehydrogenase, fungal AROM-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCTG TCGAGAAGGT ATCGATTCTC GGAGCCGAGA CCATCCATGT GGGATACGGC 
ATCCAGGACC ACATTGTCCA GGAGGTAATT TCACATCTAG CCTCATCTAC CTATGTGATT
GTCACAGACA CCAACATGGC CAGAACTACT CCGTTCACCA AGTTGCGTAA CAAATTCGAA
AGCAAACTTA AAGAATTACG TCCGGAGTCC CGTTTGCTTT TCTATTCAGT TTCACCAGGT
GAAAACAACA AAAACAGAGA AACCAAGGCT GCTGTAGAAG ACTTCTTATT GCAACAGGGA
TGTACCAGAG ACACCGTAAT TCTAGCTGTA GGAGGCGGAG TGATCGGGGA CATGATTGGT
TTTGTAGCTG CAACCTTTAT GAGAGGGGTC AGAGTCGTTC AGGTTCCCAC TAGTTTGTTA
GCCATGGTTG ACTCTTCTGT AGGTGGTAAG ACTGCCATAG ATACTCCATT GGGGAAGAAC
TTTGTAGGAG CATTCCACCA GCCGGAGTAC GTCTTCGCAG ATGTTTCGTT CTTGGAGACC
TTACCAACTA GACAGTTCAT CAACGGAATG GCTGAAGTCG TTAAGACGGC TGCGATCTGG
AACGAAGAGG AATTCACTCG CTTGGAGAAG TTCTCCAAGA AGTTCCTCGC TGTAGTTTCT
GCCAAAACCC CAGATTTAAT CTCAATTAAG GAAGAGTTAG TCAAAACGGT ACTTGAATCT
ATTCGTGTAA AGGCTTTTGT CGTTTCGTCC GATGAGAAGG AAACTGGCTT GAGAAACTTG
CTTAACTTTG GTCATACAAT TGGCCATGCA ATTGAAGCCG TGTTGACACC CCAAGCTCTC
CACGGTGAAT GTGTTTCCAT CGGTATGATC AAAGAAGCTG AGTTGGCACG TTATTTGGGT
GTTTTGTCAC CTGTAGCAGT CGCTAGATTG TCGAAGTGTC TTGTAGCTTA TGGTTTACCT
GTTTCTATCG ACGAGAAAGA CTTTTTGAAG AAGGTGGGAA ACAAACGTCA CAATGTGGAA
ATTGACATCT TGTTGAAGAA GATGGCTATT GACAAGAAGA ATGACGGTAG TAAGATTAGG
TGTGTCATCC TCGAAGCCAT CGGAAAGTGC TACCAGTTGA AGGCTCACCA AGTCTCCAAA
CAAGACTTGA GCTTTGTTCT CACAGATGAA GTATTGGTAC ATCCATTTGA TGACAAATTG
ATTCCTAAGA CTAACGTCGT CATTCCTCCA GGCTCCAAGT CCATCTCTAA CAGAGCGTTG
GTGTTAGCTG CCTTGGGTAC TGGTACTGTC AGAATCAAGA ACTTGTTGCA CTCTGACGAT
ACCAAACACA TGTTGGAGGC AGTTGCTTCT TTGAAGGGAG CTTCTATCTC TACCGAAGAC
AATGGTGAAA CCATTGTCGT CACTGGAAAT GGTGGAAAGC TCGTATCTTG CGACGAGCAA
TTGTACTTAG GTAACGCCGG TACAGCCTCC AGATTCTTGA CTTCTGTTGC ACCTCTCGTA
GGTATTAACC CTCAATCTGG TGAACATGTA GTTTTAACAG GTAATGCCAG AATGCAGGAA
AGGCCAATTG GACCTTTGGT GGACGCTTTG AGAGCCAATG GCTCGGAAAT CGACTACCTC
AACAAGGAAG GATCGTTACC ATTGAAGGTT AAGGCAGGTA AGGGCTTGAA CGGTGGAAGA
ATAGAGTTAG CTGCTACTAT TTCGTCACAA TACGTTTCTT CCATCTTGAT GTGTGCTCCT
TACGCCAATG AGCCGGTTAC TTTGTCGCTC GTAGGAGGTA AGCCAATCTC GCAGTTGTAC
ATCAACATGA CGATTGCCAT GATGAAGACG TTTGGGATTG TCGTGACCAA GTCCGAAACT
GAAGAACACA CTTACCACAT TCCCCGTGGA TCTTACGTAA ACCCTAAGGA ATACGTTATC
GAATCAGATG CTTCTTCAGC TACTTACCCA TTGGCTTTTG CTGCCTTGAC AGGAACATCT
TGTACGATTC CAAACATCGG TTCTTCTTCT TTACAGGGAG ACGCAAGGTT TGCTGTTGAT
GTTTTGAGAC CTATGGGCTG TGAAGTGGTT CAGACTGCTA CTTCCACTAC TGTCACTGGT
CCATCTGTAG GAAACTTGAA GCCTTTGCCC CATGTAGATA TGGAGCCAAT GACGGATGCT
TTCCTTACTG CTTCTGTAGT CGCCGCTGTA GCCAAGAACG GCACGCAGTC TACCTCTATT
ACTGGTATCG CCAACCAGAG AGTGAAGGAA TGTAATCGTA TCGCCGCTAT GGTATCTGAA
TTGGCCAAAT TTGGTGTTGT AGCCAACGAG TTGCCAGACG GAATTGAAAT CCATGGAATT
TCACCAAATG ACTTGGTTAC CCCATCCACA GAAAAGCGTG GAATCAAAAC TTTTGACGAT
CACAGAGTGG CCATGTCGTT TTCGCTTTTG GCTGGTTTGT GTAAGGATAA GGTACTCATT
CAAGAAAGAT CTTGTACTGG TAAGACCTGG CCGGGATGGT GGGACATCTT ACACACCAAG
TTCAAGGTTG CTATCGATGG CTACGAGCTT CCATTACAAC ACGAAGACAG TACTGCCTTG
GTTGAAAAAC ATGGTAATGG TAAGAGAAGT ATCATCGTTA TTGGAATGAG AGGCGCTGGG
AAGTCCACCT TGTCGAAGTG GATGGCTTCC TTCTTGGGCT TCAAGCTTGT AGACTTGGAC
GATGTTCTTG AAGAAAAAAT TGGCACTGAT ATCAGGTCGT TTGTACAACA GCAAGGCTGG
GAAGAATTCC GTAAGCAAGA AGCAATTGTA GCTAAAGAAT CTTTCATTAA GTTCTCTGAA
GGCTGTGTAT TGTCTACTGG GGGTGGAATT GTAGAAGGCG AAGAGGCTAG AGAATCCTTA
AAGAGCTATG TAAAGTCTGG TGGAATTGTC TTACACTTAC ATCGTGATTT GGACGAGACC
GTTGTACTTC TTTCTGCGGA CACAACTAGA CCAGCCTACG TGGATGAAAT CAAACAGGTT
TGGTTACGTA GAGAAAATTG GTACCGTGAA TGTTCTAACT ATCACTTCTA TTCAGCACAT
TGTTCCAGTG ATGCCGAATT CAAGCACTTG CGAAACTCCT TCACTACTTA CATCAAGACC
ATCACCGGCT TCCATGTGGC TCAGATTCCT AAGAAGAGGT CCTTCTACAC TAGTTTGACG
TTCTCTGACT TGACTGAAGT TGCTTCATCC TTGGAAGACA TCTCCACAGG TTCTGATGCC
ATCGAATTAA GAGTTGACCT TTTGAAGGAA ACGACTCATA CTTTCGTTGC TGACCAAACT
GCCATCTTGA GAAAGTCGAC TAATCTTCCG ATCATCTACA CTATCAGAAC AGAATCACAA
GGAGGAAAGT TCCCCGACAA CAAATTCGAA GAGCTCGAAG AGTTGTTGGC TTTGGGTATT
AAGTTAGGTG TGCAATACTT GGACCTTCAG TTAGACTTGC CTAATGACTT GCTTGAAAGA
ATTTTGGAAT CGAAGAAGTT CACCAAGATC ATTGCTTCCT ACGTTGATGT TTCTGGCTCA
TTAAGATGGG ATAACGTCGA ATGGAAAAAC AGATACAATC AGGGTGTTTC TCTTGGTGCT
GACCTTGTCA AGTTGGTTGG AAGAGCCAAT CTGTTCCAAG ATAACTTGAG CTTGGAAGTA
TTCAGAGGCA CCAGCACATT GAAGCCTTTA ATTGCCTACA ATGTTGGTGA AAAAGGTAAG
TTGTCTAGAG TGTTGAACCC AAGATTGACT CCAGTTACTC ACGCAAAGAT TCCTGCCGAA
TCTGGTAACG AAGGAGCATT GGATGTCGCT CAGATCAACA AAGCGTACAC TGACATTGGT
GGCTTGTCAG AGAAGCACTT CTGGATTGTC GGCAACCCTG TTGGCCATAG TCGTTCTCCC
AACTTGCACA ATGCTGGCTA CAAGAAGTTG AACTTGCCAT ACGTGTTTGA CAGATTTGAG
ACATCTGATG CTGGAGAAGC ATTTCAGAAA TTGATCAAAG AAGACAAGAA CTTCGGCGGT
TTGGCTGTGA CCATGCCCTT GAAGGTTGAT ATCATGAAAT ACACTGATAA GTTGTCTGAT
TCAGCTCAAG TTATTGGCGC TGTTAATACT GTGATTGAAT TGGAGGGCGA ACAAGGAAAG
TATTTGGGTG AGAACACCGA CTGGGTTGGT ATTTCAGAGT CTTTTGTTAG GGATGGAATT
CCCAACCTTG AAAACGTCAA TGTCAACGGT TTGGTTGTCG GTGGTGGAGG CACTTCTCGT
GCTGCTGTCT ACGCTTTGCA CCAATTAGGT TGCAAAAAGA TCTACATGCT CAATCGTACG
GTTTCCAAGA TTCAAGAGAT TCAGAAGAAC TTCCCTGCTG AGTATAACAT TGAAATTTTG
GACAGTGTTG AAGCTGTTGA AGCTGCACAA CCTATCTCGT TGATTGTTTC TTGTATTCCA
GCAGACAAGC CAATTGACGA GCAATTGTTG AACAAGCTTG AGAGAGTATT GTACGTTGGA
GGTGAAGCCA AGATTGGTGG ATTCACCCCT TCCTTATTAG AAGCTTCCTA CAAGCCCAGA
GTTACTCCTA TCATGAAGAT CGCTCTGGAG AAGTATGAGT GGAACGTGAT TCCTGGTGTG
GAAATGTTGG TGAACCAAGG CATTACGCAA TTTCAGTTGC ACACTGGTTT TGTTGCCCCC
TACGATGTGG TTCACGATGC TGTTGTGAAC CAATGAAGTA CTAGTATATA CAAAAAGTTG
ATAAGACCAT ATACAATTAT TCT
 
Protein sequence
MTSVEKVSIL GAETIHVGYG IQDHIVQEVI SHLASSTYVI VTDTNMARTT PFTKLRNKFE 
SKLKELRPES RLLFYSVSPG ENNKNRETKA AVEDFLLQQG CTRDTVILAV GGGVIGDMIG
FVAATFMRGV RVVQVPTSLL AMVDSSVGGK TAIDTPLGKN FVGAFHQPEY VFADVSFLET
LPTRQFINGM AEVVKTAAIW NEEEFTRLEK FSKKFLAVVS AKTPDLISIK EELVKTVLES
IRVKAFVVSS DEKETGLRNL LNFGHTIGHA IEAVLTPQAL HGECVSIGMI KEAELARYLG
VLSPVAVARL SKCLVAYGLP VSIDEKDFLK KVGNKRHNVE IDILLKKMAI DKKNDGSKIR
CVILEAIGKC YQLKAHQVSK QDLSFVLTDE VLVHPFDDKL IPKTNVVIPP GSKSISNRAL
VLAALGTGTV RIKNLLHSDD TKHMLEAVAS LKGASISTED NGETIVVTGN GGKLVSCDEQ
LYLGNAGTAS RFLTSVAPLV GINPQSGEHV VLTGNARMQE RPIGPLVDAL RANGSEIDYL
NKEGSLPLKV KAGKGLNGGR IELAATISSQ YVSSILMCAP YANEPVTLSL VGGKPISQLY
INMTIAMMKT FGIVVTKSET EEHTYHIPRG SYVNPKEYVI ESDASSATYP LAFAALTGTS
CTIPNIGSSS LQGDARFAVD VLRPMGCEVV QTATSTTVTG PSVGNLKPLP HVDMEPMTDA
FLTASVVAAV AKNGTQSTSI TGIANQRVKE CNRIAAMVSE LAKFGVVANE LPDGIEIHGI
SPNDLVTPST EKRGIKTFDD HRVAMSFSLL AGLCKDKVLI QERSCTGKTW PGWWDILHTK
FKVAIDGYEL PLQHEDSTAL VEKHGNGKRS IIVIGMRGAG KSTLSKWMAS FLGFKLVDLD
DVLEEKIGTD IRSFVQQQGW EEFRKQEAIV AKESFIKFSE GCVLSTGGGI VEGEEARESL
KSYVKSGGIV LHLHRDLDET VVLLSADTTR PAYVDEIKQV WLRRENWYRE CSNYHFYSAH
CSSDAEFKHL RNSFTTYIKT ITGFHVAQIP KKRSFYTSLT FSDLTEVASS LEDISTGSDA
IELRVDLLKE TTHTFVADQT AILRKSTNLP IIYTIRTESQ GGKFPDNKFE ELEELLALGI
KLGVQYLDLQ LDLPNDLLER ILESKKFTKI IASYVDVSGS LRWDNVEWKN RYNQGVSLGA
DLVKLVGRAN SFQDNLSLEV FRGTSTLKPL IAYNVGEKGK LSRVLNPRLT PVTHAKIPAE
SGNEGALDVA QINKAYTDIG GLSEKHFWIV GNPVGHSRSP NLHNAGYKKL NLPYVFDRFE
TSDAGEAFQK LIKEDKNFGG LAVTMPLKVD IMKYTDKLSD SAQVIGAVNT VIELEGEQGK
YLGENTDWVG ISESFVRDGI PNLENVNVNG LVVGGGGTSR AAVYALHQLG CKKIYMLNRT
VSKIQEIQKN FPAEYNIEIL DSVEAVEAAQ PISLIVSCIP ADKPIDEQLL NKLERVLYVG
GEAKIGGFTP SLLEASYKPR VTPIMKIASE KYEWNVIPGV EMLVNQGITQ FQLHTGFVAP
YDVVHDAVVN Q