Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33358 |
Symbol | |
ID | 4840528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 407631 |
End bp | 412271 |
Gene Length | 4641 bp |
Protein Length | 1546 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391843 |
Product | predicted protein |
Protein accession | XP_001386280 |
Protein GI | 150866621 |
COG category | [R] General function prediction only |
COG ID | [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.176236 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGACT CCACAGAAGC TCTTAACTCT ATAGCGTTCG CTGTGGATAC GACTTTGTCG CTGATTCTAC CGTCTTCTTT GGCTCCTCCA TCTGCTCCGC CAGCAACGTC TCTGTTTTTA AAGTCCATCT GGTACGCATT CTGGTGGTTA TGGCTGATGG TAGTTTTCAA GATCATGAAT ATTATATTGC TTTATATTCC CAGCAAGATC ATGAATGCGC TTTCAATCAA CTTCGAGATT ACATTGAACC TTCTGTCGAT TCTTGTCGCA CTCCTGGCTA TCATTACTGT CTGTTTTCTT GTGGTAAGAT ACAAATATCT CACTGGTTAT TCCAAAGACA CGAGTGACCG GAAAACAAAA GGCAAAGTCA ACCCATCAGC ATCCAACATC AACTTGAAGA ACCAGAGCCT AGACTATGTA GACCAGAAGA AGGGCCACCG TAGAACGACA AACTATTTGG ACGAGTTCCT CTCTGCCATC AAAGTTTTTG GCTACTTGGA AAAGTCGGTC TTCCACGAAT TAACCAAAAA TATGACAACC CAAAAGTTGT CACATGACGA GATCCTCTAC CTTGATGAAA AGCTTGGATT CTCCATCGTT GTAGAAGGAG TAATGCAGGT TTATACTAGA ATCACCGAGA ACTCCAATGT AAGCGGCGGC TTTGATCCGG ACGACGACAA CGAGCTCAAC TACGAAAAGG ACGATGTGCT AATAATCGGA AATCAACGCT ACCAGTTACT AAATGAAGTC AAAAGTGGTG CTCCCTTGAG TTCTTTGGTA AGTGTTCTAG ACTTGTTGAA ACCGGTAGAT AGTGAAGATT CAACTTCAGA TATGCTTCAT TCATTTAATA TTTCTGACGA TGACAACATC TCCAAGATTC CAGAGATCTC ACCCATCTCT CTTCCATTTC AAGGAATTCA TGCCGGCGCG AAAGATGATT CAGTGCCTCC TTCTCCCATC ATCAGGCCTT CTAAAACGAA GCAGCTCTAT CCAGAAATAG TTGCCAGACC CAAAAGTCGG CCGCACAAAG AACACACCGG CCATCATCTT CATCATGTTC ACCATTCAGG TGCCACCATT GCAATCATAC CGTATCTGGC TTTTCAGAGG GTACAATCCA AGTATCCGAA AGCCACCTCT CACATCGTCA CCATGGTTCT AGCCAGATTG TACAAGGTTA CATTCAACAC TATTCACGAC TATTTGGGTC TAACCAAAGA GATCATGGAG AGTGAGGTAA AATTGAACAC TACTAGTAGT GTCAGGGGTG CTAACTTGCC AGGATACTTA TTTGATGGTC TTCTTGATAA GATATATGGT GCCAACGGAA TTAATGAAGC AAGCCTTTCC AGAAAGTCAG ACTTCCAAAG AGCACTGTCT GTAAATCTCA ACAAACAGAA GACTACTTTA TGCGACGCTA AGTCTTCGAG GTATGTTCTT CTAGATTCCC GGTTGAAATC GACTCACCCA GGAGACTTGT TGTCGTCTGT GCCATTGTCT AGAAGATCAG ACTATTACCA AACACACAGC CATCCTCTTT CTGCAGACGA TCCTTTAGTT AGGTCTGCAT TCCCAAGTTC AAAGACATTA TCATCACTTT CATCACCAAC TTCCAACTTA AAGAGAGCCA GTTCAAACCT AAAATTTGAG AATATTCGTG ACAGATCATT CTCTGACGAC CGTGAAGAAA CGGAAGAAAC GTCGCTCAGA ATAGCTGTTG TCGAGAGCAT CTTCAAAATT TTGGGTATTA GCGAGAAGAG CACAGCGATG AGGAATCTTT CCAGTTTCAA TTCAGGAAGG TCTTCTGTTT CTTCTTCAAT TGTAGGCTTG TCCAACCTTA TGAGCGCCGA TGACAAATTC GATACCAATG CAGCACGGGT AAGGTTTGAT TCGTATAATG GATTCTCTTC CACAGCCACT TCAATCAGTA GAAGCTCCAC ACCCATCAAA TTTTACAATA CCATTAACCA AAACCAATTG CACAACCATC ACATGGGTGA CTCTGTCAGT GGGATCAACA TCTCCACATT AAGTAGACAA CATCAAGCAA ACCGAAACTC TTCTCCAACA GAGTTCAACT TTGCCAACGT CAAGAGTGAC TTCGCCAAAT GTTTGGAAAT CAAATCATAT GGTCCAAATA CGACTATCGT AGAACAGGGA TCGTTTAATT CCGGCTTGTA TTATGTCATT GATGGCTCAT TGGATGTGTT GTATCGTCCA AGTAACCATG GTGAACCCAG TTCAAACCGT GAAGACAATT TGAAAAAGCT CTATTCAGTG AAGAGTGGAG GTGTAGCTGG CTATTTATCC TCTGTTGTAG GCTTTAGATC GCTAGTGACA ATAAGAACTT CTAAGAAAAG AGGAGTTATC GTTGCACATA TATCCAAGTC AGACTACTCC AAGCTAATGG ATAGGTACTA TTTCTTGCAA TTGCCAGTTG CTACCAAGTT GAAGAAGTTG TTATCACCAC AGATTCTAAC CATTGACTAC GCCTTAGAAT GGTGTCATAT TCCTGCTGGA GGCGTCTTGT GCTCACAGGG GGATTTAGCC AATGGATTTC ATATAGTACT TAGTGGCAGA TTCAGAGTGG TCAGAAACAA GAGTGACAGG TATCAAGGTA ACACTTCAGA TGATGATATT CTTGGATTTT CTGATACTTC TATGGATTGT AGCCCCAGTA GTGACATAAA TAACGAGGAC CTTGAGGTGT TGGGTGAATA TGGCCATGGT GAATCTATTG GAGAAGTTGA GGTATTGACA GCTTCTAGAA GAACCAACTC ATTGATTGCA GTTAGGGATT CAGAGACAGC CAGAATTCCT AGAACTCTTT TTGAGATGCT TTCATTGCTG AATCCTTCCA TTATGGTGAA GGTTTCCAGA ATTGTAGCCA GTAAAGTTGT TTATAAGGAT GTGCTTGACC AGTCGTCTCG CAATTCTACA CTCATTCCAT CTTCCACAGC CTCTCATATC TCCAATGACT ACAAAACAAT TACAATATTG CCTACAGTAA GTGGCCTACC AGTCAGAGAG TTTGCTGATA AGTTGGTTTC TGCATTGAAA GCGATTGGCA GGAATGTCAT TGCTTTGGAT CAGGCTTCTA CATTGACCCA CCTAGGCAGA CATGCCTTTG ATGAGCGTTT GGCACAGTTG AAATTGTCTG GTTACTTTGC CTATTTAGAA GAAGAGTATG AAACCATCGT CTATATCTGT GATACGCCCT TGAAGTCAAA CTGGACTTCC ACGTGTATTT CTCAGGGAGA TTGTATCTTG TTACTTGCCG ATGCAGAGGA CGATGTAGTA GCCACTGGTA TTGGGGACTA CGAGCGCTTG TTAATTAACT TGAAAACGAT GGCTAGGACG GATTTGTGCT TGCTTCACCC AGAGAAATAC GTTGAGCCTG GCTCCACAAG TATCTGGTTG AAGAATCGAA TTTGGGTCCA AGGTCACCAT CATATTGAAA TGGAAATCAT CCGTAAAAAA GACGAAAATT CTGTGAAAAA GAGACCAAAT ATTATCAGTG AATTGGCGTC TAAGATAGGG AGCAAGACGA ATCCCAGTAT TAAGTCTACT CTTGAAGATG TGAGACTTAA AGCAATTTCT TCCTTTGTGA AGTTGAACAC TAGTTTTGTA CATTCTGATA GATATAAGGC TGTGCAACCA CATAAAAACG ATTTTCTTCG TTTGGCTAGA ATTCTCTCCA ATGAAGCTGT TGGATTGGTT CTTGGTGGCG GTGGATCTCG TGGAATATCC CATGTTGGTA TAGTGACCGC ATTAGAAAGA CATGGCATCC CTGTGGATTT GATAGGCGGA ACTTCTATAG GTTCCCTTGT AGGTGGGCTC TATGCCAAAG ACTACAACAT AGTGTCAATC TATGGTAGAG CAAAAAAGTT TTCTAAAAGA GTTGCTTCGT TGTGGAGGTC TGTATTCGAT TTAACATATC CAGTAACGTC ATACATCACT GGGTACGAGT TCAACAGAGG AATTTGGAAA ATCTTTGGGT TTACGGAAAT TGAAGATTTC TGGATCAGAT ATTTTTGTAA CACTACCAAC ATCACCAACT CTACAATGGA TATCCATGAA AGTGGTTATC TGTGGAGATT CATTAGAGCA TCGATGTCGT TAGCGGGTCT TTTACCTCCA ATTGCCTTTC AAGGTTGCAT GCTATTGGAT GGCGGTTACT TGGATAACTT GCCTGTCAGT GAGATGAAGA AGAAAGGTGC CAAATACATC ATTGCGGTTG ATGTAGGCTC TGCTGATGAC AGAACACCCA TGAATTATGG TGATACATTG TCTGGATTCT GGGTGTTATT CAATCGTTGG AACCCATTTT CTAAGCATCC TAATGTTCCC AACATGATGG ATATTCAAAT GAGATTGGCC TATGTAGCAA GTGTCAACGC ATTAGAGGCT GCGAAAAAGA CCAATGGGGT GATTTACTTG AGACCTCCTA TTGATAACTA CGCCACGTTG GATTTTGCCA AATTCGACGA GATTTACCAT GTCGGCTTAA ATTACGCAGA CAAACTCTTT TCTAGCTGGT CTAAGAATGG CAAACTTCCA GCAATAGCAG GCATGGTGGA TAAGGCGAAG ATCAAAAGTG GAGATGACAA GAAAGTGTTA TACAGACGGA ACTCTATCTG A
|
Protein sequence | MKDSTEALNS IAFAVDTTLS SILPSSLAPP SAPPATSSFL KSIWYAFWWL WSMVVFKIMN IILLYIPSKI MNALSINFEI TLNLSSILVA LSAIITVCFL VVRYKYLTGY SKDTSDRKTK GKVNPSASNI NLKNQSLDYV DQKKGHRRTT NYLDEFLSAI KVFGYLEKSV FHELTKNMTT QKLSHDEILY LDEKLGFSIV VEGVMQVYTR ITENSNVSGG FDPDDDNELN YEKDDVLIIG NQRYQLLNEV KSGAPLSSLV SVLDLLKPVD SEDSTSDMLH SFNISDDDNI SKIPEISPIS LPFQGIHAGA KDDSVPPSPI IRPSKTKQLY PEIVARPKSR PHKEHTGHHL HHVHHSGATI AIIPYSAFQR VQSKYPKATS HIVTMVLARL YKVTFNTIHD YLGLTKEIME SEVKLNTTSS VRGANLPGYL FDGLLDKIYG ANGINEASLS RKSDFQRASS VNLNKQKTTL CDAKSSRYVL LDSRLKSTHP GDLLSSVPLS RRSDYYQTHS HPLSADDPLV RSAFPSSKTL SSLSSPTSNL KRASSNLKFE NIRDRSFSDD REETEETSLR IAVVESIFKI LGISEKSTAM RNLSSFNSGR SSVSSSIVGL SNLMSADDKF DTNAARVRFD SYNGFSSTAT SISRSSTPIK FYNTINQNQL HNHHMGDSVS GINISTLSRQ HQANRNSSPT EFNFANVKSD FAKCLEIKSY GPNTTIVEQG SFNSGLYYVI DGSLDVLYRP SNHGEPSSNR EDNLKKLYSV KSGGVAGYLS SVVGFRSLVT IRTSKKRGVI VAHISKSDYS KLMDRYYFLQ LPVATKLKKL LSPQILTIDY ALEWCHIPAG GVLCSQGDLA NGFHIVLSGR FRVVRNKSDR YQGNTSDDDI LGFSDTSMDC SPSSDINNED LEVLGEYGHG ESIGEVEVLT ASRRTNSLIA VRDSETARIP RTLFEMLSLS NPSIMVKVSR IVASKVVYKD VLDQSSRNST LIPSSTASHI SNDYKTITIL PTVSGLPVRE FADKLVSALK AIGRNVIALD QASTLTHLGR HAFDERLAQL KLSGYFAYLE EEYETIVYIC DTPLKSNWTS TCISQGDCIL LLADAEDDVV ATGIGDYERL LINLKTMART DLCLLHPEKY VEPGSTSIWL KNRIWVQGHH HIEMEIIRKK DENSVKKRPN IISELASKIG SKTNPSIKST LEDVRLKAIS SFVKLNTSFV HSDRYKAVQP HKNDFLRLAR ILSNEAVGLV LGGGGSRGIS HVGIVTALER HGIPVDLIGG TSIGSLVGGL YAKDYNIVSI YGRAKKFSKR VASLWRSVFD LTYPVTSYIT GYEFNRGIWK IFGFTEIEDF WIRYFCNTTN ITNSTMDIHE SGYSWRFIRA SMSLAGLLPP IAFQGCMLLD GGYLDNLPVS EMKKKGAKYI IAVDVGSADD RTPMNYGDTL SGFWVLFNRW NPFSKHPNVP NMMDIQMRLA YVASVNALEA AKKTNGVIYL RPPIDNYATL DFAKFDEIYH VGLNYADKLF SSWSKNGKLP AIAGMVDKAK IKSGDDKKVL YRRNSI
|
| |