Gene Information |
Locus tag | PICST_28950 |
Symbol | MUC1.6 |
ID | 4851690 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2551938 |
End bp | 2558815 |
Gene Length | 6878 bp |
Protein Length | 1978 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393398 |
Product | Mucin-like not chitinase - possible cell wall mannoprotein |
Protein accession | XP_001386826 |
Protein GI | 126275274 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
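The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (an assumption, although it reproduces the listed Gene Length) and that one stop codon is counted with the coding sequence. The function names are hypothetical and do not belong to any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue).

    Counting one stop codon is an assumption made for this sketch.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the record above.
gene_bp = feature_length(2551938, 2558815)
cds_bp = coding_length(1978)

print(gene_bp)           # 6878, matches "Gene Length"
print(cds_bp)            # 5937
print(gene_bp - cds_bp)  # 941 bp not accounted for by coding sequence
```

Under these assumptions the gene span exceeds the coding length by 941 bp, which would be consistent with the "Is gene spliced | Yes" entry, although the record itself does not list intron coordinates.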
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCA TTTCTTTATT GATTCCGATA TTTTGGCTAC TAGCGTTAGC ATCATGTGGT GACTCACTCA TATCCTTCGA ATTGAGGGCT GTTTTTCTTG ATAATCATTC ACCCAACCCT GATGGGTTCT ATATGAACTA CTATGAAGGA ACAGGTGAAT ATCAAACAAG CACAACTGTC TACGGCAAAA GAGAAGACAA AACTGAAATC CAACCATTGC CCACTTCCGA GTCTGCAAAT ATCAATAAAA GGACACCATC TTCGTATGTT TTGTTTGGGT TCGACTTCGA TACAGATGTA GTGGTTTTCA CTTTGGATCC TAGCTCCGGT TACCTTACCT GTAATATTGC AGAGGATCAA TACGTTGTCC TTGACTCATT AATGGGGCTT ACTCTTTCTT CAACTCCACA GGGTGGGTTT TCAACATCTG TATCTGCTTC TGATCAAGTT ACGTATTTGT CATTAGATGG AAGTATCTAT TTTTTTCTGT GTCAATTTGG AACGACGAAC TCTTTCACTA TCAATACATT TAATTCTAAT TCTTTGTACT GTGGAAGTAT TAACATGCAA ATTTATTTGT ATGACGGCGA CGAACCCACT GATACCGATC ATTCAACTAC ATCATCATCT AGTGACATTG CAGCTATTCC AAACCCTTCC ACCTCCGAGA TCAGCGATTC AACTTCTTCT GTAAGCACAA CTTCCATATC GTCAACTGCA TCCACCAGCT CTGGTTCTAC TGGCAGTCCT ACAGCAACTC CAATTCCATT AGTGGAGTCT CTCTGTCTGT TTGAGATGTT TATCAGAGGG GAGAATTTTG ATGATGTTCA ATTGGGTATG GAATTGGTAG ACAATTTCGT AGTGCCAGCA GTCGGAGGAT CTTCCGTATT CGACTACTAC TACGATACTA CAATGAGAGT AGGCTACATT ATGGTCGAAG GATCTTATCC TTCTGGACAA CAAGTGTACG ACGAAGGTCC ATATCTACAA AATCCTGCTC TTAACGGATT CCATCTTGGA TCTGGACCAA TGTACAGCTG GCAAGTTAAT CAAGTAGGAA ATGAATTTGA ATTGGCTTAC CCCTGGCCAT TCTATGCATG TAACTACGGA GATGATTCTA CACCATTTAC GCTTACGGAC AGTGAACAGC TATTGGAGTA CCCTCTATGT CAGCCGGTGA CAGTCATATT AAAATATGGT CAATTGTCAT CTACCTGTCC TCCAACCCCA TCTCTGGTTG TCAGTTCAGT GGAAAGTGAG TCGCACTCTG AGAGTGTGAG CTCAGTGTCG TCGCCTTCTT CGGAACTTCC AACTCTTACT TCAGATGCTC CAACTCCCGG TTGCGTTGAA AAGTTTGACG CCGATCCTCA GGGAATAACT TATGGTGTAA TGGGGTCATT TGCTATTGCC ACGGTCAGTG GTGTACCAAT GTTTGCTGTA TCTACCGATG GAGTTACAGG TGACTTTGCC TATGATCCAG AATCTAGTAA GATATCATAT GCCAACACCT ATTTGTCGAT TAGCGAAAAC CTTTTCATGA CAGCCGATGA CGAATCTCAC GCTGTAGAAG GATGGAGCAT TGTAGAATAC AACGGAGGAC ATATTTTGAG TTTCCTTGGA CTCACAATTT TTCAAGCTTG TTCTATAGAT GAGTCGGATA CTTACGTCAT TAAAGATCCA AGATTCAATC AAGATGGTCT CATTTGCCCA ACCTTCATGA TACAAATACT TGAAGATAGC ATTAACTCTG TATGTCTGCT AGATGCCTCA TCTACGCACT CTGCAATTAG TTCCACACCA 
ACCGAATCAT TTCAGTTGTC GTCCTCTTCG GAACTTCCAA TCATTACGTC AGGTGCTCCA ACTCTTACTT CAAGTGCTCC AACTCTTACT TCAAGTGCTC CAACTCTTAC TTCAGGTGCT CCAACTCCCG GTTGCGTTGA AGAGTTTGAT GCCGATCCTC AGGGAATAAC TTATGGTGTA ATGGGGTCAT TTGCTATTGC CACGGTCAGT GGTGTACCAA TGTTTGCTGT ATCTACCGAT GGAGTTACAG GTGACTTTGC CTATGATCCA GAATCTAGTA AGATATCATA TGCCAACACC TATTTGTCGA TTAGCGAAAA CCTTTTCATG ACAGCCGATG ACGAATCTCA CGCTGTAGAA GGATGGAGCA TTGTAGAATA CAACGGAGGA CATATTTTGA GTTTCCTTGG ACTCACAATT TTTCAAGCTT GTTCTATAGA TGAGTCGGAT ACTTACGTCA TTAAAGATCC AAGATTCAAT CAAGATGGTC TCATTTGCCC AACCTTCATG ATACAAATAC TTGAAGATAG CATTAACTCT GTATGTCTGC TAGATGCCTC ATCTACGCAC TCTGCAATTA GTTCCACACC AACCGAATCA TTTCAGTTGT CGTCCTCTTC GGAACTTCCA ATCATTACGT CAGGTGCTCC AACTCTTACT TCAAGTGCTC CAACTCTTAC TTCAGGTGCT CCAACTCCCG GTTGCGTTGA AGAGTTTGAT GCCGATCCTC AGGGAATAAC TTATGGTGTA ATGGGGTCAT TTGCTATTGC CACGGTCAGT GGTGTACCAA TGTTTGCTGT ATCTACCGAT GGAGTTACAG GTGACTTTGC CTATGATCCA GAATCTAGTA AGATATCATA TGCCAACACC TATTTGTCGA TTAGCGAAAA CCTTTTCATG ACAGCCGATG ACGAATCTCA CGCTGTAGAA GGATGGAGCA TTGTAGAATA CAACGGAGGA CATATTTTAA GTTTTCTTGG ACTCACAATA TTTCAAGCTT GTTCTATAGA TGGGACGGAT ACTTACGTCA TGAAGGACCC AAGATTCAAT CAAGATGGTC TCATTTGCCC AACCTTCATG ATACAAATAC TTGAAGATAG TATTAACTCC ATGTGTGTGG CAAGTCTGGC AAATAGTGAA TACTCTTCCT TAACGAGTTT ATTGAGTTCT AAGTTCAGCA GTATATCGCT GGTTTCAGAT TCAAGGGGGT ACATCTCTTC CTCACTGCAA TGGACTTCAT TACCAGAGTC AACCATAGAA AGCAAGGAGT CTACCGCTTC TTCGTTATCG AAATTGACTT CTATCTCTTC AGTCATTTCA AGCATTCCTA CTCCAAGTAT ATCTACATCA GTTTCTCCTC CTCCGACTGA CGTATTTAAG TTGCTTGCAA TTATAGACGG ACTCTTAGCC TACATTGAAG CACTATTGGT TGAATTAGAA TCGGGCTTGC CAATCTTAGC CAAAAGAGAA GAGGTCTACA TCTTGGGCAT TGATCTTGGC GGGCCGGATG TTGTCTTCAA TTATGATAAC GATACAGGTT ATCTCGCAGC AGACAACGGC TTGTATGTTC AAGCACCAGA TCCTGTAAAA GGAATCTATC TCGGACCAGA TCCAATATCT GGTTGGGGTT ATGACGAAAA CAGTCAATTG ACATTTAATT CACAGTCAGC TTTCTTTAGA TGCCCTTATG GAGACGATGG AGGATTCGTC CTTTCCCCTG TTAATGGTGG AAGCTGTGTA GGGTTGCAAT TGGCAGTCGA ACTTCAAGTT GGAAGTTCGT CATCTAGTAC ACCACTGGCT ACCTCAAGTC 
AAGACCATAC TTCTACTTCT GAGACTTCTA CTAATGAAAA TTATTCTTCA GTAGGACAAA CGGAAAGTCA AAGTATTGAT TCAACACTGG ATACTGGATT GGGTTATTCT AGCTTTTCTA ATAGTACTTC CAGTCAATTA TCTGTTAGAT CGTCTGTGGC AACTTCTATT AGCACCGGTG ACATTGCAGT TAGTATATCA ACTGGTGCAT CAGCTAGTAT ATCTGCTGGT ACATCTGCTG GAAATTACGA TACTTCACTT GGCATGTCTA CGGGCACTTC TCATGGTACT TCGTCTGATA TTAGTACTGA AACCACTCCT AGATCTTCTG CAGATGTTAA TTCTTCTATC GGCAATGATA ATTCTGGTAC TTCTTCAGAG GGAGATTCAT CTATAGTTTC TATTGATTTC TCTTCAGGTT CTCTCCAGAT CACAGATTCA CTTACTCCTT CTTCTATTGT TCCAACTTCT CTGGTTCCAG CTTCTCTGAT TCCAGCTTCT CTGGTTCCAG CTTCTCTGAT TCCAGCTTCT CTGGTTCCAG CTTCTCTGGT TCCAGTAACA AAGACGACGT CGTACAACTT CCAATTAACA GCCATCGCTG CTCCTCCAAA TAAATTCGAG GAATTGGTTC TAGTACAAGA CAGCAGAATG ATATTAGATG TAACTACAGG TTCTGAATTT GAATTGAAGC TTCCTGAAGG TTACCTTACT GTTCAAAGCT TGTACGTTCA TGCAGATGCT ATAGGATTCT ACCTTAGTTC TGAACCTATT GGTGGATTCT CATTCATCGG GGATCCAATT TTGGAATTTA ACGGCCAGCT GGACTTCTAC ATTTGTCCAA CGTTGGACGT CCCATTACAA CTTACGAAAG TTGATCCAAG CTGTATGCTC GGTTCATTGC TGCTTATCTT GGATGGGTCA TCTACTTCAA TTGCACCAGA TGAGAACTCC CAAAGTCAAC TGGTTGCCAA CATAAATACC CTCGACTCTT CAAGTCTGTT GACAACTGCT ACACTTTTAA CAAGCGAAGT TATTACAATC ACAGATTGTC CAAGTACAAT AACTAATTGT CCTCTTGAAT CTATCAGAAC CGTCACCATT CCTAAGACAT TAGTAACAAC CTATTGCCCT GTATCTGGAA GCCAATCGAT GACTATTTCG ATTGAGATTA TTTCGACTGA AATAGTCACA CTTACCAAAT GTCCAAGTTC TGTAACCAAC TGTCCCGTCA AAACCACGGA ATTGACAACA AATCTTCATA CAATGAAAAC GGTCTCAGTT ATGCCTATTG ATGTCTTAAC GACATTTACT TCAGTGGAGA CTTTGGCTTG CACCAAGCAT TGTTTAAAGA GCCAGTTACC GGTGCTCTTA ACTCATACAT TAACTTCACC TTTAGGAAAC AATCCACAGG CAACTGGAGG TTCAGCAGGT AAGGGAAGTT CACCAACTAA TACTGTGAAT ACAGCAGCTG CTGGAACAGG AGTTGCTGGC ATTCACAGTC CTGGTAACAC AATTTCGGAA TTTGAACCGG TCTCTTCGAG TATTTCGACT TCTATGATTG TTCCTAGCGT CTCATTACTT TCTCAAAATT CAAATTCTCC AGAATCAATT TCGACTTCAG TATCAGTATG GACTGTTGAA GCACTTGGAA ATGCTGCTGA TAAGCTTTCT GGAACTGTAG GATCCATTAT CTTGATGATG ATTGCCTATT TCTATTGAAT TCTTCAATTT TATTAGTTAC ATAATTTAGC CTTACATATA CTTCAATTTG CATGTCGTTC TTTAAGATTG TTACACCGTA TAGTCAAAAA 
TAGTTGTGCG AATACATAGT GAATTTTGTA TTAGTAGACA CAGATTATTG TATGAAGAAG TTACTATATA TTGAAGGACC ATGAGCACAA TATTAAAGGG TTAGCGGACT GTATTGTACC ATAAACAAAT GGACGTGCCT CAATTAAATT CCATCCCTTA GCTCCTCTGG AAAGAAGAGA TAAGCAAATA TGACTTCCAG TTGTTGTTCA CAAGGTCAGG ATTACAGAGA ATGTCCTTTC ATTTCGAGGA CGTCTTGGTA AGATAACCAA CGAACTGGGC AAGTCTATGG TCGACCATGG ATACTTGGAC GAATTAAAGC CTCTGGACGG CTTGATTATT CTTCAAGCTT ATTTGAACAG AAGGTCTCAA GAAGAAATCT TAACGAAGCC ACCTCATCTA AGAGAAGAGT TAATGCAAAT CAATTCACTT GTTGGTGATT CTGAAACTTA TTCATTCGAA GGAATGGACG AAGACAGTGA TGACGAAATT ATCAAAATGA TGGTACGCCA GCGAAGAACA GTAGGCAGAC TGGCTTACTT ACTTGAGTCC GTAAATGAAA TTGATAGTCA ATAATCAATT CACGCTAAAG AAAAAGATGC CAAAGATAAG AAGGCTCCTT ATAGAAAGTA CACTCTAGAG CAAGTGGCGG GATTTTCTGA TGATATAACA GCGCAAGTAG GGTCCTCGGT TGCCAAAATA GCTCGTTCTC ATGGAATACA AGAAAACACT GGGCAAAGAT GGGTAAGAGA CTATAAAAAG TCCAAAAAGT TGCCGTTTGA ACTTGCAAGA GGCAGGAAGC AGAAGTCAAA CAAATTGAAT GAAACCACGA GGATTTTCTT AAGTCAAAAC TTGTGGATGA TTGTACATTG TCTCTTGACA TGATGATTTC TGACTTACTG GACCACTTTG AAGGTCTTCG TGTGAGTGAA TCAACTTTGG GTCGATTTCT CAGCGATAAT GTTCATTTCA CGTTCAAAAA GATTAGAAAG GAGCCATTTG CTCGAAATAC AGAGTGGATG ATTGACAAGC GGTATGATTA TGTTAGCCTT ATTAACAGAT CTGATATCGA CTAGTACAAC AACTGTATTT TCATAGACGA GGCAGGTTTT CAAATTGATA TGTCTCCTCT GTATGGTTGG GCTTCATCTA GTGTTACTCC CGTTTCTAGA GTAACTGGCA AATCGGAAAA CAGAACTGTC ATGGGTGCTG TTAACTCGAA GGGAATTGTA CAACTCAACT TGAAAAAGCC ATTCAGGAAC GTCGCAAAGA AGAGGAAAAC TGCCTCAGCC AGAAGAGATC AAGCTAATGA AGATAGAGAT GAATCAGTAG GTACTACAAC AGGGCATTTC AAGTATTTTG TTCTTGATGT CCTTCTGACA TTGGATCTCT ACCCTGAGTA TAGGAACTGC TACCTCATCA TGGACAATGC AAGTGTTCAC AAAAATCACT CGATTTAG
|
Protein sequence | MKFISLLIPI FWLLALASCG DSLISFELRA VFLDNHSPNP DGFYMNYYEG TGEYQTSTTV YGKREDKTEI QPLPTSESAN INKRTPSSYV LFGFDFDTDV VVFTLDPSSG YLTCNIAEDQ YVVLDSLMGL TLSSTPQGGF STSVSASDQV TYLSLDGSIY FFLCQFGTTN SFTINTFNSN SLYCGSINMQ IYLYDGDEPT DTDHSTTSSS SDIAAIPNPS TSEISDSTSS VSTTSISSTA STSSGSTGSP TATPIPLVES LCLFEMFIRG ENFDDVQLGM ELVDNFVVPA VGGSSVFDYY YDTTMRVGYI MVEGSYPSGQ QVYDEGPYLQ NPALNGFHLG SGPMYSWQVN QVGNEFELAY PWPFYACNYG DDSTPFTLTD SEQLLEYPLC QPVTVILKYG QLSSTCPPTP SLVVSSVESE SHSESVSSVS SPSSELPTLT SDAPTPGCVE KFDADPQGIT YGVMGSFAIA TVSGVPMFAV STDGVTGDFA YDPESSKISY ANTYLSISEN LFMTADDESH AVEGWSIVEY NGGHILSFLG LTIFQACSID ESDTYVIKDP RFNQDGLICP TFMIQILEDS INSVCLLDAS STHSAISSTP TESFQLSSSS ELPIITSGAP TLTSSAPTLT SSAPTLTSGA PTPGCVEEFD ADPQGITYGV MGSFAIATVS GVPMFAVSTD GVTGDFAYDP ESSKISYANT YLSISENLFM TADDESHAVE GWSIVEYNGG HILSFLGLTI FQACSIDESD TYVIKDPRFN QDGLICPTFM IQILEDSINS VCLLDASSTH SAISSTPTES FQLSSSSELP IITSGAPTLT SSAPTLTSGA PTPGCVEEFD ADPQGITYGV MGSFAIATVS GVPMFAVSTD GVTGDFAYDP ESSKISYANT YLSISENLFM TADDESHAVE GWSIVEYNGG HILSFLGLTI FQACSIDGTD TYVMKDPRFN QDGLICPTFM IQILEDSINS MCVASLANSE YSSLTSLLSS KFSSISLVSD SRGYISSSLQ WTSLPESTIE SKESTASSLS KLTSISSVIS SIPTPSISTS VSPPPTDVFK LLAIIDGLLA YIEALLVELE SGLPILAKRE EVYILGIDLG GPDVVFNYDN DTGYLAADNG LYVQAPDPVK GIYLGPDPIS GWGYDENSQL TFNSQSAFFR CPYGDDGGFV LSPVNGGSCV GLQLAVELQV GSSSSSTPLA TSSQDHTSTS ETSTNENYSS VGQTESQSID STLDTGLGYS SFSNSTSSQL SVRSSVATSI STGDIAVSIS TGASASISAG TSAGNYDTSL GMSTGTSHGT SSDISTETTP RSSADVNSSI GNDNSGTSSE GDSSIVSIDF SSGSLQITDS LTPSSIVPTS LVPASLIPAS LVPASLIPAS LVPASLVPVT KTTSYNFQLT AIAAPPNKFE ELVLVQDSRM ILDVTTGSEF ELKLPEGYLT VQSLYVHADA IGFYLSSEPI GGFSFIGDPI LEFNGQLDFY ICPTLDVPLQ LTKVDPSCML GSLLLILDGS STSIAPDENS QSQLVANINT LDSSSLLTTA TLLTSEVITI TDCPSTITNC PLESIRTVTI PKTLVTTYCP VSGSQSMTIS IEIISTEIVT LTKCPSSVTN CPVKTTELTT NLHTMKTVSV MPIDVLTTFT SVETLACTKH CLKSQLPVLL THTLTSPLGN NPQATGGSAG KGSSPTNTVN TAAAGTGVAG IHSPGNTISE FEPVSSSIST SMIVPSVSLL SQNSNSPESI STSVSVWTVE ALGNAADKLS GTITNELGKS MVDHGYLDEL KPLDGLIILQ AYLNRRSQEE 
ILTKPPHLRE ELMQINSLVG DSETYSFEGM DEDSDDEIIK MMLVLMEYKK TLGKDGSSYE AGFQIDMSPL YGWASSSVTP VSRVTGKSEN RTVMGAVNSK GIVQLNLKKP FRNVAKKRKT ASARRDQANE DRDESVGTTT GHFKYFVLDV LLTLDLYPEY RNCYLIMDNA SVHKNHSI
|
| |
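The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The sketch below is one way to do that, shown on the first three blocks of the gene sequence only. The helper names are hypothetical, and the GC fraction of this short excerpt is not expected to match the whole-gene value in the "GC content" field.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence in the record above.
excerpt = "ATGAAGTTCA TTTCTTTATT GATTCCGATA"

seq = clean_sequence(excerpt)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 3))  # 0.267
```

Applying the same two helpers to the full gene sequence and the full protein sequence would give values to compare against the "Gene Length", "GC content", and "Protein Length" fields.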