Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32696 |
Symbol | MUC1.8 |
ID | 4840092 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 426001 |
End bp | 428654 |
Gene Length | 2654 bp |
Protein Length | 805 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391407 |
Product | possible mannoprotein |
Protein accession | XP_001385434 |
Protein GI | 150865989 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.313677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.600944 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCCA GACTATGGGC GGTGAACGCA ACCAAAAAGA AGAAGAACGA GCTCGTTCCA GGGTCACTGC GATGGATCAC GAACGAGATG TCGCTCCAAT ATGACAGTGC CCTCACCAAC TGTGAGGACC ATTTTCAAGA TTGTAGTTCT CGGTATGATA GCCTCTCCAG CTTTGTACAG AAAGTTCTAG ACATCACCAA TAGCACAGAA GATCACACAG AAGACCTATC GCTATTGTTG GCTGTGCTGA ACCCTACCGT TTTCTCACCA GAGTTTTCAA AATCAACTAC TTTTGTGCAG CCTACAGAGC CAGAAGATAT GTACTCATCT AACCATAAGG ATTCTACTCC CAATATTCGA ACTGCCAATT CTAACGATAT CACCATACCC GTCAAGCCTG AAAACATCAA AGAGGATACT GTCATATCCG AATCACGTAA TTCTCTAAAG CAAGCGATCA TCGCACAATC TTCACCAATA CGACATCGTA CTTCTCAGTA TAGACAAACT CCAGTGAAAC CCAGAATAAA CCAACCACCA CGACTAATGC ACGCGTCGCC AAATCAAATT GATATATCCC CACCTCATTC GAATTCCAGT ATGGATCGAA TTGATACTGG TCGCAAATCG CTCAGCTCTT CGTCTCCTCC AAAAGTTGTA CAGCAATCTC TCCCTTCTGT CGAAACACAT CAAACGGAAC ATACATCTAC CGCCAGAAAC ACACTGTTAA TAAATGGCAT AGATGATTCA TTTCAGGCTA TCAGCACAGC TATCAGAAAG TCAATAGCGG GAAAATCTGC ACTTACGATA TCATCGAGCA CTCCTGCAAA AGCTAAGAAA AGTGGAGTAT ACGAAGAATT TGAGACAAAA ATTAATCTAC AACCTGAAGA AGCCAATAGA AGCTCAACTA TTCATTCAAC TGCACTGAGC AATGAAAACA CAAGGAAGAT ATCGGCTTCT ATGAGAAGTT CCATCTTTGT TGGACTTCCT ACAAGAGAGC CTATTACAGT CAATACTAAA TCTAGTAAGC ATTCCTCCAT CAAAAGTAGA TCACTGAGAC TATTTGAAAG ACTAGATATT GCTTCCAGAA GAGATACATC CTCAGAAATT GCTATTGACA ATAAGGTTGC GAACAAGGGC GAAGATAAAG CAGATAAAGA ACATGACGTT GATGCTAATT TTGATATGTC TCCTGTAGCA ATCCCCAAGC GAGCGTTTGC AAATTTCAAT AGCATCAAAA GTCAAGTGGA TATGAAAAAT AATGGTACGA CTAACCTCAA TTCAGTCCCC GATCTAAGCA ACACCACCTC AAAAAGCAAT TTACTTCCAG ACAAGCCAGT AAAAATTCCC AACAAAGAAC ACACCATCCC AAGCAGAGCA GAAACTGAAG AAAATATCAT CAGCGAAATC GGCGTCCCTT CTACAAAAGC TCCGAGAGAG CCAGAAGATG TCAAGTCAAC TTCTGCAACA GTAAGAAAGA CATTGCTTTC TGAACTGCCA TCGATCAGAC CATTTAATTC GTCAACAAAT GGAAGTAGGT CTCCTACCCG AAGTTACCTT TCATTTAAGT ACAAGGGCAG TCCCAGAGAA CCTACTACCA GATCTCGTAG CCGATCTCCT ACACGTAGTG TTGGACCGTT ATCCAAGAAG TCATCTCCAA TACTGAAAAT TCACGATACA ACTGGCTATG ATGTTCCTGA AAATGACGAA AAGGGGTTAA TCTCTCGTCT AACCATTCCG ACAAGTTCAA GTGCAGCAAA GAGAAAGACA CCGACATCTT CAAAAAGAGA AACAGAGAGC AGAAAATCTG AAGGCAGGAA GACTGATTTA ATGAATACAA AGAATAGATT TTTGACTACT ACATTGAACT CCAACAATCC CCAATTTAGT CTCAAAAGAC CACAGTTTCA AGGAAATCAA TCAGTCGCCG CCAAGCCGTT GCACTCCCCA ACTAGAAGAC GAAACCCAGC AATTGAAGAT CTAGAGATTA AAACTGCACC CAAGTTGGAG CCCGATGCTA TACCTATTTT GAAGAAAAAA TCAATGATGG CAGAAAGAAG CGAAGCAGCA GCCCAAAAAC CGAAGCAGAA ATTCACGATT TCGATGAATC ATACTTCTAA GCACAAGGCG GAAATAGTGC CCTTTTCGAA TCAGAAGCAA CTTGGAGACG TCTTCGCCCG AGATGAAGAC ACAAAGGAGA GTCACGACGT CCATCAATTT GAAAAATATA GCTACAGAAA TAATGCTGTC GCTTTGCCTG AAGCAGCAAG GGGAGGTTTT GGCAGTGCAA AAAGAAGAAA GACAAACAAG GAAGATAAAA CACCTTCTAG AACCGGTGCT CTTGCAAAGA AACAATTTCT CGAGAAGAAA GCTAGTAGGA TTTCTGTGGC TGAGAGAAGA ACGCCCCACA AGAATGCTGA TCCTCTTACA CCTGCAAAAT TGCACTACTC AGCTGAAAAC CTACCTGATA TTCCTACTGA TGACGAAGAC GATTCCAACG GAAAAGACCG AAAAATATTA CAAACATGGG GACATACTCC TGAGATCAAA CTGATTATAA TGAAGAACAT AGAGGTGAAT CCAGTTTCAG TATTTGGTGA TGTTCCTCAG CTCAACATGG AGGAGATATT CGATTCCCAT TCTTCCAGAG CTAG
|
Protein sequence | MSSRLWAVNA TKKKKNELVP GSSRWITNEM SLQYDSALTN CEDHFQDCSS RYDSLSSFVQ KVLDITNSTE DHTEDLSLLL AVSNPTVFSP EFSKSTTFVQ PTEPEDMYSS NHKDSTPNIR TANSNDITIP VKPENIKEDT VISESRNSLK QAIIAQSSPI RHRTSQYRQT PVKPRINQPP RLMHASPNQI DISPPHSNSS MDRIDTGRKS LSSSSPPKVV QQSLPSVETH QTEHTSTARN TSLINGIDDS FQAISTAIRK SIAGKSALTI SSSTPAKAKK SGVYEEFETK INLQPEEANR SSTIHSTASS NENTRKISAS MRSSIFVGLP TREPITVNTK SSKHSSIKSR SSRLFERLDI ASRRDTSSEI AIDNKVANKG EDKADKEHDV DANFDMSPVA IPKRAFANFN SIKSQVDMKN NGTTNLNSVP DLSNTTSKSN LLPDKPVKIP NKEHTIPSRA ETEENIISEI GVPSTKAPRE PEDVKSTSAT VRKTLLSESP SIRPFNSSTN GSRSPTRSYL SFKYKGSPRE PTTRSRSRSP TRSVGPLSKK SSPISKIHDT TGYDVPENDE KGLISRLTIP TSSSAAKRKT PTSSKRETES RKSEGRKTDL MNTKNRFLTT TLNSNNPQFS LKRPQFQGNQ SVAAKPLHSP TRRRNPAIED LEIKTAPKLE PDAIPILKKK SMMAERSEAA AQKPKQKFTI SMNHTSKHKA EIVPFSNQKQ LGDVFARDED TKESHDVHQF EKYSYRNNAV ALPEAARGGF GSAKRRKTNK EDKTPSRTGA LAKKQFLEKK ATQHGGDIRF PFFQS
|
| |