Gene PICST_32696 details

Gene Information

Locus tag: PICST_32696
Symbol: MUC1.8
ID: 4840092
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 426001
End bp: 428654
Gene Length: 2654 bp
Protein Length: 805 aa
Translation table: 12
GC content: 42%
IMG OID: 640391407
Product: possible mannoprotein
Protein accession: XP_001385434
Protein GI: 150865989
COG category:
COG ID:
TIGRFAM ID:
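
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, that the gene span runs from the start codon through the stop codon, and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def noncoding_within_gene(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases inside the gene span that are not part of the coding sequence.

    Assumes the span covers start codon through stop codon, so the coding
    part is (protein length + 1 stop codon) * 3 bases.
    """
    coding_bp = (protein_len_aa + 1) * 3
    return gene_len_bp - coding_bp


gene_len = feature_length(426001, 428654)
print(gene_len)                              # 2654, matching "Gene Length"
print(noncoding_within_gene(gene_len, 805))  # 236
```

Under these assumptions, 236 bp of the span is non-coding, which agrees with the record marking the gene as spliced.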


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.313677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.600944
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCCA GACTATGGGC GGTGAACGCA ACCAAAAAGA AGAAGAACGA GCTCGTTCCA 
GGGTCACTGC GATGGATCAC GAACGAGATG TCGCTCCAAT ATGACAGTGC CCTCACCAAC
TGTGAGGACC ATTTTCAAGA TTGTAGTTCT CGGTATGATA GCCTCTCCAG CTTTGTACAG
AAAGTTCTAG ACATCACCAA TAGCACAGAA GATCACACAG AAGACCTATC GCTATTGTTG
GCTGTGCTGA ACCCTACCGT TTTCTCACCA GAGTTTTCAA AATCAACTAC TTTTGTGCAG
CCTACAGAGC CAGAAGATAT GTACTCATCT AACCATAAGG ATTCTACTCC CAATATTCGA
ACTGCCAATT CTAACGATAT CACCATACCC GTCAAGCCTG AAAACATCAA AGAGGATACT
GTCATATCCG AATCACGTAA TTCTCTAAAG CAAGCGATCA TCGCACAATC TTCACCAATA
CGACATCGTA CTTCTCAGTA TAGACAAACT CCAGTGAAAC CCAGAATAAA CCAACCACCA
CGACTAATGC ACGCGTCGCC AAATCAAATT GATATATCCC CACCTCATTC GAATTCCAGT
ATGGATCGAA TTGATACTGG TCGCAAATCG CTCAGCTCTT CGTCTCCTCC AAAAGTTGTA
CAGCAATCTC TCCCTTCTGT CGAAACACAT CAAACGGAAC ATACATCTAC CGCCAGAAAC
ACACTGTTAA TAAATGGCAT AGATGATTCA TTTCAGGCTA TCAGCACAGC TATCAGAAAG
TCAATAGCGG GAAAATCTGC ACTTACGATA TCATCGAGCA CTCCTGCAAA AGCTAAGAAA
AGTGGAGTAT ACGAAGAATT TGAGACAAAA ATTAATCTAC AACCTGAAGA AGCCAATAGA
AGCTCAACTA TTCATTCAAC TGCACTGAGC AATGAAAACA CAAGGAAGAT ATCGGCTTCT
ATGAGAAGTT CCATCTTTGT TGGACTTCCT ACAAGAGAGC CTATTACAGT CAATACTAAA
TCTAGTAAGC ATTCCTCCAT CAAAAGTAGA TCACTGAGAC TATTTGAAAG ACTAGATATT
GCTTCCAGAA GAGATACATC CTCAGAAATT GCTATTGACA ATAAGGTTGC GAACAAGGGC
GAAGATAAAG CAGATAAAGA ACATGACGTT GATGCTAATT TTGATATGTC TCCTGTAGCA
ATCCCCAAGC GAGCGTTTGC AAATTTCAAT AGCATCAAAA GTCAAGTGGA TATGAAAAAT
AATGGTACGA CTAACCTCAA TTCAGTCCCC GATCTAAGCA ACACCACCTC AAAAAGCAAT
TTACTTCCAG ACAAGCCAGT AAAAATTCCC AACAAAGAAC ACACCATCCC AAGCAGAGCA
GAAACTGAAG AAAATATCAT CAGCGAAATC GGCGTCCCTT CTACAAAAGC TCCGAGAGAG
CCAGAAGATG TCAAGTCAAC TTCTGCAACA GTAAGAAAGA CATTGCTTTC TGAACTGCCA
TCGATCAGAC CATTTAATTC GTCAACAAAT GGAAGTAGGT CTCCTACCCG AAGTTACCTT
TCATTTAAGT ACAAGGGCAG TCCCAGAGAA CCTACTACCA GATCTCGTAG CCGATCTCCT
ACACGTAGTG TTGGACCGTT ATCCAAGAAG TCATCTCCAA TACTGAAAAT TCACGATACA
ACTGGCTATG ATGTTCCTGA AAATGACGAA AAGGGGTTAA TCTCTCGTCT AACCATTCCG
ACAAGTTCAA GTGCAGCAAA GAGAAAGACA CCGACATCTT CAAAAAGAGA AACAGAGAGC
AGAAAATCTG AAGGCAGGAA GACTGATTTA ATGAATACAA AGAATAGATT TTTGACTACT
ACATTGAACT CCAACAATCC CCAATTTAGT CTCAAAAGAC CACAGTTTCA AGGAAATCAA
TCAGTCGCCG CCAAGCCGTT GCACTCCCCA ACTAGAAGAC GAAACCCAGC AATTGAAGAT
CTAGAGATTA AAACTGCACC CAAGTTGGAG CCCGATGCTA TACCTATTTT GAAGAAAAAA
TCAATGATGG CAGAAAGAAG CGAAGCAGCA GCCCAAAAAC CGAAGCAGAA ATTCACGATT
TCGATGAATC ATACTTCTAA GCACAAGGCG GAAATAGTGC CCTTTTCGAA TCAGAAGCAA
CTTGGAGACG TCTTCGCCCG AGATGAAGAC ACAAAGGAGA GTCACGACGT CCATCAATTT
GAAAAATATA GCTACAGAAA TAATGCTGTC GCTTTGCCTG AAGCAGCAAG GGGAGGTTTT
GGCAGTGCAA AAAGAAGAAA GACAAACAAG GAAGATAAAA CACCTTCTAG AACCGGTGCT
CTTGCAAAGA AACAATTTCT CGAGAAGAAA GCTAGTAGGA TTTCTGTGGC TGAGAGAAGA
ACGCCCCACA AGAATGCTGA TCCTCTTACA CCTGCAAAAT TGCACTACTC AGCTGAAAAC
CTACCTGATA TTCCTACTGA TGACGAAGAC GATTCCAACG GAAAAGACCG AAAAATATTA
CAAACATGGG GACATACTCC TGAGATCAAA CTGATTATAA TGAAGAACAT AGAGGTGAAT
CCAGTTTCAG TATTTGGTGA TGTTCCTCAG CTCAACATGG AGGAGATATT CGATTCCCAT
TCTTCCAGAG CTAG
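
The gene sequence above is printed in groups of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC fraction such as the "GC content" field could be recomputed. The helper names are hypothetical, and only the first printed line of the sequence is used as sample input; the database's own method and rounding are not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(sequence)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First printed line of the gene sequence, used here only as sample input.
first_line = "ATGAGTTCCA GACTATGGGC GGTGAACGCA ACCAAAAAGA AGAAGAACGA GCTCGTTCCA"
print(len(clean_sequence(first_line)))    # 60
print(round(gc_fraction(first_line), 3))  # 0.483
```

Passing the full 2654 bp listing to `gc_fraction` would give the gene-wide value to compare with the record.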
 
Protein sequence
MSSRLWAVNA TKKKKNELVP GSSRWITNEM SLQYDSALTN CEDHFQDCSS RYDSLSSFVQ 
KVLDITNSTE DHTEDLSLLL AVSNPTVFSP EFSKSTTFVQ PTEPEDMYSS NHKDSTPNIR
TANSNDITIP VKPENIKEDT VISESRNSLK QAIIAQSSPI RHRTSQYRQT PVKPRINQPP
RLMHASPNQI DISPPHSNSS MDRIDTGRKS LSSSSPPKVV QQSLPSVETH QTEHTSTARN
TSLINGIDDS FQAISTAIRK SIAGKSALTI SSSTPAKAKK SGVYEEFETK INLQPEEANR
SSTIHSTASS NENTRKISAS MRSSIFVGLP TREPITVNTK SSKHSSIKSR SSRLFERLDI
ASRRDTSSEI AIDNKVANKG EDKADKEHDV DANFDMSPVA IPKRAFANFN SIKSQVDMKN
NGTTNLNSVP DLSNTTSKSN LLPDKPVKIP NKEHTIPSRA ETEENIISEI GVPSTKAPRE
PEDVKSTSAT VRKTLLSESP SIRPFNSSTN GSRSPTRSYL SFKYKGSPRE PTTRSRSRSP
TRSVGPLSKK SSPISKIHDT TGYDVPENDE KGLISRLTIP TSSSAAKRKT PTSSKRETES
RKSEGRKTDL MNTKNRFLTT TLNSNNPQFS LKRPQFQGNQ SVAAKPLHSP TRRRNPAIED
LEIKTAPKLE PDAIPILKKK SMMAERSEAA AQKPKQKFTI SMNHTSKHKA EIVPFSNQKQ
LGDVFARDED TKESHDVHQF EKYSYRNNAV ALPEAARGGF GSAKRRKTNK EDKTPSRTGA
LAKKQFLEKK ATQHGGDIRF PFFQS
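
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below builds the standard codon table, applies that single change, and translates the first bases of the gene sequence. The function names are hypothetical. Because the gene is spliced, only an uninterrupted stretch from the start of the sequence is translated here; translating the whole genomic span directly would not reproduce the full protein.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id: int = 1) -> dict:
    """Codon -> amino acid map for table 1 (standard) or table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    if table_id == 12:
        table["CTG"] = "S"  # the one difference from the standard code
    elif table_id != 1:
        raise ValueError("only tables 1 and 12 are covered in this sketch")
    return table


def translate(dna: str, table_id: int = 12) -> str:
    """Translate complete codons from the start of dna, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    table = codon_table(table_id)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 72 bases of the gene sequence listed above.
prefix = ("ATGAGTTCCA GACTATGGGC GGTGAACGCA ACCAAAAAGA AGAAGAACGA GCTCGTTCCA "
          "GGGTCACTGC GA")
print(translate(prefix, 12))  # MSSRLWAVNATKKKKNELVPGSSR
print(translate(prefix, 1))   # MSSRLWAVNATKKKKNELVPGSLR
```

The table 12 result matches the start of the listed protein sequence (MSSRLWAVNA TKKKKNELVP GSSR...), while the standard code gives leucine at the CTG codon.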