Gene PICST_84949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_84949 
SymbolMNS1 
ID4840096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp450054 
End bp452486 
Gene Length2433 bp 
Protein Length636 aa 
Translation table12 
GC content45% 
IMG OID640391411 
Productmannosyl- oligosaccharide 1,2-alpha-mannosidase 
Protein accessionXP_001385772 
Protein GI150866243 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCTACCTCCA CACTGCTCTG CTTTCGATAC GTTCTGTGTG CTGGCTCAAT CTGTTGCTGA 
ATCCTCTGCT TACAGCTCCA TACTTGATCC GCTCTTTCTG TGGAATTGTG ATATATTACA
GGAAAGCAGT TTATTTTCAC TCTTCTATCA GACTCGCACT TGGATTTTTT TCGGTGCAAG
AATTATCAAT TTAGCCATTG CAATTTTTAC TGAGTTTCTA CTCTTTGCGC TGCGTTCCCT
TCCTCGGCCA TAGATGTCTT CGTTCTCATT TAGCGCAGAC AAACTCTCTC TAGGCAACCG
TTGGAAAGCC AAATCCTCTT CATCGTCGTT GCCAATGTAC TACAAGGATA AACCCAACTT
GAAATCTCCT CGTCCGTCCA AGATCATCAT GCTAGTCAAG GGATTCATAG GCTCCTTGGT
GCTATACTAC ATCTATCTGT TGACCTTCGG CTCTAGTGGC TTATTTGGAC TTACATATTC
CAGCGGCTCC AAATGGACTC GTGCCCAACA GGAAGTTAGA CTGGCCATGT TGGACTCATG
GCACACCTAT GAAAAGTTCG GCTGGGGCTA CGACATATAT CATCCTGTAA GACAAAAAGG
TGAGAACATG GGCCCCAAGC CTTTAGGATG GATGATTGTG GATTCGTTGG ATACCTTGAT
TTTGATGGAT GCCGAAGACG AAGTTGCTAG AGCCAAGAAG TGGATCAAAG AAGACTTGGA
CTACAGGTTC GACTATAATG TCAACACGTT TGAAACTACC ATCAGAATGT TGGGAGGTCT
TCTTTCAGCA TTCCATTTCA CCAATGACGA CTCCCTCTTG GATAAGGCTG TTGACTTGGC
CAATGCATTA GATGGTGCCT TTGCTAGTAA GACGGGCATA CCCTTCAGCT CTGTAAACTT
AGAGTCTGGC GAAGGCATTC CTAACCATGT AGACAACGGT GCTTCGTCAA CTGCTGAAGT
GGCTACTTTG CAATTGGAGT TCAAATATTT GGCCAAGTTG ACTGGTGAAG TGTTGTACTG
GAATCGTGTA GAGAAAGTAA TGCAGGTATT AGAGGCTAAC CAACCAGCTG ATGGACTTGT
ACCTATCTAC GTCAATCCAC AAACTGGTAA TTACCAGGGT AAGTTGATCA GATTGGGTTC
GCGTGGTGAC TCTTACTATG AGTACTTGTT GAAGCAATAC TTACAGACCA ACTTACAAGA
GCCCATCTAT GAGGGCATGT ACCGTGAGTC TGTCAGAGGT GTGAGAAAGC ATTTGGTCAG
AAGATCGAAG CCCAGTGATT TGGCTTTTAT CGGTGAATTG GAGAACGGTA TCGGCAAGCA
CCTCTCACCC AAGATGGACC ATCTTGTGTG TTTCTACGGT GGCTTACTTG CTCTTGGTGC
TACCAATGGT TTGACCTACA GCGAGGCTAA GAAGTTGCCG GACTGGACAG ATGAGAAGGA
AGAAGAGTTC CAGTTGGGTG CCGACTTGAC TTATACTTGT TACAGGATGT ATGCCGACAC
GCAGACAGGT TTGTCGCCAG AAATCGCTGT GTTCAATGAA GACAAGACAC AAAACTCTGA
TTTCCACATC AAGCCTGCCG ACAGGCACAA CTTGCAAAGA CCGGAAACTG TTGAGTCGTT
ATTTGTTTTG TACAGATTGA CTGGTGACGA AAAGTACCGT CAATACGGGT ATGAGATTTT
CAATAGTTTC ATGAAGCACA CCAAGATAGA AAATGAAAAC GGGGACATTT CATTTACGTC
ATTGAAGGAC GTCACTAGTA TCCCCTCACC TACCAAAGAC AACACGGAGT CCTTCTGGTG
GGCCGAGACC TTGAAATACT TGTATTTGTT GTTCGACGAC ACCAACAAGG TACCGTTAGA
CAAGTATGTG TTCAACACAG AAGCACATCC ATTCCCTCGC TTTGACTTAA ACAGCAACCT
CAAGACCGGC TGGATCAGAA AGATCGACGG GTCCAAGGAG CCTGAGCTTC AACAACCCAT
GGTAAAAATA GACAAAAACA AACTTCCAGA AGCACAACCG GTAGATAAGC CTGCAGCCGA
AGAAGCTAAA GAAATCCTCA AGAAAAACCC CGAAGCTGCA AAAGAGGAGA ACGTCGAGGC
CAACAAGAAG AAACTCGACG AATTGATCAA GGATGACATC GGTGCCGCGG CTGTCAGAGA
ATGAGCGAAT ATACTAGTTT TGCATTTGCC TATCAGCATG TAACATTTAA TAATTGAGAT
ATTACCAGTC CGGAGAAATT TGCAACGTCG AACGTGTCAT GACGTTCTGC AGTTTTAAAT
AGGTTGCGTT TTGTTCCCAA TGGGGCAAGA AGTCCTTCTA GTCTACTAAG AATATGTATG
TAGTTACTTA GAGTTTACAT GCTTATAGTT TAATTACGGC CCGTAATCGT AATTACGAAT
GCATGGAACG CAATAAATCT CCGAAGAAGT GGG
 
Protein sequence
MSSFSFSADK LSLGNRWKAK SSSSSLPMYY KDKPNLKSPR PSKIIMLVKG FIGSLVLYYI 
YSLTFGSSGL FGLTYSSGSK WTRAQQEVRS AMLDSWHTYE KFGWGYDIYH PVRQKGENMG
PKPLGWMIVD SLDTLILMDA EDEVARAKKW IKEDLDYRFD YNVNTFETTI RMLGGLLSAF
HFTNDDSLLD KAVDLANALD GAFASKTGIP FSSVNLESGE GIPNHVDNGA SSTAEVATLQ
LEFKYLAKLT GEVLYWNRVE KVMQVLEANQ PADGLVPIYV NPQTGNYQGK LIRLGSRGDS
YYEYLLKQYL QTNLQEPIYE GMYRESVRGV RKHLVRRSKP SDLAFIGELE NGIGKHLSPK
MDHLVCFYGG LLALGATNGL TYSEAKKLPD WTDEKEEEFQ LGADLTYTCY RMYADTQTGL
SPEIAVFNED KTQNSDFHIK PADRHNLQRP ETVESLFVLY RLTGDEKYRQ YGYEIFNSFM
KHTKIENENG DISFTSLKDV TSIPSPTKDN TESFWWAETL KYLYLLFDDT NKVPLDKYVF
NTEAHPFPRF DLNSNLKTGW IRKIDGSKEP ELQQPMVKID KNKLPEAQPV DKPAAEEAKE
ILKKNPEAAK EENVEANKKK LDELIKDDIG AAAVRE