Gene PICST_32530 details


Gene Information

Locus tag: PICST_32530
Symbol: YIC1
ID: 4840087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 28472
End bp: 30943
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 12
GC content: 39%
IMG OID: 640391402
Product: Alpha-glucosidase II; Alpha-xylosidase
Protein accession: XP_001385692
Protein GI: 150866186
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
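
The listed lengths follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(28472, 30943)
print(length, protein_length_aa(length))  # prints: 2472 823
```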


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.725216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACC ATCAAAACAT GGAATATTAT TCGTTGGATC GTCTGATCGA TTCAGACACA 
CGGTTCACCC GCGGGATGTG GGAATTGAAA CATGGTATTA ATCTCCGTTG GGCATTAGAA
AATGTCAAAA CTGTAGTTAA AGATCCCAGT TCCATTCAGA CTGTATCTGC AACTCAAAGA
ATTTCTCAAA GAGGTGATAC TTTGAATAAT CCAACTATTA CTTCTACTAT ATCTTCACCT
TCGGAAGGTA TTGTGAAGTT TGAAGCTTAT CATCACCTAT CTGGGTATAA CAACCACAGC
GAACCTCGGT TTGGTTTGAA CGAAAAAGAT AGTCCTGATG TTACAACTTC AGTTTCTGAT
GACTTCTCGA GTTTGGAAAC GGAGGGTGAT ATTTCGGTGA ATGTCTCTAA CAAGCCAGCG
ACGTTTGAGT TTTTGGGTAC AAAAGGTAAA CTATTAACAA AGTTAGGGTA CAAGTCATTG
GGATGGGTTA AGGACGCTCG TGTTCCTCAC AGCTCCACTC CAAGAGGCTA TACTACATAT
ACTACTGCCC AATTACATCT TTCTGTTGGT GAAAGATTAT TTGGATTGGG AGAGAGGTTT
GGTCCTTTTG TTAAGAACGG TCAAAGAGTT GAAATTTGGA ATGAAGACGG AGGTACTCTG
AGTGAATGGA CATACAAGAA TATTCCATTT TATCTTTCTG ACAGAGGGTA TGGTATCTTT
GTTGATTCTT CCTCAAACGT TGTTTTCGAG TTACAATCTG AAAGAACAAC CAGGGTTAAT
ATTACTGTGC CAGGAGAAGG CATTAGATTT TATGTAATTC ATGGTCCTGA TCCAAAAACT
ATTTTGAAGA GATATACGAA GCTTACTGGC AGACCAGCTT TACCACCAGC TTGGACTTTT
GGTCTTTGGT TAACTACTTC ATTCACAACT GAATATGACT TGAACACTGT GAGTTCATTT
ATTCAAGGAA TGAAGGATAG AGATATTCCC TTGACCACCT TCCATTTTGA CTGCTTTTGG
ATGAAAGGTT TCCAATGGTG TGATTTTGAG TTTGATCCTC AATACTTTCC AGATGCCAAG
TTGATGTTGA AGGAACTCAA GTCAAGATTC AACGTCAAGG TTTGTGTTTG GATAAATCCA
TACATTGCAC AAGAGTCAAT GTTGTTTAGG GAGGCAGATG AAAAGAGGTA TCTTATTAGG
TATAACTCAA ATGTGAACAA TGGAGCTTCT TATCAGACAG ATTTGTGGCA AGCAGGTATG
GGTATTGTTG ATTTTACTAA TCCTGATGCT GTCAAATGGT ACCAATCCAA ACTTGAACAT
TTAATCGATC TAGGAGTAGA TTCATTCAAG ACGGATTTTG GTGAAAGAAT TCCTGTAAAG
GATATCGTCT ACCACAGTGG TGAAGATTCT GTGGCAATGC ACAATTATTA TGCATTGCTT
TACAACAAGA CTGTTTTTGA ACTTTTGGAA AGGAAGCTAG GAAAAGATAA TGCTTGCGTC
TTCGCTCGTA GTGCGACAGT TGGTGGTCAA CAATATCCAG TTCATTGGGG TGGAGACTGT
GAGTCCACCT TTGAAGCAAT GGCTGAATCA TTGAGAGGTG GATTAAGTCT TACATTATCA
GGTTTTGGAT TTTGGAGCCA TGACATAGGT GGCTTTGAAG GTGATCCACG CCCTGAAGTT
TATAAGAGAT GGTGTGCATT TGGCTTGCTT AGTTCTCATT CTCGTCTACA TGGAAGTAAC
TCTTACAGAG TTCCATGGAA TTTTGATGAT GAAGCATCGG AAGTTTTGGC AAAATTCACC
AAATTGAAGA TTTCATTGAT GCCCTACATC TACAAACATG CTATTGAATC ACATGAGACT
GGAGTTCCAG TAATGAGAGC AATGATGTTA GAATTCCCTG ATGACAAGAC TGCTGTAAGT
GTCGATTCTC AATTTACTTT AGGAGATTCT TTGTTGGTTT CTCCTGTATT TTCTGGAGAT
GAAGGTGAGG TTTCTTACTA CTTACCTAAG GGCTCTTGGT ATGGTCTTTT GGATGGCAAA
ATTAGATCAT CTGTGGGCGA GTGGATGAAT GAAGTTCATG GTTATACGTC TTTGCCCATT
TTAGTTCGTC CAAACTCGGT GATTGTCACG TCTGGTCCTG ATGCTGAAAA TGAACACCCT
GTGTACACTT GGAATGAAAG ATTTATGCTC AACGTATTTG ATGTTGACAA AACGTGGAAT
AATTCAACTA ATATTCCAAA TAGTAAGAAG CTTGGAAAGA TTGATACCAC TATTGATGTT
TCTAAAGGAG ATAATTTGTT AACAATCAAG GTAAGTGGAG ATTTCAAGAC ACCTTATTTT
GTAAGCTTCT TAGGCAAGAA TGCTGGTGTG GAAGTTGTGA AGGGTAAATC CAAACCATCA
GGAAGCGGCA CTATTGGAAA TCTTATTTTT GAGTGCAACA GTGGTGTTTT GCAATTCAAA
ATTTTAAACT AA
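
A GC content figure like the one in the record can be recomputed from the sequence block. This minimal sketch strips the spacing between the 10-base groups and counts G and C; the helper names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence; paste the whole block to check the full gene.
first_row = "ATGAGCAACC ATCAAAACAT GGAATATTAT TCGTTGGATC GTCTGATCGA TTCAGACACA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```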
 
Protein sequence
MSNHQNMEYY SLDRSIDSDT RFTRGMWELK HGINLRWALE NVKTVVKDPS SIQTVSATQR 
ISQRGDTLNN PTITSTISSP SEGIVKFEAY HHLSGYNNHS EPRFGLNEKD SPDVTTSVSD
DFSSLETEGD ISVNVSNKPA TFEFLGTKGK LLTKLGYKSL GWVKDARVPH SSTPRGYTTY
TTAQLHLSVG ERLFGLGERF GPFVKNGQRV EIWNEDGGTS SEWTYKNIPF YLSDRGYGIF
VDSSSNVVFE LQSERTTRVN ITVPGEGIRF YVIHGPDPKT ILKRYTKLTG RPALPPAWTF
GLWLTTSFTT EYDLNTVSSF IQGMKDRDIP LTTFHFDCFW MKGFQWCDFE FDPQYFPDAK
LMLKELKSRF NVKVCVWINP YIAQESMLFR EADEKRYLIR YNSNVNNGAS YQTDLWQAGM
GIVDFTNPDA VKWYQSKLEH LIDLGVDSFK TDFGERIPVK DIVYHSGEDS VAMHNYYALL
YNKTVFELLE RKLGKDNACV FARSATVGGQ QYPVHWGGDC ESTFEAMAES LRGGLSLTLS
GFGFWSHDIG GFEGDPRPEV YKRWCAFGLL SSHSRLHGSN SYRVPWNFDD EASEVLAKFT
KLKISLMPYI YKHAIESHET GVPVMRAMML EFPDDKTAVS VDSQFTLGDS LLVSPVFSGD
EGEVSYYLPK GSWYGLLDGK IRSSVGEWMN EVHGYTSLPI LVRPNSVIVT SGPDAENEHP
VYTWNERFML NVFDVDKTWN NSTNIPNSKK LGKIDTTIDV SKGDNLLTIK VSGDFKTPYF
VSFLGKNAGV EVVKGKSKPS GSGTIGNLIF ECNSGVLQFK ILN
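
Translation table 12 (the alternative yeast nuclear code) differs from the standard code in reading CTG as serine rather than leucine, which is why codon 15 of the gene (CTG) appears as S in the protein. The sketch below builds both tables by hand instead of relying on a library; the names are illustrative, and the sample input is the first row of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard code, with codons enumerated in TCAG order.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
}
# Table 12 changes a single codon: CTG encodes Ser instead of Leu.
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGAGCAACC ATCAAAACAT GGAATATTAT TCGTTGGATC GTCTGATCGA TTCAGACACA"
print(translate(first_row, TABLE_12))  # prints: MSNHQNMEYYSLDRSIDSDT
```

Translating the same row with `STANDARD_TABLE` yields L at position 15, so the choice of table matters when checking the protein against the gene.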