Gene PICST_42120 details


Gene Information

Locus tag: PICST_42120
Symbol: MAL6
ID: 4836930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 752503
End bp: 754221
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 12
GC content: 42%
IMG OID: 640388245
Product: alpha-glucosidase maltase
Protein accession: XP_001382912
Protein GI: 150864188
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
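
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    span = end_bp - start_bp + 1
    if span % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return span, span // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
gene_bp, protein_aa = cds_lengths(752503, 754221)
print(gene_bp, protein_aa)  # 1719 572
```

Under these assumptions the result agrees with the Gene Length (1719 bp) and Protein Length (572 aa) fields.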


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.217758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.25099
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATTG CTCGCAACTG GTGGAAAAAT GCCACTGTAT ACCAAATCTG GCCAGCTTCG 
TACAAGGACT CCAATGGAGA CGGTTTTGGT GATATCCCAG GTATCATATC AACATTGGAC
TACCTCAAGG ATTTAGGAGT GGATGTGATT TGGTGTAGTC CCATGTACGA CTCGCCGCAG
GATGACATGG GGTATGACAT TAGCGACTAC GAAAAGGTTT ACCCGAAGTA TGGAACTAAC
GAAGATATGC AGGCACTTAT AGACGAAACG CATAAGCGGG GCATGAAATT GGTGTTGGAT
TTGGTTATCA ACCATACTTC TAGCGAGCAT GCCTGGTTCA AGGAATCCAG ATCCTCGAAG
ACCAACCCAA AAAGAGATTG GTATATTTGG AAGCCTCCTA AATTTGATGC AGATGGTAAG
AGACATCCTC CTAATAACTG GAGTTCGTAT TTTTCTGGCT CAGCTTGGGA ATACGACGAA
CTTACTGAAG AGTACTACTT AAGACTCTTT GCCAGAACTC AACCTGATTT GAACTGGGAA
AACGACGAAA CCAGAAAGGC AGTCTATGAC TCTGCTATGA AGTTTTGGCT CGACAAGGGT
ATTGATGGCT TTAGAATTGA TACAGCTGGA TTGTATTCAA AGGATCAACG CTTCCCGGAT
TGTCCCATTG TATACCCAGA TGAAGAATTT CAGCCAAGTC AAAAGTATAG TCTGAATGGG
CCCCGGATTC ATGAATTCCA CAAGGAAATG TACGCCAATG TAACTAGCAA CTATGATGCC
ATGACAGTTG GAGAAGTTGG CCATTGTTCA CGAGAAGATG CCTTGAAGTA TGTCAGTGCC
AAGGAACAAG AAATGAATAT GATATTCCTC TTCAATGCTA TTAATGTCGG TTACGATAAA
GCTGATCGTT ACAGGTACAA GGGCTGGACC TTGACTGACT TCAAGAAGGC CATTCAAAAG
GACTCTTCTT TCATCGAAGG CACTGATGCG TGGTCGACTG TCTTCATTGA AAACCATGAC
ATTGCTAGAT CGGTTACTAG ATTTGGCAGT CCCAAGCACA CATCAAAGTC TGCTAAGTTG
ATTTCCTTGT TGGAGTCCAC TTTAACAGGT ACCCTCTTCA TATACCAGGG CCAGGAAATT
GCCATGGAAA ATTTACCAAG ATCTTGGTCT ATCGAAGAAT ACAAGGATAT CAACACTGTC
AACTACTACA AGCAGTTCAA GGAGAAGTAT GGTAATGACC CAGACTTCAA GGAGAAGGAA
GAGAAGTTGA TGGACATCAT CAACCTTGTT TCCAGAGACC ATGCAAGATC TCCGGTTCAA
TGGGATTCTT CTCCCCATGG CGGTTTCACT ACAGGTACTC CGTGGACAAG AGTAAATGAT
AATTACAAAG CCATTAATGT TGCTAGCCAG ATTGATGACC CTAACTCGGT ATTGAACTTC
TGGAAGAAGT CTATTCAAAT AAGAAAGCAA TATCAAGACT TGCTTATTTT CGGCTCATTC
AAAATCTTAG ATTTTGACAA CGAGACCGTC TTCACATATG TTAAGGAAGA TGAAAATGCT
GCTTCTCCTA AGGCATATGT AGTATTGAAC TTCTCTAACG ATTCCGTGAA GTTTGAGAAG
TTGATCGATG GCGAATTTGA ACTTGTTCAC AGCACCACTG ACGACATTGA CGAACTGACA
TTGTCTCCAT ATGAAGGTCG TCTATATATT GTTGATTAG
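
A GC content figure like the one in the Gene Information fields can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole sequence; the helper name `gc_fraction` is hypothetical, and the example only uses the first ten bases of the gene.

```python
def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence above (10 bases: 2 G, 1 C).
print(gc_fraction("ATGACAATTG"))  # 0.3
```

Passing the full gene sequence, with or without the spaces between blocks, gives the value for the whole gene.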
 
Protein sequence
MTIARNWWKN ATVYQIWPAS YKDSNGDGFG DIPGIISTLD YLKDLGVDVI WCSPMYDSPQ 
DDMGYDISDY EKVYPKYGTN EDMQALIDET HKRGMKLVLD LVINHTSSEH AWFKESRSSK
TNPKRDWYIW KPPKFDADGK RHPPNNWSSY FSGSAWEYDE LTEEYYLRLF ARTQPDLNWE
NDETRKAVYD SAMKFWLDKG IDGFRIDTAG LYSKDQRFPD CPIVYPDEEF QPSQKYSSNG
PRIHEFHKEM YANVTSNYDA MTVGEVGHCS REDALKYVSA KEQEMNMIFL FNAINVGYDK
ADRYRYKGWT LTDFKKAIQK DSSFIEGTDA WSTVFIENHD IARSVTRFGS PKHTSKSAKL
ISLLESTLTG TLFIYQGQEI AMENLPRSWS IEEYKDINTV NYYKQFKEKY GNDPDFKEKE
EKLMDIINLV SRDHARSPVQ WDSSPHGGFT TGTPWTRVND NYKAINVASQ IDDPNSVLNF
WKKSIQIRKQ YQDLLIFGSF KILDFDNETV FTYVKEDENA ASPKAYVVLN FSNDSVKFEK
LIDGEFELVH STTDDIDEST LSPYEGRLYI VD
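
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows how this affects the translation of the last 48 bases of the gene sequence. It is a plain-Python illustration, not the method used by the database; the names `codon_table` and `translate` are hypothetical, and the amino-acid strings follow the NCBI ordering of codons (TTT, TTC, TTA, TTG, TCT, ...).

```python
from itertools import product

BASES = "TCAG"
# One letter per codon, in NCBI order; "*" marks a stop codon.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard code only at CTG (L -> S).
TABLE12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(aas):
    """Map each of the 64 codons to its amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


def translate(dna, table):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last 16 codons of the gene sequence above, including the TAG stop.
tail = "GAACTGACA" "TTGTCTCCAT" "ATGAAGGTCG" "TCTATATATT" "GTTGATTAG"

print(translate(tail, codon_table(TABLE12_AAS)))   # ESTLSPYEGRLYIVD
print(translate(tail, codon_table(STANDARD_AAS)))  # ELTLSPYEGRLYIVD
```

The table 12 result matches the end of the protein sequence listed above, while the standard code would place a leucine at the CTG position.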