Gene PICST_33726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33726 
SymbolSGA1 
ID4840861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp337269 
End bp338951 
Gene Length1683 bp 
Protein Length560 aa 
Translation table12 
GC content38% 
IMG OID640392176 
ProductGlucoamylase GLU1 precursor (Glucan 1,4-alpha-glucosidase) (1,4-alpha-D-glucan glucohydrolase) 
Protein accessionXP_001386458 
Protein GI150866757 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.259511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTGC AATTACTATT GTTATTACAG TTTGTTAGCT TCTGCTACAG TTTGTACATC 
CCAATTGGAG GACAATCTTT CAATAGAGGT CTCATCGAAA TTGAAAAAAG CATAGGTGAT
GCTACCGACG GCTCATTCCT TCCTGGTTTT ATTCAGCAGT TCTTCAGTTG GTCAGTTTCG
GAGAAGGTAG AAGATAACAT CCGCCTTGTT GATTTTGAGA CATGGATTGA GAAGCAGAAG
GAAATTTCAT TCAGAGGTAT CCTTAATAAT ATCGGGGGTG TCAGTGATAC TCTCGAACAA
TCTGAAGTTT CCAAAGGCGC TGTAATCGCT TCTCCATCGA GAATTCAGCC AAATTACTTT
TACCAGTGGG TGAGAGACGC TGCTTTAACC ATTAAGTCAC TTGTTTATCA CATTGATGAT
AACAATTTTG AAAATGTCGA CGATATCCAA TCAGTTATCG AAGCGTACAT AGAGAACAAC
TATTATTTGC AACGTTTAGA TAACAACTCT GGAAAGTTTG ATGACCCAGA TAAGTCTGGC
CTTGGAGAAC CAAAGTTCCA TGCAAACAAT ACGGCTTTCG TCCAAAACTG GGGTAGACCT
CAGAGAGATG GGCCAGGTTT AAGAGCTATC ACTATTTTGA GTTATGTGAG CTTGTTGGAC
AAGTGGAACA AGAAAGTTTC CAACAAGTTT TTGAAGTCTC CAGAATTTAT CTATAACAAA
ATCGTGAAGC CTGACTTAAC TTACATTGTC AGAAATTGGT TCAAAGAGGG ATTTGATTTA
TGGGAAGAAA TAAATTCGCA TCACTTTTAC ACGTCTGTCA CACAACTAGC TGCAATCAAG
GATGGTTTAT TATTGGCCCA GAAGTTTGAA AAAGATTCCG ATTTTTTGAG ACAATTGCAA
ATCACTTATA CAAACTTGAA GCAATTTATA GAGAATGATT CTGGTTACAA GAACCCTGCT
GTACCGTATA TCGTTGAAAC TCCACTGTTA CTTAGAGCAG GTAAACGTAC TGGCTTGGAT
GCTGGATCAC TCTTGGGTTC TCTTCATTCT CATAACATGG AATTTGGAGA CTATAGTGAC
ATTCCGTTTG ATGTTAATGA TACCCATTTG ATCAACACTT TGAGTGCAAT GGTCGCAGAT
ATGAAGTACA GATATCCTCT CAATCATAAC AAGATTGGGT TTGAAAAGGG CATTGGATGT
GCCTTGGGAA GATATCCTGA AGATATTTAT GATGGATATG GTACTTCTGA AGGTAACCCA
TGGTTTATTT CAACTGCTTC TGCTTCTGAA CTAATTTACA AGTTTATATA CAACTTAGAG
CATAACCACA TGGATATTGT GATTAACAGT CAGAACAAAG ATTTCTTCAA ACAGTTTGTT
GACTTTGATA ATATCCCATC AAATGACTTG ACAACAGTAC CTGCCAATGA TTATACTGAT
TCAATTGTGA TTAGATATGG AACCCAAACA TTCAGAACAC TCTCAATTAA TTTGGTGACA
TATTCTGATT CCTTTTTGGA AGTGATCAAA GATCACGTTG ATAATCAGGG CCGCATGTCG
GAGCAATTCA ATAAGTATCA TGGTTTCATG CAAGGTGCAA GGGATTTGAC TTGGAGTTAT
AGTGCAGTTT GGAATGCCTT CAGATGGAGA CAGAAGACTT TAGATATTTT AGACCAATTC
TAG
 
Protein sequence
MKLQLLLLLQ FVSFCYSLYI PIGGQSFNRG LIEIEKSIGD ATDGSFLPGF IQQFFSWSVS 
EKVEDNIRLV DFETWIEKQK EISFRGILNN IGGVSDTLEQ SEVSKGAVIA SPSRIQPNYF
YQWVRDAALT IKSLVYHIDD NNFENVDDIQ SVIEAYIENN YYLQRLDNNS GKFDDPDKSG
LGEPKFHANN TAFVQNWGRP QRDGPGLRAI TILSYVSLLD KWNKKVSNKF LKSPEFIYNK
IVKPDLTYIV RNWFKEGFDL WEEINSHHFY TSVTQLAAIK DGLLLAQKFE KDSDFLRQLQ
ITYTNLKQFI ENDSGYKNPA VPYIVETPSL LRAGKRTGLD AGSLLGSLHS HNMEFGDYSD
IPFDVNDTHL INTLSAMVAD MKYRYPLNHN KIGFEKGIGC ALGRYPEDIY DGYGTSEGNP
WFISTASASE LIYKFIYNLE HNHMDIVINS QNKDFFKQFV DFDNIPSNDL TTVPANDYTD
SIVIRYGTQT FRTLSINLVT YSDSFLEVIK DHVDNQGRMS EQFNKYHGFM QGARDLTWSY
SAVWNAFRWR QKTLDILDQF