Gene PICST_1541 details


Gene Information

Locus tag: PICST_1541
Symbol: BGL1
ID: 4838763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1778040
End bp: 1780253
Gene Length: 2214 bp
Protein Length: 738 aa
Translation table: 12
GC content: 41%
IMG OID: 640390078
Product: beta-glucosidase
Protein accession: XP_001384652
Protein GI: 150865435
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
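
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (the record does not state the convention); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1778040
end_bp = 1780253
reported_gene_length = 2214      # bp
reported_protein_length = 738    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in non-overlapping triplets (codons).
codons, remainder = divmod(gene_length, 3)

print(gene_length == reported_gene_length)   # coordinate span vs. Gene Length
print(codons, remainder)                     # number of whole codons, leftover bases
print(codons == reported_protein_length)     # codon count vs. Protein Length
```

Under that assumption the span equals the reported gene length, and the codon count equals the reported protein length, which suggests that the listed gene sequence does not include a stop codon.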


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.36185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCTT TTGACATTGA GGGAATATTG AGTCAATTGA CGTTAGAGGA AAAAGTAGGA 
CTACTTGCTG GTATAGACTT TTGGCACACC TACGCGGTGG ATAGGTTAGA TATCCCTAGT
CTAAGATTTA GTGACGGTCC TAATGGTGTC AGAGGAACAA AATTCTTTGA TGCTATTCCG
TCTGCGTGCT TTCCATGTGG TACTGCATTG GCTGCAACTT TTGACAAACA ATTGTTACGT
GACACCGGAA AGTTGATGGG GGTTGAAGCT AAAGCTAAGG GTGCTCATGT TATTCTTGGC
CCAACAATGA ATATTCAAAG AGGCCCATTG GGAGGAAGAG GGTTCGAATC TTTCAGTGAA
GACCCTCACT TGAGTGGCCA TGCTGCAGCT GCCATTGTGA ACGGAATTCA AGAAGAAGGT
ATTGCGGCAA CCGTTAAACA TTTCGTTTGC AATGATCTTG AAGATGAAAG AAATTCTAGC
AATTCAATTC TTTCCATGAG GGCTTTAAGG GAGATATATT TGGAACCTTT CAGGATAGCA
ATCAAACATG CGAACCCCAA GGCTTTGATG ACTGGTTACA ACAAAGTAAA TGGCGAACAT
GTTTCTCAAA GTGAAAGTAT TATCAAGGAC ATCTTAAGAG AAGAATGGAA ATGGGAAGGT
ACCATAATGT CCGATTGGTA TGGAACTTAT ACTAGTGACA CTGCTATTAG GGCTGGATTA
GACATCGAGA TGCCGGGTCC AACTAAGTTT AGAAGCTTAA GTGAAATTCT GCACATGGTT
GTGTCAAAGG AATTGCATAT CAAGCATATA AATGATAGAG TAAGGAATGT TCTTAAGTTG
GTTCAATTTG CCCAAGGTTC AGGAGTGCCT CAAAATGCTC CCGAAGGCAC AAGTAATAAT
AGTGCTGAAA CCAGTGCTAA GTTAAGAAAA ATTGCACTGG ATTCCATAGT ATTGCTCAAA
AACACTGGAA TACTACCTTT GAGTAAGGAT TCATCCATTG CAGTGATAGG TCCAAATGCT
AAATTTGCTG CTTATTGTGG AGGGGGATCT GCTTCGCTTG CATCTTACTA CACTACAACA
CCTTATTCTG GTATTGCATC CAAGACAACT ACTCCGCCTA AGTACTCAGT TGGTGCAACT
GGTCATAGAT TGTTGCCTGA TTTGGCTTCC CAGGTAATAA ATCCAATCAC TGGAAGTGTT
GGTGTCAATG CAAAGTTCTA CTCGGAACCT AGCACTTCTG AGAGAAGGAA CTTGCTAGAT
GAGTACAATT TAATTGATAC TCGGGTCAAT CTTTTTGATT ACATCAGTAC CAGTAGGGCA
CGTAATGAAC CATTCTATAT TGACTTCGAA GGAGACTTTG TTCCTGAAGA AACGGCCAGT
TACAGATTTG GACTTGCTGT GTTCGGTACA GCTGACTTGT ATGTTGACAA CAAATTGGTT
ATTGACAACA GCACAAATCA AAAGAAAGAC GAGCACTTTG TTGGTTCTGG AACTAGAGAA
GAACACGGCG TCATCCAATT AGAGAAAGGT AAGAATTATA GGATTCGTGT TGAGTTTGGG
TCAGCACACA CCTATACTTT TTCTGACCCC AACGCAGAAT TTCATGGTGG AGGTTCTTTG
AAAATCGGTT GTATTAAAGT TGTCGAACCC GAAGAAGAAA TTAGAAGGGC TATTGAAATC
GCAAAGACAG TAGACCAGGT TGTTTTGTGC ATTGGACTCA ATCTGGAGTG GGAATCTGAA
GGTTACGATC GTCCAGATAT GGAATTGATC GGCCTTCAGA ACAAATTAGT AGAGGAAATT
ATAAAGGCTA ATCCGAATAC TATCATTGTC AATCAGTCAG GCACTCCAGT AGAGATGCCT
TGGTTACCAA AAGCAAAGGC AGTTGTTCAA GCTTGGTTTG GAGGTACCGA AGGTGGTAAT
GCAATTGCAG ATGTCTTGTT TGGTGATGTT AATCCTAGTG GAAAGTTGTC ACTTTCATTC
CCTTTCAAAA ACTTCGATAA TCCGGCTTAC CTCAATTTCA CCACTGATAA CGGTCGAGTC
CTTTATGGAG AAGACATATT CGTTGGTTAT AGATATTACG AAAAACTAAA CAGAGAGGTT
GCGTACCCAT TTGGTTTTGG ATTATCATAT ACTTCATTCA AAATCGGAGA CTTGAAAGTT
CAAGTTCTAG ACCAGGATAA TATTGAGATT TCTGTTAACA TCAAGAATAC TGGA
 
Protein sequence
MTAFDIEGIL SQLTLEEKVG LLAGIDFWHT YAVDRLDIPS LRFSDGPNGV RGTKFFDAIP 
SACFPCGTAL AATFDKQLLR DTGKLMGVEA KAKGAHVILG PTMNIQRGPL GGRGFESFSE
DPHLSGHAAA AIVNGIQEEG IAATVKHFVC NDLEDERNSS NSILSMRALR EIYLEPFRIA
IKHANPKALM TGYNKVNGEH VSQSESIIKD ILREEWKWEG TIMSDWYGTY TSDTAIRAGL
DIEMPGPTKF RSLSEISHMV VSKELHIKHI NDRVRNVLKL VQFAQGSGVP QNAPEGTSNN
SAETSAKLRK IASDSIVLLK NTGILPLSKD SSIAVIGPNA KFAAYCGGGS ASLASYYTTT
PYSGIASKTT TPPKYSVGAT GHRLLPDLAS QVINPITGSV GVNAKFYSEP STSERRNLLD
EYNLIDTRVN LFDYISTSRA RNEPFYIDFE GDFVPEETAS YRFGLAVFGT ADLYVDNKLV
IDNSTNQKKD EHFVGSGTRE EHGVIQLEKG KNYRIRVEFG SAHTYTFSDP NAEFHGGGSL
KIGCIKVVEP EEEIRRAIEI AKTVDQVVLC IGLNSEWESE GYDRPDMELI GLQNKLVEEI
IKANPNTIIV NQSGTPVEMP WLPKAKAVVQ AWFGGTEGGN AIADVLFGDV NPSGKLSLSF
PFKNFDNPAY LNFTTDNGRV LYGEDIFVGY RYYEKLNREV AYPFGFGLSY TSFKIGDLKV
QVLDQDNIEI SVNIKNTG