Gene PICST_61725 details


Gene Information

Locus tag: PICST_61725
Symbol: BGL3
ID: 4840204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1711017
End bp: 1713230
Gene Length: 2214 bp
Protein Length: 738 aa
Translation table: 12
GC content: 41%
IMG OID: 640391519
Product: beta-glucosidase
Protein accession: XP_001385685
Protein GI: 150866180
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
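
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive; the record does not state this, but the listed gene length is consistent with it. The names `span_length` and `codon_count` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1711017
END_BP = 1713230
GENE_LENGTH_BP = 2214
PROTEIN_LENGTH_AA = 738


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a span; rejects lengths not divisible by 3."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


computed_length = span_length(START_BP, END_BP)
computed_codons = codon_count(computed_length)
print(computed_length, computed_codons)  # prints "2214 738"
```

The span divides into exactly 738 codons, the same number as the listed protein length, so the gene sequence as listed has no room for a trailing stop codon.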


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.614554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCTT TTGACATTGA GGGAATATTG AGTCAATTGA CGTTAGAGGA AAAAATAGGA 
CTACTTGCTG GTATAGACTT TTGGCACACT TACGCGGTGG ATAGGTTAGA TATCCCTAGT
CTCAGATTTA GTGACGGTCC TAATGGTGTC AGAGGAACAA AATTCTTTGA TGCTATTCCG
TCTGCGTGCT TTCCATGTGG TACTGCATTG GCTGCAACTT TTGACAAACA ATTGTTACGT
GACACCGGAA AGTTGATGGG GGTTGAAGCT AAAGCTAAGG GCGCTCATGT TATTCTTGGC
CCAACAATGA ATATTCAAAG AGGCCCATTG GGAGGAAGAG GGTTCGAGTC TTTCAGTGAA
GACCCTCACT TGAGTGGCCA TGCTGCAGCT GCCATTGTGA ACGGAATTCA AGAAGAAGGT
ATTGCGGCAA CCGTTAAACA TTTCGTTTGC AATGATCTTG AAGATGAAAG AAATTCTAGC
AATTCAATTC TTTCCATGAG GGCTTTAAGG GAGATATATT TGGAACCTTT CAGGATAGCA
ATCAAACATG CGAACCCCAA GGCTTTGATG ACTGGTTACA ACAAAGTAAA TGGCGAACAT
GTTTCTCAAA GCGAAAGTAT TATCAAGGAC ATCTTAAGAG AAGAATGGAA ATGGGAAGGT
ACCATAATGT CCGATTGGTA TGGAACTTAT ACTAGTGACA CTGCTATTAG GGCTGGATTA
GACATCGAGA TGCCGGGTCC AACTAAGTTT AGAAGCTTAA GTGAAATTCT GCACATGGTT
GCGTCAAAGG AATTGCATAT CAAGCATATA AATGATAGAG TAAGGAATGT TCTTAAGTTG
GTTCAATTTG CCCAAGGTTC AGGAGTGCCT CAAAATGCTC CCGAAGGCAC AAGTAATAAT
AGTGCTGAAA CCAGTGCTAA GTTAAGAAAA ATTGCACTGG ATTCCATAGT ATTGCTCAAG
AACACTGGAA TACTACCTTT GAGTAAGGAT TCATCCATTG CAGTGATAGG TCCAAATGCT
AAATTTGCTG CTTATTGTGG AGGGGGATCT GCTTCGCTTG CATCTTACTA CACTACAACA
CCTTATTCTG GTATTGCATC CAAGACAACT ACTCCGCCTA AATACTCAGT TGGTGCAACT
GGTCATAGAT TGTTGCCTGA TTTGGCTTCC CAGGTAATAA ATCCAAGCAC TGGAAGTGTT
GGTGTCAATG CAAAGTTCTA CTCGGAACCT AGCACCTCTG AGAGAAGGAA CTTGCTAGAT
GAGTACAATT TAATTGATAC TCGGGTCAAT CTTTTTGATT ACATCAGTAC CAGTAGGGCA
CGTAATGAAC CATTCTATAT TGACTTCGAA GGAGACTTTG TTCCTGAAGA AACGGCCAGT
TACAAATTTG GACTTGCTGT GTTCGGTACA GCTGACTTGT ATGTTGACAA CAAATTGGTT
ATTGACAACA GCACAAATCA AAAGAAAGAC GAGCACTTTG TTGGTTCTGG AACTAGAGAA
GAACACGGCG TCATCCAATT AGAGAAAGGT AAGAATTATA GGATTCGTGT TGAATTTGGG
TCAGCACACA CCTATACTTT TTCTGACCCC AACGCAGAAT TTCATGGTGG AGGTTCTTTG
AAAATCGGTT GTATTAAGGT TGTCGAACCC GAAGAAGAAA TTAGAAGGGC TATTGAAATC
GCAAAGACAG TAGACCAGGT TGTTTTGTGC ATTGGACTCA ATCTGGAGTG GGAATCTGAA
GGCTACGATC GTCCAGATAT GGAGTTGATC GGCCTTCAGA ACAAATTAGT AGAGGAAATT
ATAAAGGCTA ATCCGAATAC TGTCATTGTC AATCAGTCAG GCACTCCAGT AGAGATGCCT
TGGTTACCAA AAGCAAAGGC GGTTGTTCAA GCTTGGTTTG GAGGTACCGA AGGTGGTAAT
GCAATTGCAG ATGTCTTGTT TGGTGATGTT AATCCTAGTG GAAAGTTGTC ACTATCATTC
CCTTTCAAAA ACATCGATAA TCCGGCTTAC CTCAATTTCA CCACTGATAA CGGTCGAGTC
CTTTATGGAG AAGACATATT CGTTGGTTAT AGATATTACG AAAAACTAAA CAGAGAGGTT
GCGTACCCAT TTGGTTTTGG ATTATCATAT ACTTCATTCA AAATCGGAGA CTTGAAAGTT
CAAGGTCTAG ACCAGGATAA TATTGAGATT TCTGTTAACA TCAAGAATAC TGGA
 
Protein sequence
MTAFDIEGIL SQLTLEEKIG LLAGIDFWHT YAVDRLDIPS LRFSDGPNGV RGTKFFDAIP 
SACFPCGTAL AATFDKQLLR DTGKLMGVEA KAKGAHVILG PTMNIQRGPL GGRGFESFSE
DPHLSGHAAA AIVNGIQEEG IAATVKHFVC NDLEDERNSS NSILSMRALR EIYLEPFRIA
IKHANPKALM TGYNKVNGEH VSQSESIIKD ILREEWKWEG TIMSDWYGTY TSDTAIRAGL
DIEMPGPTKF RSLSEISHMV ASKELHIKHI NDRVRNVLKL VQFAQGSGVP QNAPEGTSNN
SAETSAKLRK IASDSIVLLK NTGILPLSKD SSIAVIGPNA KFAAYCGGGS ASLASYYTTT
PYSGIASKTT TPPKYSVGAT GHRLLPDLAS QVINPSTGSV GVNAKFYSEP STSERRNLLD
EYNLIDTRVN LFDYISTSRA RNEPFYIDFE GDFVPEETAS YKFGLAVFGT ADLYVDNKLV
IDNSTNQKKD EHFVGSGTRE EHGVIQLEKG KNYRIRVEFG SAHTYTFSDP NAEFHGGGSL
KIGCIKVVEP EEEIRRAIEI AKTVDQVVLC IGLNSEWESE GYDRPDMELI GLQNKLVEEI
IKANPNTVIV NQSGTPVEMP WLPKAKAVVQ AWFGGTEGGN AIADVLFGDV NPSGKLSLSF
PFKNIDNPAY LNFTTDNGRV LYGEDIFVGY RYYEKLNREV AYPFGFGLSY TSFKIGDLKV
QGLDQDNIEI SVNIKNTG
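
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence maps to the protein sequence under that table. It builds the codon table by hand rather than relying on an external library, and the names `build_codon_table` and `translate` are hypothetical helpers introduced for illustration. Only the sense-codon difference (CTG to Ser) is modelled; start-codon rules are not.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), amino acids in TCAG codon order.
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(amino_acids, overrides=None):
    """Map each of the 64 codons to an amino acid, then apply overrides."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, amino_acids))
    table.update(overrides or {})
    return table


TABLE_1 = build_codon_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = build_codon_table(STANDARD_AAS, {"CTG": "S"})


def translate(dna, table):
    """Translate a DNA string codon by codon; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(table[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First line of the gene sequence in this record.
first_line = ("ATGACTGCTT TTGACATTGA GGGAATATTG "
              "AGTCAATTGA CGTTAGAGGA AAAAATAGGA")
print(translate(first_line, TABLE_12))  # prints "MTAFDIEGILSQLTLEEKIG"

# A fragment of the gene sequence that contains a CTG codon.
ctg_fragment = "GAAATTCTGCACATGGTT"
print(translate(ctg_fragment, TABLE_12))  # prints "EISHMV"
print(translate(ctg_fragment, TABLE_1))   # prints "EILHMV"
```

The first translated line matches the first 20 residues of the protein sequence above, and the CTG fragment reproduces the serine in "EISHMV" only under table 12; the standard code would give leucine at that position.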