Gene PICST_41452 details


Gene Information

Locus tag: PICST_41452
Symbol: BGL2
ID: 4837167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2707875
End bp: 2710430
Gene Length: 2556 bp
Protein Length: 851 aa
Translation table: 12
GC content: 44%
IMG OID: 640388482
Product: beta-glucosidase
Protein accession: XP_001383273
Protein GI: 126133496
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
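
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself: it assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2707875
end_bp = 2710430

# Assumption: coordinates are 1-based and inclusive at both ends,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2556 0 851
```

Under these assumptions the result agrees with the reported gene length of 2556 bp and protein length of 851 aa.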


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCCAA GTGTTAAACA ACCTGTTCCT AAGGAGCTTG ATATCGAGTA CTTGATCGAG 
CAATTGACCA TTGAAGAAAA GGTCTCTTTG TTGGCCGGTA AGGACTTCTG GCACACCCAG
AACATCGACA GATTGAACAT CCCCAGTGTC AGAGTGTCAG ATGGCCCTAA CGGTATCAGA
GGTACCAAGT TTTTCAACTC TGTTCCTTCC AACTGTTTCC CATGTGGTAC TGGTTTGGCT
GCCACCTTCA ACAAGGAAGT TTTGCTCCAA GCCGGTGAGT TGATGGGTAA GGAAGCGAAG
ATGAAGGGAG CCCACGTTAT CTTGGGTCCA ACGTGTAACA TTGTTCGTTC TCCATTGGGA
GGAAGAGCGT TCGAATCGTA CTCTGAAGAT CCAGTTTTGT CTGGACATGC TGCTGCCAAT
GTTGTCAAGG GTATCCAGAA CCAAAATGTT GTTGCCTGTC TCAAGCATTT CGTTGCCAAT
GACCAAGAAC ACGAAAGAAA GGCCGTGGAC GAAATCATGA CAGAAAGAGC TTTGAGAGAA
ATCTACTTGA AGCCTTTCCA CATCGCCATG AGAGATGCCT ATCCAAAAGC CTTAATGACT
GCTTACAACA AGATCAACGG TGTCCATGTT TCGCAGAACA AGAAGATTTT GCAAGATCTC
TTGAGAGGTG AGTGGGGCTA TACAGGTACT GTCATGTCTG ACTGGCACGG TGTCTACTCT
ACCAAGGAAT CGTTGGATGC TGGCTTAAAC TTGGAAATGC CTGGTCCAAC TAGATTCAGA
CAGCAAGTAC CAACACTTCA TGCCATCCAA ACCAATGAAA TTCACACTGA TGTCATTGAT
GACAATGCTC GTGCAATCTT GAGATTGGTC AACGAAAGTT TGAAGGCTGG AATCCCAGAC
GATGTCATTG AGTCACCCAA CCCTACTAAA GAAGCATCTG ACTTGTTGAG AAAGGCTGGT
GACGAGTCAA TTGTCTTGTT GAAGAACGAA AATAACATCT TGCCATTATC TAAGACTGCT
GTCAAGGGCC AAGAGAAAAT TGCAGTTATC GGTCCAAATG CTAAGGCTGC CCAAGATTCT
GGTGGTGGAT CTGCTTCGTT AAACGCTGCC TACAAGATCA CTCCATACGA AGGTATCGAG
TCCAAAATCA TTGAAGGTGG AAACTCTGTA TCTTTGGATT ACTCCTTAGG TGCTTTTTTG
GACAGAAACT TACCCGATGT TGGGAACACT TTGATCAACG AAGAAGGTAA GAAGGGTATC
ACTGCCAAGT TCTACAAGCA AGCTCCAGGA GCCGCTGACA GAGAACACTT CGAAACTTTC
ACTTTGTCTA CTTCCAAAAT CTTCCTTTCT GACTACAAAA GTAAACACTT GAAACCAGGA
CAACTCTTGT TCTACGCTGA TTTCCATGGA ATCTATATTC CCGATGAAAC TGGTGACTAT
GAGTTTGGAG CTTCCTGTTT GGGTACTGCC CAACTTTTTG TTGATGACGA ATTGGTTGTT
GACAACAAGA CCAAGCAAGT GAAAGGTGAT GCTTTCTTCT TGGGTTTGGG TACCAGAGAA
GAAAGAGGTG TCAAGAAATT GGAGAAGGGC AAAAAGTACA ACATCAGAGT TGAGTTTGGT
TCTTCGCCTA CTTTCACCTT GAACAAGGCA GCTCTTGAAG GGGGAGGTGT CTTTTTCGGT
ATCAGAATGA TTTCTACTGC TGAAGCTGCA ATTGCTAAGG CAGTTGCTGT GGCCAAGGAA
GCTGACAAGG TTATCTTGGT TGTTGGTATC TCAAAGGAAT GGGAATCTGA AGGTTTCGAC
AGACCTACTA TGGATATCCC AGGTGCTACC AATGAATTAG TGGATGCCAT TACTGCCGTC
AACAAGAATG TCATTGTTGT CAACCAGTCG GGCTCTCCTG TGACCCTTCC ATGGATCAAC
AAAGTACAGG GTTTTGTCCA AGCCTGGTAC GGTGGTAATG AATTGGGTAA CACCATTGCC
GATGTGTTGT TTGGTGACTA CAATCCCTCT GGTAAGTTGT CTATGACTTT CCCTAAGAGA
CTTCAAGACA ATCCTTCGTA CTTGAACTTT GCTTCAACAC ATGGGCAAGT ATTATACGGT
GAAGATATCT ATGTTGGCTA TAGATACTAC GAGAAGGTCG GTGTTGAACC ATTGTTCCCA
TTCGGCTACG GTTTGTCCTA CACTACCTTC GAGCTCAAAG ACTTAGTAGT AGAGTATGAC
CAAGAAATTA TCAACGCCAA GGTCAGTGTC GTTAATACTG GTAAGGTGGA TGGTGCTGAA
GTTGTTCAAT TGTACGTTTC TCAGGTCAAC CCAAGCATTA ACAGACCAGT GAAAGAATTG
AAGGACTTCG GAAAAGTCTT CGTCAAGGCT GGCGAAACCA AGACACTTGA GTTGAGTGTT
TCCGTCAAGG AAGCCACTTC ATTCTGGAAC GGGTACAAGA ACAAATGGCA ATCTGAAAAG
GGCAAGTATA AAATTTCTGT TGGTAACAGT TCTGACAATA TCACTCTCGA AGACGAGTTT
GAAACCTCCA AGACTTACTT CTGGTTAGGT TTATAG
 
Protein sequence
MTPSVKQPVP KELDIEYLIE QLTIEEKVSL LAGKDFWHTQ NIDRLNIPSV RVSDGPNGIR 
GTKFFNSVPS NCFPCGTGLA ATFNKEVLLQ AGELMGKEAK MKGAHVILGP TCNIVRSPLG
GRAFESYSED PVLSGHAAAN VVKGIQNQNV VACLKHFVAN DQEHERKAVD EIMTERALRE
IYLKPFHIAM RDAYPKALMT AYNKINGVHV SQNKKILQDL LRGEWGYTGT VMSDWHGVYS
TKESLDAGLN LEMPGPTRFR QQVPTLHAIQ TNEIHTDVID DNARAILRLV NESLKAGIPD
DVIESPNPTK EASDLLRKAG DESIVLLKNE NNILPLSKTA VKGQEKIAVI GPNAKAAQDS
GGGSASLNAA YKITPYEGIE SKIIEGGNSV SLDYSLGAFL DRNLPDVGNT LINEEGKKGI
TAKFYKQAPG AADREHFETF TLSTSKIFLS DYKSKHLKPG QLLFYADFHG IYIPDETGDY
EFGASCLGTA QLFVDDELVV DNKTKQVKGD AFFLGLGTRE ERGVKKLEKG KKYNIRVEFG
SSPTFTLNKA ALEGGGVFFG IRMISTAEAA IAKAVAVAKE ADKVILVVGI SKEWESEGFD
RPTMDIPGAT NELVDAITAV NKNVIVVNQS GSPVTLPWIN KVQGFVQAWY GGNELGNTIA
DVLFGDYNPS GKLSMTFPKR LQDNPSYLNF ASTHGQVLYG EDIYVGYRYY EKVGVEPLFP
FGYGLSYTTF ELKDLVVEYD QEIINAKVSV VNTGKVDGAE VVQLYVSQVN PSINRPVKEL
KDFGKVFVKA GETKTLELSV SVKEATSFWN GYKNKWQSEK GKYKISVGNS SDNITLEDEF
ETSKTYFWLG L
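
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The following is a rough sketch of how the gene sequence could be translated under that table. The function names `build_table_12` and `translate` are hypothetical helpers written for this illustration. The codon table is built from the standard code with the single CTG change, and the sequence is assumed to contain only unambiguous A, C, G and T bases.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table_12():
    """Return a codon-to-amino-acid map for translation table 12."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    # Table 12 differs from the standard code in reading CTG as serine.
    table["CTG"] = "S"
    return table


def translate(dna, table):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the listing above) is removed.
    Assumption: only unambiguous A/C/G/T bases occur; anything else raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


TABLE_12 = build_table_12()

# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGACTCCAA GTGTTAAACA ACCTGTTCCT AAGGAGCTTG ATATCGAGTA CTTGATCGAG"
print(translate(first_line, TABLE_12))  # prints: MTPSVKQPVPKELDIEYLIE
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would give the complete translation for comparison.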