Gene PICST_34123 details


Gene Information

Locus tag: PICST_34123
Symbol: BGL5
ID: 4850980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 623361
End bp: 625892
Gene Length: 2532 bp
Protein Length: 843 aa
Translation table:
GC content: 45%
IMG OID: 640392688
Product: beta-glucosidase
Protein accession: XP_001387350
Protein GI: 126273941
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
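
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
start_bp = 623361
end_bp = 625892
reported_gene_length = 2532     # bp
reported_protein_length = 843   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the reported 2532 bp and 843 aa.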


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0475476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGTTC AAGAATTAGA TGTCGAACGT TTGATCGAGG AATTGACTAT TCCTGAAAAG 
ATCTCCTTGT TGGCCGGTAA GGACTTCTGG CACACTGTTC CGATCGAGAG ATTGAACATT
CCTTCAATCA GAGTCTCGGA CGGTCCTAAC GGTATCAGAG GTACCAAGTT CTTCAACTCG
GTTCCTTCCA ACTGTTTCCC ATGTGGTACC GGTTTGGCTG CTACCTTCAA CAAGGACTTG
TGGGTTGAAG CTGGTGAGTT GATGGGTAAG GAAGCCAAGA TGAAGGGTGC CCATGTGATC
TTAGGTCCTA CCTCGAACAT CGTTCGTTCT CCATTGGGAG GAAGAGCCTT CGAATCGTAC
TCCGAGGACC CACTTTTGTC TGGACACGCT GCCGCAAACA TCATCAAGGG TATTCAAAAC
GAAAATGTCG TTGCATGTCT TAAGCACTTT GTCTGTAATG ATCAAGAAGA CGACAGAAGA
GGTGTCGACA CCTTGTTGAC CACCAGAGCT TTCAGAGAAA TCTACTTGAA GCCTTTCCAC
ATTGCTTTAA GAGACGCTGA CCCTGGTGCC TTGATGACTG CTTACAACAA GATCAACGGT
ATCCATGTTT CGGAATCCAA GGAAATCTTG CAGGGAATCT TGAGAGACGA GTATAAGTAC
GAAGGTGCTA CAATGTCCGA CTGGTTCGGA ATCTACTCTA CCAAGACTGC TTTGGAAGCC
GGTTTGAACT TGGAAATGCC TGGTCCAACC AGATTCAGAT TGCCAATCCA AACTCTCCAT
GAAGTTCAAG CTAACAGAAT CCACACCAAG ACCATTGACG ATAACGTTCG TTACGTCTTG
AAGTTGATCA ACAGAGCTTT GAAAGCCGAT ATTCCTCATG ATGTTGTTGA GTCTGCCAAC
GAGGACCCTG CTGCTTCTGA AATCTTGAGA AAGGTGGGTG ATGAGTCTAT CGTTTTGTTG
AAGAACGAAG GCAACATCTT GCCTTTGTCC AAGACTTCTG TTGCTGGTCA AGAAAAGATC
GCTGTCATTG GTCCAAACGC CAAGGCTGCT CAAGACTCTG GTGGTGGATC TGCTTCTCTT
ACTGCTCGTT ACAAGGTTAC CCCATGGGAA GGTATCAAGA AGAAGATCGA GGAAGGTGGA
AACACTGTTT CTTTGGAATA TTCCTTGGGT GCTTTCTTAG ATAAGAACTT GCCAGATGTT
GCAGACATCT TAGAAAACGA AAAAGGTGAA AAGGGTGTCA CTGCTAAGTT CTTCAAGAAT
GCTCCAGGCA CCAAGGACAG ACAACAGTTT GCTGAATACT TGCTTCCAAC CTCTAAACTC
TTCCTTTCTG ACTTCACTGA CCCAGGTTTG GAATTAGGCG AATTGTTGTT CTACGCTGAT
TTCGAAGGTT ACTTCACTCC AGAGGAAACT GCTGACTACG ACTTCGGAGC TTCTTGTTTG
GGTACTGCTC AAGTCTTTGT TGACGGTAAG TTGGTTGCTG ACAACAAGAC CAAGCAAACA
AAGGGTGATG CCTTCTTCTT AGGTTTAGGT ACCAGAGAAG AAAGAGGTAC TGTCCATTTG
GAAAAGGGTA AGAAGTACCA TGTTAAGTGT GAGTTTGGTA CCAGTCCCAC CTACACTTTG
GAAGCATCTC AAGAAATCGG TGGTGTCTTC TTCGGTTTCA GAATCAACTC TCCAGCTGAA
ATCGAAATCA CCAAGGCTGT TGAACTCGCC AAGTCTGTTG ACAAGGTCGT CTTGGTTGTT
GGTCTCTCCA AAGAATGGGA ATCTGAAGGT TTCGACAGAC CAGACATGGA CATTCCAGGT
GCCACTAACC AGTTGATTGA AGAAGTCTTG AAGGTCAACA AGAATGTCGT CGTCGTTAAC
CAATCTGGTT CTCCAGTGAC TATGCCATGG GTTGACCAAG TTCCAGCTTT GGTCCACGCT
TGGTATGGTG GTAACGAATT GGGTAACACC ATTGCTGATG TATTGTTTGG TGATGTCAAC
CCATCTGGTA AGTTGTCTAT GTCTTTCCCA AAGAAGCTTG AAGACAACCC ATCTTACCTT
AACTTCGGTT CCATCAACGG TCAGGTCTGG TATGGTGAAG ACATCTTTGT AGGATATAGA
TACTACGAGA AGGTCAAGAA GGATGTCTTA TTCCCATTCG GTTTCGGTTT ATCCTACACC
ACTTTCGACT TCAAGGACTT GTCTGTTGCA GCCGATGACG AAAACGTTAC TGTTAGCGTC
AAGGTCACCA ACACCGGTTC TGTAGATGGT TCTGAGACAG TTCAAGTCTA CATTGAACAA
TCCAACCCAA GCATCATTAG ACCCGTTAAG GAATTGAAGG ATTTCGGTAA GGTCTTCTTG
AAGGCTGGTG AAACAAAATC TGTTGAAGTC AAGATCTCCA TCAAGGAAGC TACCTCCTAC
TGGAATGGCT ACCAAGACAA GTGGCAGTCA GAAAAAGATA CCTACAAGGT ATTGGTTGGT
AACAGTTCGG ACAACATCAT TCTTGAGGGT AAATTTGCTA CCTCCAAGAC TTTCTACTGG
TTGGGATTAT AG
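
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to recompute a GC percentage from such a block. The helper names are hypothetical, and the record does not state how its GC content value is calculated or rounded, so the result may differ slightly from the reported 45%.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example using only the first row of the gene sequence above;
# paste the full block to evaluate the whole gene.
first_row = "ATGGGTGTTC AAGAATTAGA TGTCGAACGT TTGATCGAGG AATTGACTAT TCCTGAAAAG"
print(len(clean_sequence(first_row)), round(gc_percent(first_row), 1))
```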
 
Protein sequence
MGVQELDVER LIEELTIPEK ISLLAGKDFW HTVPIERLNI PSIRVSDGPN GIRGTKFFNS 
VPSNCFPCGT GLAATFNKDL WVEAGELMGK EAKMKGAHVI LGPTSNIVRS PLGGRAFESY
SEDPLLSGHA AANIIKGIQN ENVVACLKHF VCNDQEDDRR GVDTLLTTRA FREIYLKPFH
IALRDADPGA LMTAYNKING IHVSESKEIL QGILRDEYKY EGATMSDWFG IYSTKTALEA
GLNLEMPGPT RFRLPIQTLH EVQANRIHTK TIDDNVRYVL KLINRALKAD IPHDVVESAN
EDPAASEILR KVGDESIVLL KNEGNILPLS KTSVAGQEKI AVIGPNAKAA QDSGGGSASL
TARYKVTPWE GIKKKIEEGG NTVSLEYSLG AFLDKNLPDV ADILENEKGE KGVTAKFFKN
APGTKDRQQF AEYLLPTSKL FLSDFTDPGL ELGELLFYAD FEGYFTPEET ADYDFGASCL
GTAQVFVDGK LVADNKTKQT KGDAFFLGLG TREERGTVHL EKGKKYHVKC EFGTSPTYTL
EASQEIGGVF FGFRINSPAE IEITKAVELA KSVDKVVLVV GLSKEWESEG FDRPDMDIPG
ATNQLIEEVL KVNKNVVVVN QSGSPVTMPW VDQVPALVHA WYGGNELGNT IADVLFGDVN
PSGKLSMSFP KKLEDNPSYL NFGSINGQVW YGEDIFVGYR YYEKVKKDVL FPFGFGLSYT
TFDFKDLSVA ADDENVTVSV KVTNTGSVDG SETVQVYIEQ SNPSIIRPVK ELKDFGKVFL
KAGETKSVEV KISIKEATSY WNGYQDKWQS EKDTYKVLVG NSSDNIILEG KFATSKTFYW
LGL