Gene PICST_37797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_37797 
SymbolBGL4 
ID4851550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2109602 
End bp2112046 
Gene Length2445 bp 
Protein Length814 aa 
Translation table 
GC content43% 
IMG OID640393258 
Productbeta-glucosidase 
Protein accessionXP_001387646 
Protein GI126274825 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.472931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATTC CTGAAAAGGT CAATTTGACA ACTGGAACTG GCTGGGGTTC TGGTCCTTGT 
ATTGGTAACA CCGGCTCTGT TCCTCGATTG GGAATCCCCA ACTTATGTTT GCAGCACGGT
CCTAACGGTG TGAGATTTAC AGATTTTGTT ACCCATTTCC CGTCTGCCCT AGCTGCCGGT
GCCACTTTCA ACAAGGGGTT GATCTATCTT CGAGGCAAGG CCATTGGCCG AGAACATAAA
AAAAAGGGTG TACACATAGC ACTCGGACCT GTTGTCGGCC CCATTGGCCT CAAGGCTGCT
GGAGGCAGAA ATTGGGAGAG TTTTGGCGCG GATCCATACC TCCAGGGAGT TTGCGGGGCC
GCGACTGTAG AAGGAATTCA AGACGAAGGT GTGGTGGCAG TCGCGAGACA TCTAGTTGGC
AATGAGCAGG AACACTTCCG ACAGGTCGGT GAATGGGACG AAAATGGGTG GGAACATCTA
GAAACGTCCA TCAGTTCCAA TATAGGAGAC AGAGCCATGC ATGAGTTGTA TCTTTGGCCA
TTTGCCAATG CTGTTAGAGC CGGTGTAGGT GGTGTTATGT GTGCTTATAA CCAAGTCAAC
GGCACTTATA GCTGCGAAAA CTCCTACTTG CTTAATAACT TGTTAAAGGA AGAACTTGGA
TTCCAAGGCT TTGTTGTCTC CGATTGGGGA GCCCAACATA CTGGCGTATA TTCTTCACTT
GCTGGTCTTG ATATGACCAT GCCCGGTGAA GTCTTTGATG ACTGGCTAAC AGGAAAGTCT
AACTGGGGTC CATTGTTAAC GAGAGCTGTC TACAATGGTA CCCTTAGCCA GGAACGTCTA
AACGACATGG TTATGCGCAT CCTCGCACCA TTTTTTGCAG CTGATACCAT CACCCTTCCT
AGTGAAAATG ATGTTCCCAA CTTCAGTTCG TGGACATTTC ATACCTACGG ACAAGAATAC
ATGTATCAAC ACTATGGTCC CATTGTACAG CAGAATTGGC ATGTTGAAGC AAGATCAAAT
TTCAGCGACA ACACTGCCTT GAATACAGCA CGGGAAGCAA TTGTCTTGCT CAAGAATCCA
GGTCATAATC TACCGATTGC AAAAGTAGAC GGAGTCAGAC GCATATTCAT TGCAGGGATA
GGTGCTGGAG TTGACCCACG AGGGTTCAAC TGTAAGGACC AAAGGTGCGT GGACGGTGTT
TTGACTTCTG GTTGGGGTTC GTCTGCTCTC AACAATCCAT TTGTTATTAC ACCATATGAA
GCAATTGCAA AAAAGGCAAG GGATCAGGGT ATGTTGGTAG ATTTTTCAAA CGATGTGTGG
GAGTTAGATC ATGTCGAAGA ATTAGCAGAT TATTCTGATA TGTCCATAGT GGTCGTCGGT
GCTAGTTCAG GAGAAGGTTA TATTGAAGTT GATAACAATT TTGGAGATCG TAAGAACTTG
TCTCTCTGGC ATAACGGTGA TCAATTAATT GAATCTATCG CTGAAAAGTG CAAAAAAACG
GTCGTAGTAG TCAATTCTGT TGGACCAGTG AACTTGGAAA AATGGATTGA AAATGACAAT
GTTGTTGCCG TGATTTACGT TCCACCTTTA GGTCAATTTG TCGGACAGGC GATTGCAGAA
GTTTTATTTG GAGAAGTCAA CCCATCAGGA AAATTACCAT TTACAATTGC AAGAAAAAAG
CAACATTACG TTCCAATTAT TGACGAATTA GGAGACGACA GATCACCGCA AGACAACTTT
GATAGAGACA TTTACCTCGA TTATAGATTT TTTGATAAAC ATAATATCAA ACCAAGATAT
GAATTTGGCT ACGGTTTATC CTACAGCTCT TTCCTGGTCT GTGATCTAAA AATCAAAGAA
ATCAAAGCTC CCTTGGAATA CCTCCCATAT CCAGAAGAGT ACTTACCAAT TTACAAGACT
TGCGAGGATG ATATTTGTGA TCCAGAGGAT GCCTTATTCC CTCATGATGA GTTTGACCCT
GTTCCTGGTT ATATTTATCC ATATCTCTAT AATGAAAATG TCAGGACCTT AGAGGACGAC
AGCCATTTTG ATTATCCTCA TGGCTACCAT CCTGAACAGA ATTCAGTTCC TCCCTTATCA
GGAGGAGGAT TGGGTGGTAA TCCAGAGCTT TGGCAAACAT TGTATGAGGT CGATGCTGAA
GTGAAAAATG ATGGTAAATA CAGAGGAGCC TACGTCTTAC AGTTGTACTT AGAATTGCCA
AGCACAATTT TACCATCACC ACCTAGGATT TTAAGGGGGT TCGAGAAAGT GTTTCTAGAA
CCAGGTGAAA CTGCTCGAGT TTCATTCAAG CTTCTACATA GAGACCTCAG TGTTTGGGAT
ACATATTCAC AACAATGGAT TATCCAAACG GGAACATACA AGGTCTACCT TTCCTCTTCA
AGTAGGAAAG TTGAATTAAG TGGTGAGATT GACATCGGCT GTTAA
 
Protein sequence
MSIPEKVNLT TGTGWGSGPC IGNTGSVPRL GIPNLCLQHG PNGVRFTDFV THFPSALAAG 
ATFNKGLIYL RGKAIGREHK KKGVHIALGP VVGPIGLKAA GGRNWESFGA DPYLQGVCGA
ATVEGIQDEG VVAVARHLVG NEQEHFRQVG EWDENGWEHL ETSISSNIGD RAMHELYLWP
FANAVRAGVG GVMCAYNQVN GTYSCENSYL LNNLLKEELG FQGFVVSDWG AQHTGVYSSL
AGLDMTMPGE VFDDWLTGKS NWGPLLTRAV YNGTLSQERL NDMVMRILAP FFAADTITLP
SENDVPNFSS WTFHTYGQEY MYQHYGPIVQ QNWHVEARSN FSDNTALNTA REAIVLLKNP
GHNLPIAKVD GVRRIFIAGI GAGVDPRGFN CKDQRCVDGV LTSGWGSSAL NNPFVITPYE
AIAKKARDQG MLVDFSNDVW ELDHVEELAD YSDMSIVVVG ASSGEGYIEV DNNFGDRKNL
SLWHNGDQLI ESIAEKCKKT VVVVNSVGPV NLEKWIENDN VVAVIYVPPL GQFVGQAIAE
VLFGEVNPSG KLPFTIARKK QHYVPIIDEL GDDRSPQDNF DRDIYLDYRF FDKHNIKPRY
EFGYGLSYSS FLVCDLKIKE IKAPLEYLPY PEEYLPIYKT CEDDICDPED ALFPHDEFDP
VPGYIYPYLY NENVRTLEDD SHFDYPHGYH PEQNSVPPLS GGGLGGNPEL WQTLYEVDAE
VKNDGKYRGA YVLQLYLELP STILPSPPRI LRGFEKVFLE PGETARVSFK LLHRDLSVWD
TYSQQWIIQT GTYKVYLSSS SRKVELSGEI DIGC