Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_37797 |
Symbol | BGL4 |
ID | 4851550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2109602 |
End bp | 2112046 |
Gene Length | 2445 bp |
Protein Length | 814 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393258 |
Product | beta-glucosidase |
Protein accession | XP_001387646 |
Protein GI | 126274825 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.472931 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATTC CTGAAAAGGT CAATTTGACA ACTGGAACTG GCTGGGGTTC TGGTCCTTGT ATTGGTAACA CCGGCTCTGT TCCTCGATTG GGAATCCCCA ACTTATGTTT GCAGCACGGT CCTAACGGTG TGAGATTTAC AGATTTTGTT ACCCATTTCC CGTCTGCCCT AGCTGCCGGT GCCACTTTCA ACAAGGGGTT GATCTATCTT CGAGGCAAGG CCATTGGCCG AGAACATAAA AAAAAGGGTG TACACATAGC ACTCGGACCT GTTGTCGGCC CCATTGGCCT CAAGGCTGCT GGAGGCAGAA ATTGGGAGAG TTTTGGCGCG GATCCATACC TCCAGGGAGT TTGCGGGGCC GCGACTGTAG AAGGAATTCA AGACGAAGGT GTGGTGGCAG TCGCGAGACA TCTAGTTGGC AATGAGCAGG AACACTTCCG ACAGGTCGGT GAATGGGACG AAAATGGGTG GGAACATCTA GAAACGTCCA TCAGTTCCAA TATAGGAGAC AGAGCCATGC ATGAGTTGTA TCTTTGGCCA TTTGCCAATG CTGTTAGAGC CGGTGTAGGT GGTGTTATGT GTGCTTATAA CCAAGTCAAC GGCACTTATA GCTGCGAAAA CTCCTACTTG CTTAATAACT TGTTAAAGGA AGAACTTGGA TTCCAAGGCT TTGTTGTCTC CGATTGGGGA GCCCAACATA CTGGCGTATA TTCTTCACTT GCTGGTCTTG ATATGACCAT GCCCGGTGAA GTCTTTGATG ACTGGCTAAC AGGAAAGTCT AACTGGGGTC CATTGTTAAC GAGAGCTGTC TACAATGGTA CCCTTAGCCA GGAACGTCTA AACGACATGG TTATGCGCAT CCTCGCACCA TTTTTTGCAG CTGATACCAT CACCCTTCCT AGTGAAAATG ATGTTCCCAA CTTCAGTTCG TGGACATTTC ATACCTACGG ACAAGAATAC ATGTATCAAC ACTATGGTCC CATTGTACAG CAGAATTGGC ATGTTGAAGC AAGATCAAAT TTCAGCGACA ACACTGCCTT GAATACAGCA CGGGAAGCAA TTGTCTTGCT CAAGAATCCA GGTCATAATC TACCGATTGC AAAAGTAGAC GGAGTCAGAC GCATATTCAT TGCAGGGATA GGTGCTGGAG TTGACCCACG AGGGTTCAAC TGTAAGGACC AAAGGTGCGT GGACGGTGTT TTGACTTCTG GTTGGGGTTC GTCTGCTCTC AACAATCCAT TTGTTATTAC ACCATATGAA GCAATTGCAA AAAAGGCAAG GGATCAGGGT ATGTTGGTAG ATTTTTCAAA CGATGTGTGG GAGTTAGATC ATGTCGAAGA ATTAGCAGAT TATTCTGATA TGTCCATAGT GGTCGTCGGT GCTAGTTCAG GAGAAGGTTA TATTGAAGTT GATAACAATT TTGGAGATCG TAAGAACTTG TCTCTCTGGC ATAACGGTGA TCAATTAATT GAATCTATCG CTGAAAAGTG CAAAAAAACG GTCGTAGTAG TCAATTCTGT TGGACCAGTG AACTTGGAAA AATGGATTGA AAATGACAAT GTTGTTGCCG TGATTTACGT TCCACCTTTA GGTCAATTTG TCGGACAGGC GATTGCAGAA GTTTTATTTG GAGAAGTCAA CCCATCAGGA AAATTACCAT TTACAATTGC AAGAAAAAAG CAACATTACG TTCCAATTAT TGACGAATTA GGAGACGACA GATCACCGCA AGACAACTTT GATAGAGACA TTTACCTCGA TTATAGATTT TTTGATAAAC ATAATATCAA ACCAAGATAT GAATTTGGCT ACGGTTTATC CTACAGCTCT TTCCTGGTCT GTGATCTAAA AATCAAAGAA ATCAAAGCTC CCTTGGAATA CCTCCCATAT CCAGAAGAGT ACTTACCAAT TTACAAGACT TGCGAGGATG ATATTTGTGA TCCAGAGGAT GCCTTATTCC CTCATGATGA GTTTGACCCT GTTCCTGGTT ATATTTATCC ATATCTCTAT AATGAAAATG TCAGGACCTT AGAGGACGAC AGCCATTTTG ATTATCCTCA TGGCTACCAT CCTGAACAGA ATTCAGTTCC TCCCTTATCA GGAGGAGGAT TGGGTGGTAA TCCAGAGCTT TGGCAAACAT TGTATGAGGT CGATGCTGAA GTGAAAAATG ATGGTAAATA CAGAGGAGCC TACGTCTTAC AGTTGTACTT AGAATTGCCA AGCACAATTT TACCATCACC ACCTAGGATT TTAAGGGGGT TCGAGAAAGT GTTTCTAGAA CCAGGTGAAA CTGCTCGAGT TTCATTCAAG CTTCTACATA GAGACCTCAG TGTTTGGGAT ACATATTCAC AACAATGGAT TATCCAAACG GGAACATACA AGGTCTACCT TTCCTCTTCA AGTAGGAAAG TTGAATTAAG TGGTGAGATT GACATCGGCT GTTAA
|
Protein sequence | MSIPEKVNLT TGTGWGSGPC IGNTGSVPRL GIPNLCLQHG PNGVRFTDFV THFPSALAAG ATFNKGLIYL RGKAIGREHK KKGVHIALGP VVGPIGLKAA GGRNWESFGA DPYLQGVCGA ATVEGIQDEG VVAVARHLVG NEQEHFRQVG EWDENGWEHL ETSISSNIGD RAMHELYLWP FANAVRAGVG GVMCAYNQVN GTYSCENSYL LNNLLKEELG FQGFVVSDWG AQHTGVYSSL AGLDMTMPGE VFDDWLTGKS NWGPLLTRAV YNGTLSQERL NDMVMRILAP FFAADTITLP SENDVPNFSS WTFHTYGQEY MYQHYGPIVQ QNWHVEARSN FSDNTALNTA REAIVLLKNP GHNLPIAKVD GVRRIFIAGI GAGVDPRGFN CKDQRCVDGV LTSGWGSSAL NNPFVITPYE AIAKKARDQG MLVDFSNDVW ELDHVEELAD YSDMSIVVVG ASSGEGYIEV DNNFGDRKNL SLWHNGDQLI ESIAEKCKKT VVVVNSVGPV NLEKWIENDN VVAVIYVPPL GQFVGQAIAE VLFGEVNPSG KLPFTIARKK QHYVPIIDEL GDDRSPQDNF DRDIYLDYRF FDKHNIKPRY EFGYGLSYSS FLVCDLKIKE IKAPLEYLPY PEEYLPIYKT CEDDICDPED ALFPHDEFDP VPGYIYPYLY NENVRTLEDD SHFDYPHGYH PEQNSVPPLS GGGLGGNPEL WQTLYEVDAE VKNDGKYRGA YVLQLYLELP STILPSPPRI LRGFEKVFLE PGETARVSFK LLHRDLSVWD TYSQQWIIQT GTYKVYLSSS SRKVELSGEI DIGC
|
| |