Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_77156 |
Symbol | BHA1 |
ID | 4838110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1144330 |
End bp | 1147773 |
Gene Length | 3444 bp |
Protein Length | 1010 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389425 |
Product | glycosyl hyrolase family 3-like protein |
Protein accession | XP_001383504 |
Protein GI | 150864611 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.354705 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCTTTAGGTT TGTTTTTTAT TCCAAGCGTC GCTCTTTGCA GGGCCCTCTC CATCGACCCG TAACTATTCA CGCTGCAAGA TGAGCCCGCT GAATCTTCCG CCGTTCCATG TGGGACAGCT TCTCTGCGGA GGCTTTCAGG GAACCACGGT CACACCGCAA GCGTACCATT TGATCGTCGA CCATCACGTC TCGCTGATGA TTTTGTCCCG CAAGAATGCC TTGCTGGCAC AACAGATGCT GAAGTTGATC AGAGATTTGC AGTATATCGC TTTTTCGCAG GGCCATTACC AATATCCAAT CATGTTTGCT ATAGACGAAG AAGGAGGCAT GATGAACTCG CTTTTTGATC CAGATTTCTT GACCCAATGC CCAGGAGCCA TGGCTCTTGC TGCTACAGGA GATACAGAAC TTGTGTACGA GCTTCTGAAG GCAATTGCTA TCGAGTTGAA GAACATTGGT TTCCTGATTA TATTGGGTCC CGTGTTAGAC GTTGTCACCA AGCTCTCACA TCAGTTGGTA GGAGTCCGCA GCTTTGGAAC TACCATCGAA GACGTGTCTA AATACAGTCA GGCCTGTGCT AAAGGATTGC AAGAGGGTGG TTTGTTCACT GTAGGGAAGC ATTTCCCTGG AATCGGTAAC GCTACCGTAG ATAGTCTTCT CGAATTGCCC ATGATTGTAG ATTCATTGGA CCAGATTAAG CACTTCAACA GCTTGCCATT TGCCAAACTC ATCGAGCAGA ATCTTCTCGA CGGAATCAGT GCTGCAGGAT GTGGAGTTCC GACAATCTCT CCAGACGAAA CCCACGCCTG CTTGTCGCCC GTAGTCATAA ACCAGTTGCT TCGTCAAGAT TTAAAGTTCA AAGGTTTTGT AATCTCTGAG TGTTTGGAAA TGGATGCTTT GTACCATCTG ATCGGTTTGG GCCAAGGTGT CATTCTTGCC ATCTCTGCTG GTTGTGATCT AGTCATGGTG TGTCATGACA TGGCTCTTCA GAATGAAGCT GTGGAATGCC TTGAGAAAGC CATAGCCAAT GGCAATCTCG ATGATGAAAT CATCCTTGCA AGCTTAAATA GAATAGAGCG CTTGCAGAAG CGATTGCCTA AATGGTCACA ACTTTTCCCT AGAGGTGAAA TTTCAGCCAA GGAAGATGAG ATCAAGTTGT TCAAATACGA GCATCCTGAG TTGTGGGAGA AACATCAGAA ATTGGCCTCG CTAGCCTATC AGAAATCTAT CACTCTTGTT AGAGACTATA ATCATACTCT ACCCATCTCA AAGTTCTTGT CTTCCAGTGA GGACGATAAG AAAATTGATC ACATTCTCAT ATTGTCACCT TTGCTTAATC CTATTTATCC ATCCACTAAA CTGCACAGCA AAGACGACCA AACTCAACAG CTCTACACCG GAGAAGAAGT ATTTCAGAAG TTTGGCGATT TGCTTGCTAA TAATTCGTTG AGTAAAACCA AATCCTACAA CGTGTTACAC ACTACATACA CAGCTAACGG ATTGACTCAG CTTCACGAAC TGCTCATTGA AAAATCGAAA GTCGTCATTG TCTTAACTTC CGAAGCTTCC AGAAATATGT ACCAAATCGG AATAGTCAAA TATGTATCGA TTTTGTGCGG AGCGAACCCT GCTTCTTTCA ACAACTCGGG TGCTACGTAC TTTCAATTGG CAAAGCCTCT AATCATAGTA GCTACTCTGT CTCCATACGA CTTTTTCTAT AACAAGACGA TGGGCAGTGC CTATTTATGC TGCTATGATT ACACGAACAG TGCTCTTGAA AAGCTTGCTG GGGTTCTCAT GGGTGACTTT GAACCAGAAG GCTGTATTCC AGGCGAGAAG AAATTCATAG GGAAGTCTAA GAAGAGGAAA TCAACAGGAT CAGTGGAAGG AGTGAGAATG GAAAAACCAC TTCTGATGAA AAAGATCAAG AGCTCTACAC CCAAGAGAAG ATGGTTAGTC GACGAGTTTG ACTTGAACCG TGACTGGACT GGGCTCCAAA AACTCTGGAA AAACAATACA GTAGAATCAG ATATGGCGAC TGGAACGAAT CACAACAAAA TTGACTATTC AGTGCCCGAC TTCTACAAGA GACTCTACGG ACTATTGGCG ACTACTGCCA AGTCTCAGAA ACATTTTGTG GTCAGAAACT CTTCTCTCAA TATATTATAC GGTGTAGTTT TAACCTGGGT CGATGAAAAC TTACCTCTTG ATGGTGACTT GACCTCAGAA GAACAGATTA GAGGCTCGAT ACTCTATATC TTGGTGGACA AGTCCAGAAG ATTGCAGAGT ATTGGGAAAA ACCTCCATGC CAGAGCTATT CGGTATCTTT TGAAAGAGAG GAAATGCTCT ACCATCACAC TTGGATCATC TTTTCCGTTG TTTGTGTTTC CCGAGAACTC TAACATTTCT AACAATCGTA GCAACTCCAA GATATCTACG TTTATGCAGA GTATTGGCTG GGATGTGAAC GTCACAAAGT CAGCGAAGAA GTATGTAATG CAACTAGGAG ACTTGGACAA CTGGCTGGTC CCAAAGAAAA TATTCAGAGA GTTGATGATC GTTGGTGTCA GGTTCGATAT ATGTAGCGAT CCTGAGAAGC TCATGAAGCT CATTGCTCGG TCAACAAAGG AAAATGAAAA TTCCGACGAT AATAAAGGCA TCAAGGGGCT TTATTTGGAG GCTGTCAAAC ATTTGGGAAA TACCTCTCCC TATGGTACCA AGATCATTAT TGCATTGGAG CCTACGAACC AAAACGTAAT TGGGAGCATT GTTCTATTCA CAAACAAGTC GCAGTTGTCT AAATTCTTCC CATTCATTGA CGAATTAAAG GCAGATGACG AAGGAGTAAT TGGAGGAATT ATTGGACCAA TTATAGATCC ACTGTATTCA AACTTGACGG AAATCTTCAA ATATGGATTG ATCTGCAGTG GAATCACATT CCTTAAATCC AATTTGAATG ACGGAGACAC CACAATGAAC CAATGCATGA TGCTAGATGT TAATGATGAC AAATCGCTCA CAGGTATAAA GGAGATTGGA TTCTCCGAGT GGAAATATTA TTACGATTAC TATGACAAGA AAAACAACGC CGAAAAGGCA TTTCTTGATT GAACCTTTAT GGTTTACCTA ATCAACTTAA TGAAGATGAT GTATTTTATG GGAACTATAC TGGACACGAT AATCGGCTGA ATTGCCCATT TAATTTCTAC AATAAGCTAG GCTCTATTTT TTACCAGTTG TTCATTGGGC CGCTGTATTT TAGGTACTAC TGTATAATAA TGTAACATAA GGGCAATCTG AATGGCCGTT CAGGCTTTAT ATATTCACTT TCAAAGCATG TTTTTTCCTG TTCGACTCTT ATTTTCCCCG TGCTCAACAT GACATAAACA GCATTGACTC TCAATATTTG TATATTTAAT TCATGAAGGC AGATATCTGT TGCC
|
Protein sequence | MSPSNLPPFH VGQLLCGGFQ GTTVTPQAYH LIVDHHVSSM ILSRKNALSA QQMSKLIRDL QYIAFSQGHY QYPIMFAIDE EGGMMNSLFD PDFLTQCPGA MALAATGDTE LVYELSKAIA IELKNIGFSI ILGPVLDVVT KLSHQLVGVR SFGTTIEDVS KYSQACAKGL QEGGLFTVGK HFPGIGNATV DSLLELPMIV DSLDQIKHFN SLPFAKLIEQ NLLDGISAAG CGVPTISPDE THACLSPVVI NQLLRQDLKF KGFVISECLE MDALYHSIGL GQGVILAISA GCDLVMVCHD MALQNEAVEC LEKAIANGNL DDEIILASLN RIERLQKRLP KWSQLFPRGE ISAKEDEIKL FKYEHPELWE KHQKLASLAY QKSITLVRDY NHTLPISKFL SSSEDDKKID HILILSPLLN PIYPSTKSHS KDDQTQQLYT GEEVFQKFGD LLANNSLSKT KSYNVLHTTY TANGLTQLHE SLIEKSKVVI VLTSEASRNM YQIGIVKYVS ILCGANPASF NNSGATYFQL AKPLIIVATS SPYDFFYNKT MGSAYLCCYD YTNSALEKLA GVLMGDFEPE GCIPGEKKFI GKSKKRKSTG SVEGVRMEKP LSMKKIKSST PKRRWLVDEF DLNRDWTGLQ KLWKNNTVES DMATGTNHNK IDYSVPDFYK RLYGLLATTA KSQKHFVVRN SSLNILYGVV LTWVDENLPL DGDLTSEEQI RGSILYILVD KSRRLQSIGK NLHARAIRYL LKERKCSTIT LGSSFPLFVF PENSNISNNR SNSKISTFMQ SIGWDVNVTK SAKKYVMQLG DLDNWSVPKK IFRELMIVGV RFDICSDPEK LMKLIARSTK ENENSDDNKG IKGLYLEAVK HLGNTSPYGT KIIIALEPTN QNVIGSIVLF TNKSQLSKFF PFIDELKADD EGVIGGIIGP IIDPSYSNLT EIFKYGLICS GITFLKSNLN DGDTTMNQCM MLDVNDDKSL TGIKEIGFSE WKYYYDYYDK KNNAEKAFLD
|
| |