Gene PICST_77156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_77156 
SymbolBHA1 
ID4838110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1144330 
End bp1147773 
Gene Length3444 bp 
Protein Length1010 aa 
Translation table12 
GC content42% 
IMG OID640389425 
Productglycosyl hyrolase family 3-like protein 
Protein accessionXP_001383504 
Protein GI150864611 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.354705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCTTTAGGTT TGTTTTTTAT TCCAAGCGTC GCTCTTTGCA GGGCCCTCTC CATCGACCCG 
TAACTATTCA CGCTGCAAGA TGAGCCCGCT GAATCTTCCG CCGTTCCATG TGGGACAGCT
TCTCTGCGGA GGCTTTCAGG GAACCACGGT CACACCGCAA GCGTACCATT TGATCGTCGA
CCATCACGTC TCGCTGATGA TTTTGTCCCG CAAGAATGCC TTGCTGGCAC AACAGATGCT
GAAGTTGATC AGAGATTTGC AGTATATCGC TTTTTCGCAG GGCCATTACC AATATCCAAT
CATGTTTGCT ATAGACGAAG AAGGAGGCAT GATGAACTCG CTTTTTGATC CAGATTTCTT
GACCCAATGC CCAGGAGCCA TGGCTCTTGC TGCTACAGGA GATACAGAAC TTGTGTACGA
GCTTCTGAAG GCAATTGCTA TCGAGTTGAA GAACATTGGT TTCCTGATTA TATTGGGTCC
CGTGTTAGAC GTTGTCACCA AGCTCTCACA TCAGTTGGTA GGAGTCCGCA GCTTTGGAAC
TACCATCGAA GACGTGTCTA AATACAGTCA GGCCTGTGCT AAAGGATTGC AAGAGGGTGG
TTTGTTCACT GTAGGGAAGC ATTTCCCTGG AATCGGTAAC GCTACCGTAG ATAGTCTTCT
CGAATTGCCC ATGATTGTAG ATTCATTGGA CCAGATTAAG CACTTCAACA GCTTGCCATT
TGCCAAACTC ATCGAGCAGA ATCTTCTCGA CGGAATCAGT GCTGCAGGAT GTGGAGTTCC
GACAATCTCT CCAGACGAAA CCCACGCCTG CTTGTCGCCC GTAGTCATAA ACCAGTTGCT
TCGTCAAGAT TTAAAGTTCA AAGGTTTTGT AATCTCTGAG TGTTTGGAAA TGGATGCTTT
GTACCATCTG ATCGGTTTGG GCCAAGGTGT CATTCTTGCC ATCTCTGCTG GTTGTGATCT
AGTCATGGTG TGTCATGACA TGGCTCTTCA GAATGAAGCT GTGGAATGCC TTGAGAAAGC
CATAGCCAAT GGCAATCTCG ATGATGAAAT CATCCTTGCA AGCTTAAATA GAATAGAGCG
CTTGCAGAAG CGATTGCCTA AATGGTCACA ACTTTTCCCT AGAGGTGAAA TTTCAGCCAA
GGAAGATGAG ATCAAGTTGT TCAAATACGA GCATCCTGAG TTGTGGGAGA AACATCAGAA
ATTGGCCTCG CTAGCCTATC AGAAATCTAT CACTCTTGTT AGAGACTATA ATCATACTCT
ACCCATCTCA AAGTTCTTGT CTTCCAGTGA GGACGATAAG AAAATTGATC ACATTCTCAT
ATTGTCACCT TTGCTTAATC CTATTTATCC ATCCACTAAA CTGCACAGCA AAGACGACCA
AACTCAACAG CTCTACACCG GAGAAGAAGT ATTTCAGAAG TTTGGCGATT TGCTTGCTAA
TAATTCGTTG AGTAAAACCA AATCCTACAA CGTGTTACAC ACTACATACA CAGCTAACGG
ATTGACTCAG CTTCACGAAC TGCTCATTGA AAAATCGAAA GTCGTCATTG TCTTAACTTC
CGAAGCTTCC AGAAATATGT ACCAAATCGG AATAGTCAAA TATGTATCGA TTTTGTGCGG
AGCGAACCCT GCTTCTTTCA ACAACTCGGG TGCTACGTAC TTTCAATTGG CAAAGCCTCT
AATCATAGTA GCTACTCTGT CTCCATACGA CTTTTTCTAT AACAAGACGA TGGGCAGTGC
CTATTTATGC TGCTATGATT ACACGAACAG TGCTCTTGAA AAGCTTGCTG GGGTTCTCAT
GGGTGACTTT GAACCAGAAG GCTGTATTCC AGGCGAGAAG AAATTCATAG GGAAGTCTAA
GAAGAGGAAA TCAACAGGAT CAGTGGAAGG AGTGAGAATG GAAAAACCAC TTCTGATGAA
AAAGATCAAG AGCTCTACAC CCAAGAGAAG ATGGTTAGTC GACGAGTTTG ACTTGAACCG
TGACTGGACT GGGCTCCAAA AACTCTGGAA AAACAATACA GTAGAATCAG ATATGGCGAC
TGGAACGAAT CACAACAAAA TTGACTATTC AGTGCCCGAC TTCTACAAGA GACTCTACGG
ACTATTGGCG ACTACTGCCA AGTCTCAGAA ACATTTTGTG GTCAGAAACT CTTCTCTCAA
TATATTATAC GGTGTAGTTT TAACCTGGGT CGATGAAAAC TTACCTCTTG ATGGTGACTT
GACCTCAGAA GAACAGATTA GAGGCTCGAT ACTCTATATC TTGGTGGACA AGTCCAGAAG
ATTGCAGAGT ATTGGGAAAA ACCTCCATGC CAGAGCTATT CGGTATCTTT TGAAAGAGAG
GAAATGCTCT ACCATCACAC TTGGATCATC TTTTCCGTTG TTTGTGTTTC CCGAGAACTC
TAACATTTCT AACAATCGTA GCAACTCCAA GATATCTACG TTTATGCAGA GTATTGGCTG
GGATGTGAAC GTCACAAAGT CAGCGAAGAA GTATGTAATG CAACTAGGAG ACTTGGACAA
CTGGCTGGTC CCAAAGAAAA TATTCAGAGA GTTGATGATC GTTGGTGTCA GGTTCGATAT
ATGTAGCGAT CCTGAGAAGC TCATGAAGCT CATTGCTCGG TCAACAAAGG AAAATGAAAA
TTCCGACGAT AATAAAGGCA TCAAGGGGCT TTATTTGGAG GCTGTCAAAC ATTTGGGAAA
TACCTCTCCC TATGGTACCA AGATCATTAT TGCATTGGAG CCTACGAACC AAAACGTAAT
TGGGAGCATT GTTCTATTCA CAAACAAGTC GCAGTTGTCT AAATTCTTCC CATTCATTGA
CGAATTAAAG GCAGATGACG AAGGAGTAAT TGGAGGAATT ATTGGACCAA TTATAGATCC
ACTGTATTCA AACTTGACGG AAATCTTCAA ATATGGATTG ATCTGCAGTG GAATCACATT
CCTTAAATCC AATTTGAATG ACGGAGACAC CACAATGAAC CAATGCATGA TGCTAGATGT
TAATGATGAC AAATCGCTCA CAGGTATAAA GGAGATTGGA TTCTCCGAGT GGAAATATTA
TTACGATTAC TATGACAAGA AAAACAACGC CGAAAAGGCA TTTCTTGATT GAACCTTTAT
GGTTTACCTA ATCAACTTAA TGAAGATGAT GTATTTTATG GGAACTATAC TGGACACGAT
AATCGGCTGA ATTGCCCATT TAATTTCTAC AATAAGCTAG GCTCTATTTT TTACCAGTTG
TTCATTGGGC CGCTGTATTT TAGGTACTAC TGTATAATAA TGTAACATAA GGGCAATCTG
AATGGCCGTT CAGGCTTTAT ATATTCACTT TCAAAGCATG TTTTTTCCTG TTCGACTCTT
ATTTTCCCCG TGCTCAACAT GACATAAACA GCATTGACTC TCAATATTTG TATATTTAAT
TCATGAAGGC AGATATCTGT TGCC
 
Protein sequence
MSPSNLPPFH VGQLLCGGFQ GTTVTPQAYH LIVDHHVSSM ILSRKNALSA QQMSKLIRDL 
QYIAFSQGHY QYPIMFAIDE EGGMMNSLFD PDFLTQCPGA MALAATGDTE LVYELSKAIA
IELKNIGFSI ILGPVLDVVT KLSHQLVGVR SFGTTIEDVS KYSQACAKGL QEGGLFTVGK
HFPGIGNATV DSLLELPMIV DSLDQIKHFN SLPFAKLIEQ NLLDGISAAG CGVPTISPDE
THACLSPVVI NQLLRQDLKF KGFVISECLE MDALYHSIGL GQGVILAISA GCDLVMVCHD
MALQNEAVEC LEKAIANGNL DDEIILASLN RIERLQKRLP KWSQLFPRGE ISAKEDEIKL
FKYEHPELWE KHQKLASLAY QKSITLVRDY NHTLPISKFL SSSEDDKKID HILILSPLLN
PIYPSTKSHS KDDQTQQLYT GEEVFQKFGD LLANNSLSKT KSYNVLHTTY TANGLTQLHE
SLIEKSKVVI VLTSEASRNM YQIGIVKYVS ILCGANPASF NNSGATYFQL AKPLIIVATS
SPYDFFYNKT MGSAYLCCYD YTNSALEKLA GVLMGDFEPE GCIPGEKKFI GKSKKRKSTG
SVEGVRMEKP LSMKKIKSST PKRRWLVDEF DLNRDWTGLQ KLWKNNTVES DMATGTNHNK
IDYSVPDFYK RLYGLLATTA KSQKHFVVRN SSLNILYGVV LTWVDENLPL DGDLTSEEQI
RGSILYILVD KSRRLQSIGK NLHARAIRYL LKERKCSTIT LGSSFPLFVF PENSNISNNR
SNSKISTFMQ SIGWDVNVTK SAKKYVMQLG DLDNWSVPKK IFRELMIVGV RFDICSDPEK
LMKLIARSTK ENENSDDNKG IKGLYLEAVK HLGNTSPYGT KIIIALEPTN QNVIGSIVLF
TNKSQLSKFF PFIDELKADD EGVIGGIIGP IIDPSYSNLT EIFKYGLICS GITFLKSNLN
DGDTTMNQCM MLDVNDDKSL TGIKEIGFSE WKYYYDYYDK KNNAEKAFLD