Gene PICST_61452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_61452 
SymbolEXG3 
ID4839893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp77768 
End bp79288 
Gene Length1521 bp 
Protein Length506 aa 
Translation table12 
GC content44% 
IMG OID640391208 
Productglucan 1,3-beta-glucosidase 
Protein accessionXP_001385705 
Protein GI150866196 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.243086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATA AGGTCAAACT CCGCTCGAAG GGCAGTAACC CAGTAGCTGC CGTTCCTGCG 
GCTACTCCGG GTCAACCTCC AAGTACAAGA CAGATCTACC AATCGAGAAA GAATTTCGGA
GTCAACATTG GAGCTTGTTT TGTATCGGAA AAATGGATAT TCCATGAGTT GTTCGGTGAA
AACGTTCCTG AATTCGAGCT CGCAGCAGTT GAAGCTATGG TGAGAGCGAA GGGTTTAGAT
GGTGCCAAAA GTACATTTGA GAATTTCTGG TCTAATTTTA TGAATGACAA TGACTGGAGA
TGGTTGCAGG ACAACCAGGT CACTTCTGTA AGAATTCCCA TAGGATATTG GGATGTTGCC
GGTGGAAGGT TTACCAAAGG CACCCAATTT GAGAAGTATG GGTCTTCTGT CTATTCTGGA
GCTTGGAATA TATTTAAGGA AAAGTTCGTC AAGCCAGCAG GAAAACATAA TATCTCTGTA
TTGGTGGATC TTCATGGGTT ACCCGGTGGT GCTAACTCTA GCGATCACAG TGGTGAGAAG
TCTGGTGGTC TGGCAGCTTT TTGGTCAAAC GAGAAATTTC AGTTGCAGGT TGCTGAAATG
CTCACCTTTA TTGCCAGGGA TTTACAGCAG TTTGAGAACA TTTCTGGTAT TCAAGTAGTC
AACGAAGCAG AATTCGCGCA AGAGCCAGCT TCAAAGCAAA CTACTTACTA TGTAGCTGCT
CTCAATCTGA TCCGAGAAGC GGATTCAGGT ATCCCAGTGA TTATTTCTGA CGGCTGGTGG
ACAGACCAGT GGGTGAGATT CATTCAGAAA CACCAACAGA ACAACAATAG TCTAGGTTTG
ATAATCGATC ACCACGTATA CCGTTGTTTT TCTAAGGAAG ACAAGGATAA GTCTCCGATG
AGGATCATTG AAGATTTGAA CAATGATGTA TTAACTAATT TGACTGATAA TGGTAAGGGA
GTTGACATTA TGGTCGGTGA ATTCTCTTGT GTACTTGACC AACAGTCGTG GAATAAAGAT
GGTGCACAAG GCAGAAGAGA TGAGTTGGTG ATCCAGTACG GTAATAGACA ATGTGACTTA
ATTAATGAAA GAGCAGGTAT GGGCTTTTAC TTTTGGACTT ACAAGTTCCA GTCGGGAAAC
GGAGGTGAAT GGGACTTAAA GCAAATGGTG GAAAAAGGGG CTATAAGGAA TCCATTTTCC
GTCAATGGTA AGAGATTGCC TGACAGATCA ATGTTCGAAC AGGCTTACAA CCAAGCAATG
CAAGGTCATG TTGGATACTG GAGTGGAACC GATCCTGGTG GAAGATATGA ACATGAGCGA
TATGGTGAAG GGTTCACTAC TGCCTGGGCA GATGCCGAGG AATTCGCGAA GTTCAACGGG
TCTGTCTTGG GCCGGGTTGA AGCATGGAGA ATTGCACGGT TGTCGGAACA TATCAGAGCT
CGAGGTGCTC TGGGCTACTT GTGGGAATGG GAACAGGGTT TCTATGAAGG ATTGAAGCAG
TTTCATTCTA ATGTGAGATG A
 
Protein sequence
MFDKVKLRSK GSNPVAAVPA ATPGQPPSTR QIYQSRKNFG VNIGACFVSE KWIFHELFGE 
NVPEFELAAV EAMVRAKGLD GAKSTFENFW SNFMNDNDWR WLQDNQVTSV RIPIGYWDVA
GGRFTKGTQF EKYGSSVYSG AWNIFKEKFV KPAGKHNISV LVDLHGLPGG ANSSDHSGEK
SGGSAAFWSN EKFQLQVAEM LTFIARDLQQ FENISGIQVV NEAEFAQEPA SKQTTYYVAA
LNSIREADSG IPVIISDGWW TDQWVRFIQK HQQNNNSLGL IIDHHVYRCF SKEDKDKSPM
RIIEDLNNDV LTNLTDNGKG VDIMVGEFSC VLDQQSWNKD GAQGRRDELV IQYGNRQCDL
INERAGMGFY FWTYKFQSGN GGEWDLKQMV EKGAIRNPFS VNGKRLPDRS MFEQAYNQAM
QGHVGYWSGT DPGGRYEHER YGEGFTTAWA DAEEFAKFNG SVLGRVEAWR IARLSEHIRA
RGASGYLWEW EQGFYEGLKQ FHSNVR