Gene PICST_39320 details


Gene Information

Locus tag: PICST_39320
Symbol: EGC3
ID: 4851776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2798817
End bp: 2800262
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table:
GC content: 39%
IMG OID: 640393484
Product: endoglucanase family 5 glycoside hydrolase
Protein accession: XP_001387099
Protein GI: 126275568
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
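
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(2798817, 2800262)
print(gene_length)                           # compare with "Gene Length"
print(expected_protein_length(gene_length))  # compare with "Protein Length"
```

Under these assumptions the two printed values should agree with the Gene Length and Protein Length fields of the record.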


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.280438
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCCG GTTTCTTGAC CACTGCAGGT ACGAAGATCG TTGATGCTGA AGGAACCCCG 
GTCGTCCTTA AAGGGGCAGC TTTGGGCGGG CACTTGAATA TGGAGAACTT TATTACTGGT
TATCCCGGTC ATGAAACCGA ACATAAGTTG GTCTTGGAGA AAAAAATAGG TAAAGAGAAG
TTCGACTATT TTTTCGAAAA GTTCTACGAA TATTTCTGGA CTGAGAAGGA TGCTGAATTC
TACAGAAATA AATTGGGTTT TAACTGTTTG AGAATTCCTT TCAATTATCG ACACTTCATC
GACGATAATG GTGATTTGTT CAAAATTAAG GGAAAGGGCT TTGAATTGTT GGATAGAATA
GTAGATATCT GTTCCCAGTA CGGAATCTAT ACTATTTTGG ATTTACACAC AACTCCTGGT
GGACAGAACC AAGGTTGGCA CTCTGATTCT GCTATTCACA AGTCTCTCTT TTGGGATTTC
AAGGTTTTCC AAGATTCAAT TGTTAACCTT TGGGTTGAGT TGGCCAAGCA TTACAAAGAC
AATGTCTGGG TTGCTGGTTA CAATCCATTG AACGAGCCTG CCGTTTCAGA CTCTGAAAAG
TTGGTCGACT TCTACAAAAG ATTGCACGAC GAAGTTAGAC CCATTGATCC CAACCACATT
TTCTTCCTTG ATGGAAACAC ATATGCAATG GACTTCAGGA AATTCCCTTC GCCAGAATCC
TATATTCCTA ATACAGTATA TTCAATTCAT GATTACTCTA CCTATGGTTT CCCAAATCTT
GAAGGTGCAT TATACACTGG TTCAGAAGAG GAAAAGTCAA AATTAAAATC TCAATATAAC
AGAAAGATCG AGTACCAAAG TGAATACAAA GTTCCTGTTT GGAATGGTGA GTTTGGACCC
GTTTATGCTT CAAAGGAAAG AGGTGACAAA AATCCGGAAG TAATCAACCG GGCACGGTTC
AATGTCTTGA AAGACCAATT AGAAGTCTAC AGAAAGGGAG ATCCATCAGG TGACGGCTCC
CCTATTTCGT GGTCAATTTG GTTGTACAAA GATATTGGTT TCCAAGGTTT GACTTACGTC
TCTCCCAAGT CAAAATGGTA TGAGGTATTT GGAGAATGGC TACTTAAGAA GAAGAAGTTG
GGTTTAGATA AATGGGGCAA TGACATTGAC CCGGGTTATA ATCAATTGTA CCAAAACTTG
GTAGACCATA TGGAAGCCAA TGTCCCAGAA AAGTATCATA AAGTTCTATA CCCTCATACA
TGGACAATGG AGAAATATTT GGCCCGTGTT TCTAGAGATA TGCTCTTTTC ACAATACGCT
CAACATGAAT ATGCTGATTT GTTCGTTGGA TTTTCTTTAG AAGAACTTGA CGAATTAGCT
GCTTCTTTCA AATTTGAGAA TCTAGATCAA AGAGAGGAAT TGAATCAGAT ATTGAAAGAA
TACTAG
 
Protein sequence
MSAGFLTTAG TKIVDAEGTP VVLKGAALGG HLNMENFITG YPGHETEHKL VLEKKIGKEK 
FDYFFEKFYE YFWTEKDAEF YRNKLGFNCL RIPFNYRHFI DDNGDLFKIK GKGFELLDRI
VDICSQYGIY TILDLHTTPG GQNQGWHSDS AIHKSLFWDF KVFQDSIVNL WVELAKHYKD
NVWVAGYNPL NEPAVSDSEK LVDFYKRLHD EVRPIDPNHI FFLDGNTYAM DFRKFPSPES
YIPNTVYSIH DYSTYGFPNL EGALYTGSEE EKSKLKSQYN RKIEYQSEYK VPVWNGEFGP
VYASKERGDK NPEVINRARF NVLKDQLEVY RKGDPSGDGS PISWSIWLYK DIGFQGLTYV
SPKSKWYEVF GEWLLKKKKL GLDKWGNDID PGYNQLYQNL VDHMEANVPE KYHKVLYPHT
WTMEKYLARV SRDMLFSQYA QHEYADLFVG FSLEELDELA ASFKFENLDQ REELNQILKE
Y
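
The gene and protein sequences above are related by codon translation, and the GC content field is derived from the gene sequence. The sketch below shows one way to check both. It rests on stated assumptions: the record leaves the Translation table field blank, so the standard genetic code is used by default, and an optional `overrides` argument (a hypothetical parameter) lets a reader substitute individual codons, for example CTG read as serine as in NCBI translation table 12. The helper names are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, overrides=None) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    `overrides` maps codons to residues, e.g. {"CTG": "S"}; whether such an
    override applies to this gene is an assumption, not stated in the record.
    """
    table = dict(STANDARD_TABLE)
    table.update(overrides or {})
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = table[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGTCTGCCG GTTTCTTGAC CACTGCAGGT ACGAAGATCG TTGATGCTGA AGGAACCCCG"
print(translate(first_line))  # compare with the first 20 residues of the protein
print(round(gc_content(first_line) * 100))  # GC percentage of this fragment only
```

Pasting the full gene sequence into `translate` and `gc_content` allows the same comparison against the complete protein sequence and the GC content field.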