Gene PICST_39160 details

Gene Information

Locus tag: PICST_39160
Symbol: EGC1
ID: 4850994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 656267
End bp: 657718
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table:
GC content: 44%
IMG OID: 640392702
Product: Endoglucanase C (EGC) (Endo-1,4-beta-glucanase) (Cellulase C)
Protein accession: XP_001387765
Protein GI: 126273955
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
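
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 656267
END_BP = 657718
GENE_LENGTH_BP = 1452
PROTEIN_LENGTH_AA = 483


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the listed gene length (1452 bp) and protein length (483 aa) agree with the start and end coordinates.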


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.151498
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAG GATTCTTAAC CACCAAAGGC ACCAAGATCG TCGATGCCAA CGGCAAACAA 
GTTGTTCTTG TTGGTACTGC CATCGCCGGA CACTTAAACA TGGAGAATTT CATCACCGGT
TACCCTGGTC ACGAAACCGA ACATAAGAAT GTGTTGAAGA AGAAGATTGG AGAAGAAAAG
TTCAACTTCT TTTTTGACAA GTTCTACGAG TACTTCTGGA CCGAAAAGGA CGCAGACTTC
TACAAGAACG AATTGGGTTT CAACTGCTTA AGAATTCCTT TTAACTATCG TCACTTTATT
GACGAAGAGG TCGACTTGTT CAAAATTGAT CCAAAAGGTT TCGAAAGGTT GGACAGAGTT
ATCGACATTT GTTCCAAATA CGGTATTTAC ACTGTCTTGG ACTTGCATGC TACTCCAGGT
GGTCAGAACC AGGACTGGCA CGTTGACTCC GGTATCCACA AGTCCAGCTT TTTTGACTTC
AAGGTTTTCC AAGACTCAAT GGTGAACTTG TGGATTGAAC TCGCCAAGCA CTACAAGGAC
AACACATGGG TCGCTGGTTT CAACCCTTTG AACGAGCCTG CCGTTTCGCA ACATAAGAAG
TTGGTTAATT TCTACCAGAG ATTGCACGAC GAAATCAGAC CTATTGACCC TAACCATATC
TTCTTCCTTG ATGCCAACAC CTACTCCATG GACTTCAGAC AATTCCCAGC TCCAAAGGAT
TTCATCCCTA ATGCCGTCTA CTCCATCCAT GACTACTCTA CTTTCGGTTT CCCAAACATC
CAAGGTACCT TGTACACTGC CTCTGATGCT GAAAAGGAAA AGTTGAAGAG ACAATACGAC
CGTAAGGTTG AATACCATCA CGAACACAAT GTTCCCGTCT GGAACGGTGA ATTCGGTCCA
GTCTACGCTT CTAAGGAAAG AGGTGATGAA GACCCAGACA CCATCAACAG AGCTCGTTAC
CAAGTGTTGA AGGACCAATT GGCCATCTAC AAGAAGGGTG ACCCATCTGG TGACGGTACT
CCAATCTCCT GGTCCATTTG GTTGTACAAG GATATTGGTT ACCAAGGTTT GACTTACGTT
GACCCTGAAT CCAAGTGGTA CAAGGTCTTC GGTGAATTCT TATTGAAGAA GAAGAAGTTG
GGTCTCGACA GATGGGGTAA CGACATCGAC CCAGAATATA ACCAGTTGTA TGAAAATTTG
GCAAACCACA TCCTTGAAAA CGTTCCAGAG AAGTACCACC ATGCTCTCTA CCCTCACCAC
TGGACAGTAC TTGATTGGTT GTTCAGAGTT AGCAAAGACC AGTTGTTCTC TCAATACGCT
CAATACGAAT ACGCTGACTT GTTCGTTGGA CTTTCTTTTG AAGAATTGGA TGAACTCGCT
GCTTCTTTCA AGTTTGAAAA CATCAAATTG AGAGATGAAT TGAACGACAT CTTGAAGGAT
TACAAGAATT AA
 
Protein sequence
MSTGFLTTKG TKIVDANGKQ VVLVGTAIAG HLNMENFITG YPGHETEHKN VLKKKIGEEK 
FNFFFDKFYE YFWTEKDADF YKNELGFNCL RIPFNYRHFI DEEVDLFKID PKGFERLDRV
IDICSKYGIY TVLDLHATPG GQNQDWHVDS GIHKSSFFDF KVFQDSMVNL WIELAKHYKD
NTWVAGFNPL NEPAVSQHKK LVNFYQRLHD EIRPIDPNHI FFLDANTYSM DFRQFPAPKD
FIPNAVYSIH DYSTFGFPNI QGTLYTASDA EKEKLKRQYD RKVEYHHEHN VPVWNGEFGP
VYASKERGDE DPDTINRARY QVLKDQLAIY KKGDPSGDGT PISWSIWLYK DIGYQGLTYV
DPESKWYKVF GEFLLKKKKL GLDRWGNDID PEYNQLYENL ANHILENVPE KYHHALYPHH
WTVLDWLFRV SKDQLFSQYA QYEYADLFVG LSFEELDELA ASFKFENIKL RDELNDILKD
YKN
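
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the blocked gene sequence could be parsed, its GC fraction computed, and its codons translated. It uses only the first and last rows of the gene sequence as input, and it assumes the standard genetic code, since the Translation table field of this record is blank; codons could translate differently if the organism uses another table. All function and constant names are hypothetical.

```python
from itertools import product

# First and last rows of the gene sequence, copied from this record.
FIRST_ROW = "ATGTCTACAG GATTCTTAAC CACCAAAGGC ACCAAGATCG TCGATGCCAA CGGCAAACAA"
LAST_ROW = "TACAAGAATT AA"

# Standard genetic code (an assumption), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; '*' marks a stop, 'X' an unknown codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


first = clean_sequence(FIRST_ROW)
last = clean_sequence(LAST_ROW)

print(len(first))        # 60
print(translate(first))  # MSTGFLTTKGTKIVDANGKQ
print(translate(last))   # YKN*
print(f"{gc_fraction(first):.1%}")
```

The translated first row matches the first twenty residues of the protein sequence, and the last row ends in a stop codon after YKN. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.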