Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_39160 |
Symbol | EGC1 |
ID | 4850994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 656267 |
End bp | 657718 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | |
GC content | 44% |
IMG OID | 640392702 |
Product | Endoglucanase C (EGC) (Endo-1,4-beta-glucanase) (Cellulase C) |
Protein accession | XP_001387765 |
Protein GI | 126273955 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.151498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAG GATTCTTAAC CACCAAAGGC ACCAAGATCG TCGATGCCAA CGGCAAACAA GTTGTTCTTG TTGGTACTGC CATCGCCGGA CACTTAAACA TGGAGAATTT CATCACCGGT TACCCTGGTC ACGAAACCGA ACATAAGAAT GTGTTGAAGA AGAAGATTGG AGAAGAAAAG TTCAACTTCT TTTTTGACAA GTTCTACGAG TACTTCTGGA CCGAAAAGGA CGCAGACTTC TACAAGAACG AATTGGGTTT CAACTGCTTA AGAATTCCTT TTAACTATCG TCACTTTATT GACGAAGAGG TCGACTTGTT CAAAATTGAT CCAAAAGGTT TCGAAAGGTT GGACAGAGTT ATCGACATTT GTTCCAAATA CGGTATTTAC ACTGTCTTGG ACTTGCATGC TACTCCAGGT GGTCAGAACC AGGACTGGCA CGTTGACTCC GGTATCCACA AGTCCAGCTT TTTTGACTTC AAGGTTTTCC AAGACTCAAT GGTGAACTTG TGGATTGAAC TCGCCAAGCA CTACAAGGAC AACACATGGG TCGCTGGTTT CAACCCTTTG AACGAGCCTG CCGTTTCGCA ACATAAGAAG TTGGTTAATT TCTACCAGAG ATTGCACGAC GAAATCAGAC CTATTGACCC TAACCATATC TTCTTCCTTG ATGCCAACAC CTACTCCATG GACTTCAGAC AATTCCCAGC TCCAAAGGAT TTCATCCCTA ATGCCGTCTA CTCCATCCAT GACTACTCTA CTTTCGGTTT CCCAAACATC CAAGGTACCT TGTACACTGC CTCTGATGCT GAAAAGGAAA AGTTGAAGAG ACAATACGAC CGTAAGGTTG AATACCATCA CGAACACAAT GTTCCCGTCT GGAACGGTGA ATTCGGTCCA GTCTACGCTT CTAAGGAAAG AGGTGATGAA GACCCAGACA CCATCAACAG AGCTCGTTAC CAAGTGTTGA AGGACCAATT GGCCATCTAC AAGAAGGGTG ACCCATCTGG TGACGGTACT CCAATCTCCT GGTCCATTTG GTTGTACAAG GATATTGGTT ACCAAGGTTT GACTTACGTT GACCCTGAAT CCAAGTGGTA CAAGGTCTTC GGTGAATTCT TATTGAAGAA GAAGAAGTTG GGTCTCGACA GATGGGGTAA CGACATCGAC CCAGAATATA ACCAGTTGTA TGAAAATTTG GCAAACCACA TCCTTGAAAA CGTTCCAGAG AAGTACCACC ATGCTCTCTA CCCTCACCAC TGGACAGTAC TTGATTGGTT GTTCAGAGTT AGCAAAGACC AGTTGTTCTC TCAATACGCT CAATACGAAT ACGCTGACTT GTTCGTTGGA CTTTCTTTTG AAGAATTGGA TGAACTCGCT GCTTCTTTCA AGTTTGAAAA CATCAAATTG AGAGATGAAT TGAACGACAT CTTGAAGGAT TACAAGAATT AA
|
Protein sequence | MSTGFLTTKG TKIVDANGKQ VVLVGTAIAG HLNMENFITG YPGHETEHKN VLKKKIGEEK FNFFFDKFYE YFWTEKDADF YKNELGFNCL RIPFNYRHFI DEEVDLFKID PKGFERLDRV IDICSKYGIY TVLDLHATPG GQNQDWHVDS GIHKSSFFDF KVFQDSMVNL WIELAKHYKD NTWVAGFNPL NEPAVSQHKK LVNFYQRLHD EIRPIDPNHI FFLDANTYSM DFRQFPAPKD FIPNAVYSIH DYSTFGFPNI QGTLYTASDA EKEKLKRQYD RKVEYHHEHN VPVWNGEFGP VYASKERGDE DPDTINRARY QVLKDQLAIY KKGDPSGDGT PISWSIWLYK DIGYQGLTYV DPESKWYKVF GEFLLKKKKL GLDRWGNDID PEYNQLYENL ANHILENVPE KYHHALYPHH WTVLDWLFRV SKDQLFSQYA QYEYADLFVG LSFEELDELA ASFKFENIKL RDELNDILKD YKN
|
| |