Gene Information |
Locus tag | PICST_39320 |
Symbol | EGC3 |
ID | 4851776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2798817 |
End bp | 2800262 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | |
GC content | 39% |
IMG OID | 640393484 |
Product | endoglucanase family 5 glycoside hydrolase |
Protein accession | XP_001387099 |
Protein GI | 126275568 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
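The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the record does not state either convention explicitly). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2798817
END_BP = 2800262
GENE_LENGTH_BP = 1446
PROTEIN_LENGTH_AA = 481


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_len = span_length(START_BP, END_BP)
prot_len = expected_protein_length(gene_len)

# Both comparisons hold for this record under the stated assumptions.
print(gene_len == GENE_LENGTH_BP, prot_len == PROTEIN_LENGTH_AA)
```

Under these assumptions the interval spans 1446 bp, which corresponds to 482 codons, or 481 residues once the stop codon is excluded.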
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.280438 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCCG GTTTCTTGAC CACTGCAGGT ACGAAGATCG TTGATGCTGA AGGAACCCCG GTCGTCCTTA AAGGGGCAGC TTTGGGCGGG CACTTGAATA TGGAGAACTT TATTACTGGT TATCCCGGTC ATGAAACCGA ACATAAGTTG GTCTTGGAGA AAAAAATAGG TAAAGAGAAG TTCGACTATT TTTTCGAAAA GTTCTACGAA TATTTCTGGA CTGAGAAGGA TGCTGAATTC TACAGAAATA AATTGGGTTT TAACTGTTTG AGAATTCCTT TCAATTATCG ACACTTCATC GACGATAATG GTGATTTGTT CAAAATTAAG GGAAAGGGCT TTGAATTGTT GGATAGAATA GTAGATATCT GTTCCCAGTA CGGAATCTAT ACTATTTTGG ATTTACACAC AACTCCTGGT GGACAGAACC AAGGTTGGCA CTCTGATTCT GCTATTCACA AGTCTCTCTT TTGGGATTTC AAGGTTTTCC AAGATTCAAT TGTTAACCTT TGGGTTGAGT TGGCCAAGCA TTACAAAGAC AATGTCTGGG TTGCTGGTTA CAATCCATTG AACGAGCCTG CCGTTTCAGA CTCTGAAAAG TTGGTCGACT TCTACAAAAG ATTGCACGAC GAAGTTAGAC CCATTGATCC CAACCACATT TTCTTCCTTG ATGGAAACAC ATATGCAATG GACTTCAGGA AATTCCCTTC GCCAGAATCC TATATTCCTA ATACAGTATA TTCAATTCAT GATTACTCTA CCTATGGTTT CCCAAATCTT GAAGGTGCAT TATACACTGG TTCAGAAGAG GAAAAGTCAA AATTAAAATC TCAATATAAC AGAAAGATCG AGTACCAAAG TGAATACAAA GTTCCTGTTT GGAATGGTGA GTTTGGACCC GTTTATGCTT CAAAGGAAAG AGGTGACAAA AATCCGGAAG TAATCAACCG GGCACGGTTC AATGTCTTGA AAGACCAATT AGAAGTCTAC AGAAAGGGAG ATCCATCAGG TGACGGCTCC CCTATTTCGT GGTCAATTTG GTTGTACAAA GATATTGGTT TCCAAGGTTT GACTTACGTC TCTCCCAAGT CAAAATGGTA TGAGGTATTT GGAGAATGGC TACTTAAGAA GAAGAAGTTG GGTTTAGATA AATGGGGCAA TGACATTGAC CCGGGTTATA ATCAATTGTA CCAAAACTTG GTAGACCATA TGGAAGCCAA TGTCCCAGAA AAGTATCATA AAGTTCTATA CCCTCATACA TGGACAATGG AGAAATATTT GGCCCGTGTT TCTAGAGATA TGCTCTTTTC ACAATACGCT CAACATGAAT ATGCTGATTT GTTCGTTGGA TTTTCTTTAG AAGAACTTGA CGAATTAGCT GCTTCTTTCA AATTTGAGAA TCTAGATCAA AGAGAGGAAT TGAATCAGAT ATTGAAAGAA TACTAG
|
Protein sequence | MSAGFLTTAG TKIVDAEGTP VVLKGAALGG HLNMENFITG YPGHETEHKL VLEKKIGKEK FDYFFEKFYE YFWTEKDAEF YRNKLGFNCL RIPFNYRHFI DDNGDLFKIK GKGFELLDRI VDICSQYGIY TILDLHTTPG GQNQGWHSDS AIHKSLFWDF KVFQDSIVNL WVELAKHYKD NVWVAGYNPL NEPAVSDSEK LVDFYKRLHD EVRPIDPNHI FFLDGNTYAM DFRKFPSPES YIPNTVYSIH DYSTYGFPNL EGALYTGSEE EKSKLKSQYN RKIEYQSEYK VPVWNGEFGP VYASKERGDK NPEVINRARF NVLKDQLEVY RKGDPSGDGS PISWSIWLYK DIGFQGLTYV SPKSKWYEVF GEWLLKKKKL GLDKWGNDID PGYNQLYQNL VDHMEANVPE KYHKVLYPHT WTMEKYLARV SRDMLFSQYA QHEYADLFVG FSLEELDELA ASFKFENLDQ REELNQILKE Y
|
| |
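The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The sketch below shows one way to strip the grouping, compute a GC fraction that can be compared with the reported GC content, and take the reverse complement, which is relevant because the gene lies on the minus strand while the sequence shown starts with ATG and so appears to be in coding orientation. The helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two blocks of the gene sequence above, used only as a short demo input.
demo = clean_sequence("ATGTCTGCCG GTTTCTTGAC")
print(len(demo), gc_fraction(demo), reverse_complement(demo))
```

To check the reported GC content, pass the full gene sequence through `clean_sequence` and `gc_fraction` in the same way.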