Gene Information |
Locus tag | PICST_78873 |
Symbol | EXG2 |
ID | 4839981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 383879 |
End bp | 386317 |
Gene Length | 2439 bp |
Protein Length | 438 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391296 |
Product | Glucan 1,3-beta-glucosidase precursor (Exo-1,3-beta-glucanase) |
Protein accession | XP_001385760 |
Protein GI | 150866234 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
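The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length), and it compares the result with the number of nucleotides needed to encode the 438 aa protein plus one stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 383879
end_bp = 386317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Nucleotides needed to encode the protein, plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)                  # 2439
print(cds_length_bp)                   # 1317
print(gene_length_bp - cds_length_bp)  # 1122
```

Under that assumption the listed span is 1122 bp longer than the coding sequence alone, which suggests the Gene Length and Gene sequence fields include sequence flanking the open reading frame.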
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.161777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CCACCATGAC TCCGTTATTG TAAGGACAAG AAGCGACACT GGCACATCTC TAGCCCCTTC CAGACTTGTA CATGAACAAG CATAGAGATA CAATTGGAAA TGATCCTTTG TCTCTAGTGG CTGGTTGCCA GTTCGCAGCA CACGAAAAAT AAAAAGCCAA CTTATCAAGC CTAAAATTTC TTTGTGTTGA ATAAATCCAA CGCACACTCG CACCTTTTTT TTTTACCGTG TGGAACATTC TCGGATTCCT ACCCAAGCAA CACCGATTGA AGGAGTGCCG TCGCCGGCTT GAATTTCACT TGAAAACAGG TGCACACTCG CGTTCTAGAA ACAACAACCG AAGATTGTCA TCTCCTGTTC ATATAAATAT ATAAGCACCC CCCGTCCAAC GTGGAAGAAC CCTTGTTCCA CTTTTTCTCA CTGTTCGTTT ATATTCATAG ATTCAAATCC GTATACGGTA CACTACTAAG GCCCAGCCCT CTTCTCATTT CATCTTCAGC CTTGTCGTGA GAACTCTATA GTTGCAACCT TCGACATCGC CATAGCATTT CCTTGTTTCA TCCACTTCGT TTCAGCCGTA TTAATAGCCG CTGAGTAGAC AAACCTGGCT CCTCTCTTTT CACTTGCAGC ATACAATTCC AGCTCCAGGA ATATCATAGC TCGTTGACAA GACCGCAAAA TTATCGTCAC CTAGTGTTGG ACCGAACTCC CGTACGATTT CATCAATAGC TCCTTGCTGC ATCAATTGTC GTAGATTGAT TCACAATTCT CGGATTCAAT AAATTCTCCT AAAATAACAG GATTAATTTA GAATATTATC AGAATATTAT TTCCCGATAA ATTTATTTTC TGGTGCATAC GTCATAAGGT GCGAACATAA CCTCCTTGCA CATTAACCTT TTGACACCTG TACAGCAATA TGGTACAACT TACATCCATT GTCTCCTCCA TTTTGGTGTT GTCACAATCT TTACTTGTGG CATCCGCCTC CATCAACAAC CCGCTCTTGG ATAACAATAA CAACTTGAAG AAGTTGACGA AGAAGGGTGC CTCCTGGGAC TACCAAAATG ATGTCATCAG AGGGGTGAAT TTGGGAGGCT GGTTCGTGCT TGAACCCTAC ATCACACCAT CTTTGTTTGA ACAGTGGGAA AACTGGGGCG ACGACTCTCA GGTCCCTGTA GATGAATACC ACTACACTCA AAAGTTGGGC AAATTGGTCG CTGGCCAGAG ATTAGATACG CACTGGAAGA CATGGTACAC CGAGCAGGAT TTCTCAGACA TAGCTGCTGC AGGCTTAAAC TTTGTCCGTA TCCCAATCGG CTACTGGGCT TTCCAATTGT TGGATAACGA TCCCTATGTT CAGGGCCAAG TTGAATACTT GGACCAGGCA CTTGGATGGG CCAACAAGTA TGGCTTGAAG GTGTGGATTG ACTTGCACGG TGCTCCAGGC TCTCAAAATG GTTTTGACAA CTCTGGTTTG AGAGATACTG TCCAGTACCA ACAACCTAAC AATGTCCAGG TTACATTGAA TGTGTTGGAG CAGATCTTTG AAAAATACGG TAACGGCGAG TATTCCAACT ATGTTATTGG TATCGAATTG TTGAACGAAC CTTTGGGCCC TGTCTCGGAC ATGAACAACT TGAAGAACTT CTTGACCCAA GGCTACAACA ACTTAAGACA AACTGGCTCC GTAACTCCCG TGATCATCCA TGATGCTTTC CAGGCTCCAG GTTACTGGGA CAACTTCTTG ACCGTTGAAA ACGGTGACTA CTGGAGCGTC GTTATTGACC ATCATCACTA 
TCAAGTGTTC TCCTACGGCG AATTAGCCCG TGATATCGAC CAACACATCT CTGTTGCATG TAACTGGGCT TGGGATTCCA AGAAGGAATA CCACTGGAAC GTTGCTGGTG AATGGTCTGC TGCCTTGACT GACTGTGCCA AGTGGCTTAA CGGTGTTGGT CGTGGTGCTC GTTACGCTGG TCAATACGAC AACTCTGCTT ACATTGGTGA TTGTACTCCA TACCTTGACT TGGGAACCTG GACTCAAGAT TACAAAACTA ACGTGCGTAA GTACATTGAA GCCCAATTGG ATGGTTTTGA ACAGACAGGT GGTTGGGTCT TCTGGAACTG GAAGACTGAA AACGCTGTTG AATGGGATTT CAAGAGATTG ACAGCTGCTC AACTCTTCCC ACTGCCACTC ACCGACAGAC AATTCCCTAA CCAATGTGGT TTCTAAAGCA TCTAAAGAAT GATTTCAAAA ACAAAAGTTG AATTGGGTTC AAGTGTTTGG TCTCAAATAC TGTTACACTA TAGACAACCT TTCTCAAACA CACTAGTTGC TTTTCTCATT TTCGATCACA CAATCTTCAT TTCAGTTCTG GTTGTTTATT AATACATTAT CTTACTAATT AATACACTAT TATTAAATTT ATTGTCTAG
|
Protein sequence | MVQLTSIVSS ILVLSQSLLV ASASINNPLL DNNNNLKKLT KKGASWDYQN DVIRGVNLGG WFVLEPYITP SLFEQWENWG DDSQVPVDEY HYTQKLGKLV AGQRLDTHWK TWYTEQDFSD IAAAGLNFVR IPIGYWAFQL LDNDPYVQGQ VEYLDQALGW ANKYGLKVWI DLHGAPGSQN GFDNSGLRDT VQYQQPNNVQ VTLNVLEQIF EKYGNGEYSN YVIGIELLNE PLGPVSDMNN LKNFLTQGYN NLRQTGSVTP VIIHDAFQAP GYWDNFLTVE NGDYWSVVID HHHYQVFSYG ELARDIDQHI SVACNWAWDS KKEYHWNVAG EWSAALTDCA KWLNGVGRGA RYAGQYDNSA YIGDCTPYLD LGTWTQDYKT NVRKYIEAQL DGFEQTGGWV FWNWKTENAV EWDFKRLTAA QLFPSPLTDR QFPNQCGF
|
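The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how that affects translation, using two short fragments taken from the gene sequence above. The helper names `codon_table` and `translate` are hypothetical and written for this illustration; they are not part of any database tooling, and only the single CTG reassignment is modeled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def codon_table(table_id=1):
    """Return a codon -> amino acid map for table 1 or table 12."""
    table = {
        "".join(codon): aa
        for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
    }
    if table_id == 12:
        # Alternative yeast nuclear code: CTG encodes Ser, not Leu.
        table["CTG"] = "S"
    return table


def translate(dna, table_id=1):
    """Translate DNA in frame 0, ignoring spaces and trailing partial codons."""
    table = codon_table(table_id)
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Start of the coding region, as found in the gene sequence.
print(translate("ATGGTACAAC TTACATCCAT T", table_id=12))   # MVQLTSI

# Fragment near the 3' end of the coding region; it contains one CTG codon.
fragment = "CAACTCTTCC CACTGCCACT CACCGACAGA"
print(translate(fragment, table_id=12))  # QLFPSPLTDR
print(translate(fragment, table_id=1))   # QLFPLPLTDR
```

Only the table 12 reading of the second fragment matches the listed protein sequence (`QLFPSPLTDR`), which is consistent with the Translation table field.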