Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_61452 |
Symbol | EXG3 |
ID | 4839893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 77768 |
End bp | 79288 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391208 |
Product | glucan 1,3-beta-glucosidase |
Protein accession | XP_001385705 |
Protein GI | 150866196 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.243086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATA AGGTCAAACT CCGCTCGAAG GGCAGTAACC CAGTAGCTGC CGTTCCTGCG GCTACTCCGG GTCAACCTCC AAGTACAAGA CAGATCTACC AATCGAGAAA GAATTTCGGA GTCAACATTG GAGCTTGTTT TGTATCGGAA AAATGGATAT TCCATGAGTT GTTCGGTGAA AACGTTCCTG AATTCGAGCT CGCAGCAGTT GAAGCTATGG TGAGAGCGAA GGGTTTAGAT GGTGCCAAAA GTACATTTGA GAATTTCTGG TCTAATTTTA TGAATGACAA TGACTGGAGA TGGTTGCAGG ACAACCAGGT CACTTCTGTA AGAATTCCCA TAGGATATTG GGATGTTGCC GGTGGAAGGT TTACCAAAGG CACCCAATTT GAGAAGTATG GGTCTTCTGT CTATTCTGGA GCTTGGAATA TATTTAAGGA AAAGTTCGTC AAGCCAGCAG GAAAACATAA TATCTCTGTA TTGGTGGATC TTCATGGGTT ACCCGGTGGT GCTAACTCTA GCGATCACAG TGGTGAGAAG TCTGGTGGTC TGGCAGCTTT TTGGTCAAAC GAGAAATTTC AGTTGCAGGT TGCTGAAATG CTCACCTTTA TTGCCAGGGA TTTACAGCAG TTTGAGAACA TTTCTGGTAT TCAAGTAGTC AACGAAGCAG AATTCGCGCA AGAGCCAGCT TCAAAGCAAA CTACTTACTA TGTAGCTGCT CTCAATCTGA TCCGAGAAGC GGATTCAGGT ATCCCAGTGA TTATTTCTGA CGGCTGGTGG ACAGACCAGT GGGTGAGATT CATTCAGAAA CACCAACAGA ACAACAATAG TCTAGGTTTG ATAATCGATC ACCACGTATA CCGTTGTTTT TCTAAGGAAG ACAAGGATAA GTCTCCGATG AGGATCATTG AAGATTTGAA CAATGATGTA TTAACTAATT TGACTGATAA TGGTAAGGGA GTTGACATTA TGGTCGGTGA ATTCTCTTGT GTACTTGACC AACAGTCGTG GAATAAAGAT GGTGCACAAG GCAGAAGAGA TGAGTTGGTG ATCCAGTACG GTAATAGACA ATGTGACTTA ATTAATGAAA GAGCAGGTAT GGGCTTTTAC TTTTGGACTT ACAAGTTCCA GTCGGGAAAC GGAGGTGAAT GGGACTTAAA GCAAATGGTG GAAAAAGGGG CTATAAGGAA TCCATTTTCC GTCAATGGTA AGAGATTGCC TGACAGATCA ATGTTCGAAC AGGCTTACAA CCAAGCAATG CAAGGTCATG TTGGATACTG GAGTGGAACC GATCCTGGTG GAAGATATGA ACATGAGCGA TATGGTGAAG GGTTCACTAC TGCCTGGGCA GATGCCGAGG AATTCGCGAA GTTCAACGGG TCTGTCTTGG GCCGGGTTGA AGCATGGAGA ATTGCACGGT TGTCGGAACA TATCAGAGCT CGAGGTGCTC TGGGCTACTT GTGGGAATGG GAACAGGGTT TCTATGAAGG ATTGAAGCAG TTTCATTCTA ATGTGAGATG A
|
Protein sequence | MFDKVKLRSK GSNPVAAVPA ATPGQPPSTR QIYQSRKNFG VNIGACFVSE KWIFHELFGE NVPEFELAAV EAMVRAKGLD GAKSTFENFW SNFMNDNDWR WLQDNQVTSV RIPIGYWDVA GGRFTKGTQF EKYGSSVYSG AWNIFKEKFV KPAGKHNISV LVDLHGLPGG ANSSDHSGEK SGGSAAFWSN EKFQLQVAEM LTFIARDLQQ FENISGIQVV NEAEFAQEPA SKQTTYYVAA LNSIREADSG IPVIISDGWW TDQWVRFIQK HQQNNNSLGL IIDHHVYRCF SKEDKDKSPM RIIEDLNNDV LTNLTDNGKG VDIMVGEFSC VLDQQSWNKD GAQGRRDELV IQYGNRQCDL INERAGMGFY FWTYKFQSGN GGEWDLKQMV EKGAIRNPFS VNGKRLPDRS MFEQAYNQAM QGHVGYWSGT DPGGRYEHER YGEGFTTAWA DAEEFAKFNG SVLGRVEAWR IARLSEHIRA RGASGYLWEW EQGFYEGLKQ FHSNVR
|
| |