Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_62666 |
Symbol | AGL1 |
ID | 4839770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 37412 |
End bp | 39130 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640391085 |
Product | alpha-glucosidase maltase |
Protein accession | XP_001385341 |
Protein GI | 126137636 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.97685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.170022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATTA CTCGCAACTG GTGGAAAACC TCCACCGTTT ATCAGATTTG GCCAGCTTCT TATAAGGACT CTAATGGTGA TGGAATTGGT GATATTAAAG GTATTATCTC AACTTTGGAC TATCTCAGAG ACTTGGGGGT TGATGTTATT TGGTGTAGTC CGATGTACGA CTCACCACAG GATGACATGG GTTACGACGT TAGAGACTAT GAAAAGGTAT ATCCAAAATA TGGTACAAAC GACGATATGC AGTTACTTAT TGACGAGTGT CACAGTCGTG GTATGAAATT GATTTTGGAC TTGGTTATTA ACCACACTTC AAGTGAGCAC GTATGGTTCA AGGAATCGAG ATCCTCCAAG ACAAATCCAA AAAGAGATTG GTATATTTGG AAACCACCCA AATACGATGA AGAAGGCAAC AGACGCCCAC CCAACAATTG GGCATCTTAC TTCTCTGGTT CTGCTTGGGA GTACGATGAG CGCACAGATG AGTACTACTT GAGACTTTTT GCCAGTTCCC AGCCAGACCT AAATTGGGAA AATGAAGAGA CCAGAAATGC CATCTACGAA TCTGCAGTGA AGTTTTGGTT GGACAAAGGC GTAGATGGGT TCAGAATCGA CACTGCTGGT TTATACTCTA AGGTTCAAAC TTTTCCAGAT ACTCCGATCA TCTTTCCAGA GGAAGAATTT CAATCGAGTA AGTTGTACAG TCAGAATGGT CCCCGTATTC ATGAGTTTCA CAAGGAAATG TACTCAAAGG TCACAAGTAA ATATGATGCA ATGACAGTTG GCGAAGTTGG TCATTGTTCT CGCGAAGACG CCTTGAAGTA CGTAAGTGCT AAAGAGCATG AAATGAACAT GATGTTTCTT TTCAATAAAG TGTGGGTCGG ATGTGACAGA AACGATCGTT GGAAATTTGA TGGTTGGAAA TTGACCGATT TTAAGAAGGC CGTTGAAATA GATTGTGATT TCATTGCTGG GACAGATGCT TGGTCAACCG TTTTCATTGA AAATCATGAC CTTCCCAGAT GTGTTACTAG ATTTGGAGAC AAAAAACATC GTTCCCAAGC TGCTAAGTTA CTCTCAATTT TAGGTACTAC ATTGACTGGT ACACTCTTTA TCTATCAGGG CCAAGAAATT GCCATGGAAA ACTTGCCAAG AGATTGGTCA ATTGATGAAT ATAAAGACAT CAATACAATC AACAGATACA AGGAGTTCAA GGATAAATAT GGAAATGATC CTGATTTCAA GGAAAAAGAG GAAAAGTTGA TGGATATAAT TAATCTATTG GCTAGAGATA ATAGCAGATC CCCAGTTCAA TGGGATGCAT CTCCAAACGC TGGATTTACC ACAGGAATTC CATGGACAAG AGTCAATGAG AACTATACCA CAATCAACGT TGAAAGTCAA ATTAGAGATC CAAATTCTGT TTTAAACTTC TACAAGAAGT CTATTCAAAT TAGAAAAAGT TATCAAGACT TGCTCATTTT TGGAGATATG AAGATCTTAG ATTATGAAAA TCAAAAGACC TTTACCTACC TAAAGTTGAA CGAGAATGCT TTATCGCCAA AAGCTTATAT TGTCTTGAAT TTCTCCAATG AAGAAGTCAA CTTTGAAAAG TTGATTAATG GTGATTTTGA ACTAGTTCTC AGTAATGTAG ATGTTATCAA TGAACAGAAA TTGTCTCCAT TTGAAGCACG TCTTTACATT GTTGATTAA
|
Protein sequence | MTITRNWWKT STVYQIWPAS YKDSNGDGIG DIKGIISTLD YLRDLGVDVI WCSPMYDSPQ DDMGYDVRDY EKVYPKYGTN DDMQLLIDEC HSRGMKLILD LVINHTSSEH VWFKESRSSK TNPKRDWYIW KPPKYDEEGN RRPPNNWASY FSGSAWEYDE RTDEYYLRLF ASSQPDLNWE NEETRNAIYE SAVKFWLDKG VDGFRIDTAG LYSKVQTFPD TPIIFPEEEF QSSKLYSQNG PRIHEFHKEM YSKVTSKYDA MTVGEVGHCS REDALKYVSA KEHEMNMMFL FNKVWVGCDR NDRWKFDGWK LTDFKKAVEI DCDFIAGTDA WSTVFIENHD LPRCVTRFGD KKHRSQAAKL LSILGTTLTG TLFIYQGQEI AMENLPRDWS IDEYKDINTI NRYKEFKDKY GNDPDFKEKE EKLMDIINLL ARDNSRSPVQ WDASPNAGFT TGIPWTRVNE NYTTINVESQ IRDPNSVLNF YKKSIQIRKS YQDLLIFGDM KILDYENQKT FTYLKLNENA LSPKAYIVLN FSNEEVNFEK LINGDFELVL SNVDVINEQK LSPFEARLYI VD
|
| |