Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80169 |
Symbol | ATH1 |
ID | 4851056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 834503 |
End bp | 837912 |
Gene Length | 3410 bp |
Protein Length | 1083 aa |
Translation table | |
GC content | 42% |
IMG OID | 640392764 |
Product | vacuolar acid trehalase |
Protein accession | XP_001387389 |
Protein GI | 126274042 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.135506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.979445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCTTAACTTT ACCTGCAGTT TCAGCCTGCT GCAAGGTCTA TTTGATAAAC CTTCTTAATT TGGTTTGTGA ATCTATTTGA TTATTCCGTG ATTAGTTTCT CCGCTTACCT TATCTGACTT TCTCCCATAG GACTTTCGAT GTCTGTACTA TAAATACCAT GGTTGTTGTG TCGAAGTGGC GCTTTTCGGA TGTAATTCAT ACCGTTTGTG ACACTATTGT CGATACTTTC TATCGAAAAC AAGCTTTTGT CAAGACTATG GTGTTTGTCA ATGTTGGATT GATTTTGCTC CTTCACTGGT ACTTTGTCTC GCTTCTGCAA GCTTTCCCGC TCCAGCTCCG CAACCATGTT CCAGGCACCG TGACACATGA GCTGGTGGCC GAGGAAAGAG CTAGACACTT GAAACTCGTT AACAGCCCTC AAAACAAAGC CGTGTACACC CAGTTGAAGT ATGCCGCCAA CGCCTTTTTT GATGTAGATA CGAACACTAT AGGAACTACC CACTTTACAC CGTATAACCA GTATCAGCGA CAACCCTATG TAGCCAATGG TTATATCGGT GCAAGAATTC CGAATTTGGG ACAAGGATTC ACGTATGATC AGATCAGCGA CTCAGCAGAT GCTACGAACG ATGACTTGCT CAATGGATGG CCCCTCTTTA ACGAGCGCTA CTCGGGTGCA TTTGTTGCAG GCTTCTTTGA CATCCAGAAA AACACCACGG GAACGAATTT TCCTGAGCTT TTAGCCAATG GCTACGAATC TGTTATTGCC TCAGTTCCAC AATGGACTAC CTTACAAGTC AGCACTATAA AGGATGGAGT TGACTTCTCC TTGGATCCAG TAAACTCTGT AGATATGAAC AATGTCATCT CCAGTTATGG TCAAAACTTG TCTCTTGTGA ACGGAATCGT CACCACAGAA TACACCTGGC TCGATGATAT CAGAATCAAG TACAAGATCT TGGCTCACAG AAAAGAAATC AACCTTGGAC TCGTAGAGTT GTCTATTTCC AATATGGGTA ACACTTCGTT GACTTTCAAT GTAACTGATG TCTTGGACGC TTCTACAGCC CAACGGTCTC AGCTAACAGG TGTAAACTCG GATGGTAAGG GCATTTACAT AACCTTCCTG CCCAACGAAC TCAACTACAT TAATGGTGCC ATATATTCAA CGTTGCATGT GGAAGATGGT TCATCTATAC AGCTGTCTTC TTCAACCTCC AAAGTTAGTC AATATGTCGA AGTAACTGTA AATCCTGGTC GTACTTCCAG AGTTGTCAAG CTTGTCGGAG TAGCGACGAC TGATTTAGAT CCTCGTAATC TAGATTCGCT TGATAAAGTA CTTGCATTCG CCAAAAAGGT ATCTCAGACT TATACGAATG CAGACGATGT TGTCGAATCT CATTTGGTAG CTTGGGCTCA AACTCTTGAG TCCACTCCAG CAATTACATT TGCTGATGAC AGACTACTCA CATTGGCCAG TAGAGCTTCA CTCTTTCATC TTACAGCCAA TACTAGGCCT GATGCCAATG GCGTTACCGC TGCTATGGGT GTAGCTGGTT TGAGCTCAGA CAGTTATGCT GGGATGGTCT TTTGGGATGC AGATATATGG ATGATGTCTG GATTGTTACC ATTCATACCA TCTCACGCAA AAAGTATTGT CAATTACAGA ATGCATACTC ATGATCAGGC TATCAAGAAC TTACCAGAGG GTGCCAAGGG TGCAGTGTAT CCGTGGACCT CTGGCAGATT TGGTAATTGT ACTGCTACTG GACCCTGTCT AGACTATGAA TACCACATTA ACGTAGCTGT GGCAATGTCA GCCTGGCAGC TTTACTTGAG CGGTGCAGCT GACGAACAAT TCTTAGCGGA TGTTGTCTAT CCATTGGTAA ACGATGCCGC CGAATTCTTT GCCGATTATG TTGTCACCTA CGACGATACA TTGAAACAGT TCACTACGCA TAATTTGACT GATCCTGATG AATATGCAAA TCACGTCGAT AACGGAGCAT ATACCAACTC GGGTATTTCA CTTGTAGAGA AATGGGCCAT TCAAATCTCA AATCATTTAG GAAAAGAATT CCCACTGCAG TACTCAAACA TAGTGGGAAA TATGCATTTA CCTACATCTG GCAATTCCGA CAATATCACA TTGGAATACA CTGGCATGAA CTCGTCTGTT GGAATAAAAC AAGCAGATGT TATTATGATA ACCTACCCAT TGCAAAACGA GTTGATATCA GAAGCACAGG CCCTAACGAA TATGGAATTT TATTCTGTCA AACAAGTTAA CTATGGACCA GCTATGACTT TCCCGATCTT CTCGATTGTC GCATCACATG TCTCTACGTC TGGATGTGCT TCTCAGTCGT ACTTACAAAA GGCTGTACAG CCATTCTTAA GAGGTCCATT TGCCCAATTT TCTGAACAAA ATAACGATGA CTTTTTAACC AATGGTGGCA CTCATCCAGC TTTCCCATTC ATGACTGCCC ACGGTGGATT TTTACAAGCA GTAACACAGG GTCTCACTGG TTTAAGATTT GGATACGTCA TTGAAGATGG GCAAATCAAA CGAGCTTTGG ACTTGGATCC TACTGCATTG CCTTGTTTGC CTAATGGGGT GATTTTTGAT GGAATCAAAT ACAACAACCA TTCTCTTTCA TTCGCAGTCA ATGAAACTTC ATTCACTGTC AAAAATAATG GACCTATTTC AGAGAAGTCT GATGGAGTTG TTCGTATAAG AATTGCAGAT AGAAACCCAA GCAGAGGCAT ATACACCATA AATAGTGGTG AGGATTTCAG TTTCCCATTG TATACACCTA AACCAAGCTA CCCTACTAGT ATTTCTGAAT GTGGTTTGGC TAGTTTCTAC AATATCACTG ATGGCGCATA TGGGGATTCT CCAATTCTGA TAAACGACGG TGATAACACT ACTCAATGGC AAGCAAAGTA CAATGACACC ACTGGCAAGA TTCTTGTTGA CTTCAAGCAA TTTAAGAACG TTTCTAATGG AATCATTAAT TGGGGTGATA AACCACCCAA GAATTGGAAA CTATCTCAAT TCACTGGTTC ATTGGTAGAG TTTAAGGATG TAGAAGACGT TTTAAGTCAA GTTGATTTTG GCAATGAATT GTACAACATA TACCGATATG AGGATGAAGA CTACAAGCTC TATAAGCAAG ATGAGGTTTT CCAAGTAGTT CTCAGTTCCA ACGTGAGTAT TTCGGCACCA TTCATTTTGG AAGACTACAA TACCATAGAA CTTCCAAAGA GACAAAACAC CACTGAATTC ACTATAGACG AAGAATTGTA TTCCCAGTTC CTATTGATCG AAATTGAAGG GATCCACAAT ACTGTTCCTA TTGAAGATGA TACTGGTGGA GCCAAATTGT ACGAAGTAGT GTTCTTCTGA
|
Protein sequence | MVVVSKWRFS DVIHTVCDTI VDTFYRKQAF VKTMVFVNVG LILLLHWYFV SLLQAFPLQL RNHVPGTVTH ELVAEERARH LKLVNSPQNK AVYTQLKYAA NAFFDVDTNT IGTTHFTPYN QYQRQPYVAN GYIGARIPNL GQGFTYDQIS DSADATNDDL LNGWPLFNER YSGAFVAGFF DIQKNTTGTN FPELLANGYE SVIASVPQWT TLQVSTIKDG VDFSLDPVNS VDMNNVISSY GQNLSLVNGI VTTEYTWLDD IRIKYKILAH RKEINLGLVE LSISNMGNTS LTFNVTDVLD ASTAQRSQLT GVNSDGKGIY ITFLPNELNY INGAIYSTLH VEDGSSIQLS SSTSKVSQYV EVTVNPGRTS RVVKLVGVAT TDLDPRNLDS LDKVLAFAKK VSQTYTNADD VVESHLVAWA QTLESTPAIT FADDRLLTLA SRASLFHLTA NTRPDANGVT AAMGVAGLSS DSYAGMVFWD ADIWMMSGLL PFIPSHAKSI VNYRMHTHDQ AIKNLPEGAK GAVYPWTSGR FGNCTATGPC LDYEYHINVA VAMSAWQLYL SGAADEQFLA DVVYPLVNDA AEFFADYVVT YDDTLKQFTT HNLTDPDEYA NHVDNGAYTN SGISLVEKWA IQISNHLGKE FPLQYSNIVG NMHLPTSGNS DNITLEYTGM NSSVGIKQAD VIMITYPLQN ELISEAQALT NMEFYSVKQV NYGPAMTFPI FSIVASHVST SGCASQSYLQ KAVQPFLRGP FAQFSEQNND DFLTNGGTHP AFPFMTAHGG FLQAVTQGLT GLRFGYVIED GQIKRALDLD PTALPCLPNG VIFDGIKYNN HSLSFAVNET SFTVKNNGPI SEKSDGVVRI RIADRNPSRG IYTINSGEDF SFPLYTPKPS YPTSISECGL ASFYNITDGA YGDSPILIND GDNTTQWQAK YNDTTGKILV DFKQFKNVSN GIINWGDKPP KNWKLSQFTG SLVEFKDVED VLSQVDFGNE LYNIYRYEDE DYKLYKQDEV FQVVLSSNVS ISAPFILEDY NTIELPKRQN TTEFTIDEEL YSQFLLIEIE GIHNTVPIED DTGGAKLYEV VFF
|
| |