Gene PICST_80169 details

Gene Information

Locus tag: PICST_80169
Symbol: ATH1
ID: 4851056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 834503
End bp: 837912
Gene Length: 3410 bp
Protein Length: 1083 aa
Translation table:
GC content: 42%
IMG OID: 640392764
Product: vacuolar acid trehalase
Protein accession: XP_001387389
Protein GI: 126274042
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
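
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the coding region is a single unspliced reading frame ending in one stop codon (the record lists the gene as not spliced). The function and variable names are hypothetical.

```python
# Values copied from the record above.
START_BP = 834503
END_BP = 837912
PROTEIN_LENGTH_AA = 1083


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_length(protein_length: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumes no introns)."""
    return protein_length * 3 + 3


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
non_coding = gene_length - cds_length

print(gene_length)  # 3410, matching the listed Gene Length
print(cds_length)   # 3252
print(non_coding)   # 158
```

Under these assumptions, the listed gene span is 158 bp longer than the reading frame needs, which fits a listed gene sequence that starts upstream of the first ATG codon.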


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.135506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.979445
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TCTTAACTTT ACCTGCAGTT TCAGCCTGCT GCAAGGTCTA TTTGATAAAC CTTCTTAATT 
TGGTTTGTGA ATCTATTTGA TTATTCCGTG ATTAGTTTCT CCGCTTACCT TATCTGACTT
TCTCCCATAG GACTTTCGAT GTCTGTACTA TAAATACCAT GGTTGTTGTG TCGAAGTGGC
GCTTTTCGGA TGTAATTCAT ACCGTTTGTG ACACTATTGT CGATACTTTC TATCGAAAAC
AAGCTTTTGT CAAGACTATG GTGTTTGTCA ATGTTGGATT GATTTTGCTC CTTCACTGGT
ACTTTGTCTC GCTTCTGCAA GCTTTCCCGC TCCAGCTCCG CAACCATGTT CCAGGCACCG
TGACACATGA GCTGGTGGCC GAGGAAAGAG CTAGACACTT GAAACTCGTT AACAGCCCTC
AAAACAAAGC CGTGTACACC CAGTTGAAGT ATGCCGCCAA CGCCTTTTTT GATGTAGATA
CGAACACTAT AGGAACTACC CACTTTACAC CGTATAACCA GTATCAGCGA CAACCCTATG
TAGCCAATGG TTATATCGGT GCAAGAATTC CGAATTTGGG ACAAGGATTC ACGTATGATC
AGATCAGCGA CTCAGCAGAT GCTACGAACG ATGACTTGCT CAATGGATGG CCCCTCTTTA
ACGAGCGCTA CTCGGGTGCA TTTGTTGCAG GCTTCTTTGA CATCCAGAAA AACACCACGG
GAACGAATTT TCCTGAGCTT TTAGCCAATG GCTACGAATC TGTTATTGCC TCAGTTCCAC
AATGGACTAC CTTACAAGTC AGCACTATAA AGGATGGAGT TGACTTCTCC TTGGATCCAG
TAAACTCTGT AGATATGAAC AATGTCATCT CCAGTTATGG TCAAAACTTG TCTCTTGTGA
ACGGAATCGT CACCACAGAA TACACCTGGC TCGATGATAT CAGAATCAAG TACAAGATCT
TGGCTCACAG AAAAGAAATC AACCTTGGAC TCGTAGAGTT GTCTATTTCC AATATGGGTA
ACACTTCGTT GACTTTCAAT GTAACTGATG TCTTGGACGC TTCTACAGCC CAACGGTCTC
AGCTAACAGG TGTAAACTCG GATGGTAAGG GCATTTACAT AACCTTCCTG CCCAACGAAC
TCAACTACAT TAATGGTGCC ATATATTCAA CGTTGCATGT GGAAGATGGT TCATCTATAC
AGCTGTCTTC TTCAACCTCC AAAGTTAGTC AATATGTCGA AGTAACTGTA AATCCTGGTC
GTACTTCCAG AGTTGTCAAG CTTGTCGGAG TAGCGACGAC TGATTTAGAT CCTCGTAATC
TAGATTCGCT TGATAAAGTA CTTGCATTCG CCAAAAAGGT ATCTCAGACT TATACGAATG
CAGACGATGT TGTCGAATCT CATTTGGTAG CTTGGGCTCA AACTCTTGAG TCCACTCCAG
CAATTACATT TGCTGATGAC AGACTACTCA CATTGGCCAG TAGAGCTTCA CTCTTTCATC
TTACAGCCAA TACTAGGCCT GATGCCAATG GCGTTACCGC TGCTATGGGT GTAGCTGGTT
TGAGCTCAGA CAGTTATGCT GGGATGGTCT TTTGGGATGC AGATATATGG ATGATGTCTG
GATTGTTACC ATTCATACCA TCTCACGCAA AAAGTATTGT CAATTACAGA ATGCATACTC
ATGATCAGGC TATCAAGAAC TTACCAGAGG GTGCCAAGGG TGCAGTGTAT CCGTGGACCT
CTGGCAGATT TGGTAATTGT ACTGCTACTG GACCCTGTCT AGACTATGAA TACCACATTA
ACGTAGCTGT GGCAATGTCA GCCTGGCAGC TTTACTTGAG CGGTGCAGCT GACGAACAAT
TCTTAGCGGA TGTTGTCTAT CCATTGGTAA ACGATGCCGC CGAATTCTTT GCCGATTATG
TTGTCACCTA CGACGATACA TTGAAACAGT TCACTACGCA TAATTTGACT GATCCTGATG
AATATGCAAA TCACGTCGAT AACGGAGCAT ATACCAACTC GGGTATTTCA CTTGTAGAGA
AATGGGCCAT TCAAATCTCA AATCATTTAG GAAAAGAATT CCCACTGCAG TACTCAAACA
TAGTGGGAAA TATGCATTTA CCTACATCTG GCAATTCCGA CAATATCACA TTGGAATACA
CTGGCATGAA CTCGTCTGTT GGAATAAAAC AAGCAGATGT TATTATGATA ACCTACCCAT
TGCAAAACGA GTTGATATCA GAAGCACAGG CCCTAACGAA TATGGAATTT TATTCTGTCA
AACAAGTTAA CTATGGACCA GCTATGACTT TCCCGATCTT CTCGATTGTC GCATCACATG
TCTCTACGTC TGGATGTGCT TCTCAGTCGT ACTTACAAAA GGCTGTACAG CCATTCTTAA
GAGGTCCATT TGCCCAATTT TCTGAACAAA ATAACGATGA CTTTTTAACC AATGGTGGCA
CTCATCCAGC TTTCCCATTC ATGACTGCCC ACGGTGGATT TTTACAAGCA GTAACACAGG
GTCTCACTGG TTTAAGATTT GGATACGTCA TTGAAGATGG GCAAATCAAA CGAGCTTTGG
ACTTGGATCC TACTGCATTG CCTTGTTTGC CTAATGGGGT GATTTTTGAT GGAATCAAAT
ACAACAACCA TTCTCTTTCA TTCGCAGTCA ATGAAACTTC ATTCACTGTC AAAAATAATG
GACCTATTTC AGAGAAGTCT GATGGAGTTG TTCGTATAAG AATTGCAGAT AGAAACCCAA
GCAGAGGCAT ATACACCATA AATAGTGGTG AGGATTTCAG TTTCCCATTG TATACACCTA
AACCAAGCTA CCCTACTAGT ATTTCTGAAT GTGGTTTGGC TAGTTTCTAC AATATCACTG
ATGGCGCATA TGGGGATTCT CCAATTCTGA TAAACGACGG TGATAACACT ACTCAATGGC
AAGCAAAGTA CAATGACACC ACTGGCAAGA TTCTTGTTGA CTTCAAGCAA TTTAAGAACG
TTTCTAATGG AATCATTAAT TGGGGTGATA AACCACCCAA GAATTGGAAA CTATCTCAAT
TCACTGGTTC ATTGGTAGAG TTTAAGGATG TAGAAGACGT TTTAAGTCAA GTTGATTTTG
GCAATGAATT GTACAACATA TACCGATATG AGGATGAAGA CTACAAGCTC TATAAGCAAG
ATGAGGTTTT CCAAGTAGTT CTCAGTTCCA ACGTGAGTAT TTCGGCACCA TTCATTTTGG
AAGACTACAA TACCATAGAA CTTCCAAAGA GACAAAACAC CACTGAATTC ACTATAGACG
AAGAATTGTA TTCCCAGTTC CTATTGATCG AAATTGAAGG GATCCACAAT ACTGTTCCTA
TTGAAGATGA TACTGGTGGA GCCAAATTGT ACGAAGTAGT GTTCTTCTGA
 
Protein sequence
MVVVSKWRFS DVIHTVCDTI VDTFYRKQAF VKTMVFVNVG LILLLHWYFV SLLQAFPLQL 
RNHVPGTVTH ELVAEERARH LKLVNSPQNK AVYTQLKYAA NAFFDVDTNT IGTTHFTPYN
QYQRQPYVAN GYIGARIPNL GQGFTYDQIS DSADATNDDL LNGWPLFNER YSGAFVAGFF
DIQKNTTGTN FPELLANGYE SVIASVPQWT TLQVSTIKDG VDFSLDPVNS VDMNNVISSY
GQNLSLVNGI VTTEYTWLDD IRIKYKILAH RKEINLGLVE LSISNMGNTS LTFNVTDVLD
ASTAQRSQLT GVNSDGKGIY ITFLPNELNY INGAIYSTLH VEDGSSIQLS SSTSKVSQYV
EVTVNPGRTS RVVKLVGVAT TDLDPRNLDS LDKVLAFAKK VSQTYTNADD VVESHLVAWA
QTLESTPAIT FADDRLLTLA SRASLFHLTA NTRPDANGVT AAMGVAGLSS DSYAGMVFWD
ADIWMMSGLL PFIPSHAKSI VNYRMHTHDQ AIKNLPEGAK GAVYPWTSGR FGNCTATGPC
LDYEYHINVA VAMSAWQLYL SGAADEQFLA DVVYPLVNDA AEFFADYVVT YDDTLKQFTT
HNLTDPDEYA NHVDNGAYTN SGISLVEKWA IQISNHLGKE FPLQYSNIVG NMHLPTSGNS
DNITLEYTGM NSSVGIKQAD VIMITYPLQN ELISEAQALT NMEFYSVKQV NYGPAMTFPI
FSIVASHVST SGCASQSYLQ KAVQPFLRGP FAQFSEQNND DFLTNGGTHP AFPFMTAHGG
FLQAVTQGLT GLRFGYVIED GQIKRALDLD PTALPCLPNG VIFDGIKYNN HSLSFAVNET
SFTVKNNGPI SEKSDGVVRI RIADRNPSRG IYTINSGEDF SFPLYTPKPS YPTSISECGL
ASFYNITDGA YGDSPILIND GDNTTQWQAK YNDTTGKILV DFKQFKNVSN GIINWGDKPP
KNWKLSQFTG SLVEFKDVED VLSQVDFGNE LYNIYRYEDE DYKLYKQDEV FQVVLSSNVS
ISAPFILEDY NTIELPKRQN TTEFTIDEEL YSQFLLIEIE GIHNTVPIED DTGGAKLYEV
VFF
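
Both sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the sequences are used. The following is a minimal sketch of that cleanup, with a helper for computing a GC fraction. It is an illustration rather than the method the database used for its GC content figure, and the function names are hypothetical.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases in seq (other symbols ignored)."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        return 0.0
    return sum(base in "GC" for base in counted) / len(counted)


# First line of the gene sequence listing above, used only as a short demo.
first_line = "TCTTAACTTT ACCTGCAGTT TCAGCCTGCT GCAAGGTCTA TTTGATAAAC CTTCTTAATT"
seq = join_blocks(first_line)

print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence listing through join_blocks and gc_fraction gives a value that can be compared with the GC content field of the record.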