Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_85073 |
Symbol | ROT2 |
ID | 4840545 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 896020 |
End bp | 898857 |
Gene Length | 2838 bp |
Protein Length | 911 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391860 |
Product | glucosidase II |
Protein accession | XP_001386183 |
Protein GI | 150866544 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTTC AATCGGTGCT ACTCTGGATT TTCTGCTTCA GCCTGGTTCT AGCAGTGAAA GAGTACCTTT TCAAGAACTG TGACAATTCA GGATTCTGCC ATCGTAACAA GCATTATGCT AATCAGATTA AAAGTCTCGG TTCAGCATTT GTTCCTCACT ATGCCATTGA CCCTTCTTCT GTTCATTTGC AAGAACTGGG TCAAGACTTC CATATCGCTG GTACGATAAT AAAGAAAGTT CCCAATATTC CTGATCTACA AGTGGAGTTG CCGATTACGG TTTCGTTGCT TGAAGGGAAC AATGTCCGAG TGCAGATTGA CGAATCGGGC AGAAACCAAA TCACTGTGAA GAATAAATAT GTAAATCATC GCAGATACAA CGAGACGGGG CAATGGGCTT TTGCAAGTGA AGAATTGCCT TATATTAGTA AGAAGGACGT CAAACTTGAC ATTTCCAGTG ATAAATTGTC TTTCACGTAT GGACCTGCTC AAGAGTATAC GGCGGAGTTG CAATTTTTGC CAGTTAAATT GACGATTTCC TATAAGGACG AGCCTCAAGT TGTAGTTAAC GATCAGAACT TTCTCAACTT GGAGCACTGG AGAGTCAGAG ATGCTAACGC AGAGCATCTC AGTGATGAGC AAGTAGATTT CGACATGTTC ACGGACAGCT TCGGGGATTC AAAGGAAGAT AAGTTGCCTT TGGGGCCAGA ATCCATAGGA CTTGATTTCA CCTTCAAGAA CTACAAAAAC TTGTATGGGA TTCCTGAGCA TGCTGACTCG TTGAACTTGA AAGATACCAC AGGCTCGAAC CAGCCGTACC GTCTTTTCAA CGTTGATATC TTTGAGTATG AGACTGACTC CAGATTGCCG ATGTATGGAG CAATTCCGTT GCTATTGGCT GTGAGACCCG AACTCTCTGT TGGTTTATTT TGGATCAACA GTGCTGATAC TTTCGTTGAT TTAGATAAAA ATTCAGATTC TGGCGACTCT AGGACTCACT GGATCTCAGA AAACGGTGTT ATTGATTTCA TGATCATCGT AGATAAAACA CCTGCTGCCA TTAACAAGAA CTACGGGTTG ATTACTGGTT ACGTCCAATT ACCTCCGCTA TTCTCTCTAG GATACCACCA ATGTCGCTGG AATTACAACG ACGAAAAGGA TGTATTGGAA ATAAACTCCT TGATGGACAA ACACAGAATT CCTTACGACA CCATTTGGTT GGATATCGAG TACACCGACT CCAAGAAATA CTTTACGTGG CAGAACGATG TTTTTCCTGA CCCAGAAGGT ATGATGAAGG AATTGGACGC TACTGGGAGG AACTTGGTGG TAATCATCGA CCCACACATC AAAACAGGCT ACCCTGTCAG CGACCAGTTC AGAAAGCAGA AAATTTGCAT CAATGATGCT ACCAATACTA GCTACTTAGG CCATTGCTGG CCCGGAGAAT CTGTTTGGAT CGATACTTTG AATCCTAATG CTCAAGCTCT TTGGGACTCT CAGTTCGTAT GGGACAAAAA GAACAAATTC ACAGGAGGTT TGTCCACCAA TCTTCATATC TGGAACGATA TGAACGAGCC CTCGGTATTT AACGGTCCAG AAACAACTTC TCCCAGAGAT AACTTACACT ACGGAGGATG GGAGCATCGT TCTGTTCATA ACATCTACGG TTTAAGTTAC CATGAAGCGA CCTACAATTC GTTAAAAAAA CGTCAATCAC ATACCACGAG AGAAAGACCA TTTATTCTTA CTAGATCGTA CTATTCTGGA TCTCAGAGAA CGGCTGCTAT GTGGACTGGA GACAATATGT CCAAATGGGA GTATCTACAG ATTTCGCTTC CAATGGTATT GACCTCAAAT ATAGTCGGTA TGCCTTTCGC GGGAGCCGAT GTCGGAGGAT TTTTTGGAAA CCCCTCGAAG GAATTGCTTA CCAGATGGTA CCAGGCTGGA ATCTGGTACC CTTTCTTCAG AGCACACGCG CACATAGATT CAAGGAGAAG AGAACCCTGG GTGGCAGGGG AACCTTACAC TTCTATCATG ACAGATGCTG TCAAGTTGAG ATACTCGTTA TTGCCCATGT TGTATACTGC GTTTTACGAA TCGTCAGTTT CAGGCATTCC AATTATGAAG CCTGTTTTCT ATGAAGCTCT TGACAATTTG GAAAGCTACT CGATTGAAGA TCAGTTTTTC GTAGGAAATT CCGGTTTGTT GGTTAAACCC GTTGTAGAGA AGGAAGCAGA TGACATCGAA ATCTATCTTC CGGATTCTGA GGTCTATTAC GATTTCACCA ATGGAAACAT CACCGGCGAT ATAACTAAGT TTCAATTGAA CAAACCTGGA TATGTCAAGA GGGCAGTAAC TTTGAATGAC ATTCCAGTTT TCTTAAAAGG TGGTTCCATC ATTGCACAAA AAAACAGATA CCGTAGATCT TCCAAGTTGA TGGTCAATGA TCCATACACA TTGATTGTTG CACCAGACTC GAACGGAAAC GCTAATGGAA AGTTGTATAT CGACGATGGT GAATCATTTG GCTATACCAA GGGTGAGAGC ATAATCATTG AGTTCCAGTT TTCAAAGAAA CTAGGATTGT CAGCCAAGGT TTCAAGTATA GACGTGAACT ATGTTGGTTC GTTGTCGAGT ATTGAAATTG AAAAGATTGT TATTATTTCC CAACCACAGT CGCAGATTAG TGAGGTCGAA CTCAGACAAT CTCTGAACTC CTGGAAGGCC AGATTCTCGA CCTCTAGAGA CAAGTTGATT ATTCATAACC CAAAGTTGAA GGTTGCTGCA GATTGGAGTG CTACTTTCGC TACTGATGTT GAGCATGACG AGTTGTGA
|
Protein sequence | MRLQSVLLWI FCFSSVLAVK EYLFKNCDNS GFCHRNKHYA NQIKSLGSAF VPHYAIDPSS VHLQESGQDF HIAGTIIKKV PNIPDLQVEL PITVSLLEGN NVRVQIDESG RNQITVKNKY VNHRRYNETG QWAFASEELP YISKKDVKLD ISSDKLSFTY GPAQEYTAEL QFLPVKLTIS YKDEPQVVVN DQNFLNLEHW RVRDANAEHL SDEQVDFDMF TDSFGDSKED KLPLGPESIG LDFTFKNYKN LYGIPEHADS LNLKDTTGSN QPYRLFNVDI FEYETDSRLP MYGAIPLLLA VRPELSVGLF WINSADTFVD LDKNSDSGDS RTHWISENGV IDFMIIVDKT PAAINKNYGL ITGYVQLPPL FSLGYHQCRW NYNDEKDVLE INSLMDKHRI PYDTIWLDIE YTDSKKYFTW QNDVFPDPEG MMKELDATGR NLVVIIDPHI KTGYPVSDQF RKQKICINDA TNTSYLGHCW PGESVWIDTL NPNAQALWDS QFVWDKKNKF TGGLSTNLHI WNDMNEPSVF NGPETTSPRD NLHYGGWEHR SVHNIYGLSY HEATYNSLKK RQSHTTRERP FILTRSYYSG SQRTAAMWTG DNMSKWEYLQ ISLPMVLTSN IVGMPFAGAD VGGFFGNPSK ELLTRWYQAG IWYPFFRAHA HIDSRRREPW VAGEPYTSIM TDAVKLRYSL LPMLYTAFYE SSVSGIPIMK PVFYEALDNL ESYSIEDQFF VGNSGLLVKP VVEKEADDIE IYLPDSEVYY DFTNGNITGD ITKFQLNKPG YVKRAVTLND IPVFLKGGSI IAQKNRYRRS SKLMVNDPYT LIVAPDSNGN ANGKLYIDDG ESFGYTKGLS AKVSSIDVNY VGSLSSIEIE KIVIISQPQS QINKLIIHNP KLKVAADWSA TFATDVEHDE L
|
| |