Gene PICST_85073 details


Gene Information

Locus tag: PICST_85073
Symbol: ROT2
ID: 4840545
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 896020
End bp: 898857
Gene Length: 2838 bp
Protein Length: 911 aa
Translation table: 12
GC content: 42%
IMG OID: 640391860
Product: glucosidase II
Protein accession: XP_001386183
Protein GI: 150866544
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
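
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, not part of the original record: the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the gene span includes a single stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 896020
end_bp = 898857
gene_length_bp = 2838
protein_length_aa = 911

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with exactly one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases inside the gene span that the protein length does not account for.
non_coding_bp = span_bp - coding_bp

print("span matches reported gene length:", span_bp == gene_length_bp)
print("coding bases:", coding_bp)
print("bases not accounted for by the protein:", non_coding_bp)
```

Under these assumptions the span equals the reported 2838 bp, and 102 bp are not accounted for by the 911-residue protein, which is consistent with the gene being flagged as spliced.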


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTTC AATCGGTGCT ACTCTGGATT TTCTGCTTCA GCCTGGTTCT AGCAGTGAAA 
GAGTACCTTT TCAAGAACTG TGACAATTCA GGATTCTGCC ATCGTAACAA GCATTATGCT
AATCAGATTA AAAGTCTCGG TTCAGCATTT GTTCCTCACT ATGCCATTGA CCCTTCTTCT
GTTCATTTGC AAGAACTGGG TCAAGACTTC CATATCGCTG GTACGATAAT AAAGAAAGTT
CCCAATATTC CTGATCTACA AGTGGAGTTG CCGATTACGG TTTCGTTGCT TGAAGGGAAC
AATGTCCGAG TGCAGATTGA CGAATCGGGC AGAAACCAAA TCACTGTGAA GAATAAATAT
GTAAATCATC GCAGATACAA CGAGACGGGG CAATGGGCTT TTGCAAGTGA AGAATTGCCT
TATATTAGTA AGAAGGACGT CAAACTTGAC ATTTCCAGTG ATAAATTGTC TTTCACGTAT
GGACCTGCTC AAGAGTATAC GGCGGAGTTG CAATTTTTGC CAGTTAAATT GACGATTTCC
TATAAGGACG AGCCTCAAGT TGTAGTTAAC GATCAGAACT TTCTCAACTT GGAGCACTGG
AGAGTCAGAG ATGCTAACGC AGAGCATCTC AGTGATGAGC AAGTAGATTT CGACATGTTC
ACGGACAGCT TCGGGGATTC AAAGGAAGAT AAGTTGCCTT TGGGGCCAGA ATCCATAGGA
CTTGATTTCA CCTTCAAGAA CTACAAAAAC TTGTATGGGA TTCCTGAGCA TGCTGACTCG
TTGAACTTGA AAGATACCAC AGGCTCGAAC CAGCCGTACC GTCTTTTCAA CGTTGATATC
TTTGAGTATG AGACTGACTC CAGATTGCCG ATGTATGGAG CAATTCCGTT GCTATTGGCT
GTGAGACCCG AACTCTCTGT TGGTTTATTT TGGATCAACA GTGCTGATAC TTTCGTTGAT
TTAGATAAAA ATTCAGATTC TGGCGACTCT AGGACTCACT GGATCTCAGA AAACGGTGTT
ATTGATTTCA TGATCATCGT AGATAAAACA CCTGCTGCCA TTAACAAGAA CTACGGGTTG
ATTACTGGTT ACGTCCAATT ACCTCCGCTA TTCTCTCTAG GATACCACCA ATGTCGCTGG
AATTACAACG ACGAAAAGGA TGTATTGGAA ATAAACTCCT TGATGGACAA ACACAGAATT
CCTTACGACA CCATTTGGTT GGATATCGAG TACACCGACT CCAAGAAATA CTTTACGTGG
CAGAACGATG TTTTTCCTGA CCCAGAAGGT ATGATGAAGG AATTGGACGC TACTGGGAGG
AACTTGGTGG TAATCATCGA CCCACACATC AAAACAGGCT ACCCTGTCAG CGACCAGTTC
AGAAAGCAGA AAATTTGCAT CAATGATGCT ACCAATACTA GCTACTTAGG CCATTGCTGG
CCCGGAGAAT CTGTTTGGAT CGATACTTTG AATCCTAATG CTCAAGCTCT TTGGGACTCT
CAGTTCGTAT GGGACAAAAA GAACAAATTC ACAGGAGGTT TGTCCACCAA TCTTCATATC
TGGAACGATA TGAACGAGCC CTCGGTATTT AACGGTCCAG AAACAACTTC TCCCAGAGAT
AACTTACACT ACGGAGGATG GGAGCATCGT TCTGTTCATA ACATCTACGG TTTAAGTTAC
CATGAAGCGA CCTACAATTC GTTAAAAAAA CGTCAATCAC ATACCACGAG AGAAAGACCA
TTTATTCTTA CTAGATCGTA CTATTCTGGA TCTCAGAGAA CGGCTGCTAT GTGGACTGGA
GACAATATGT CCAAATGGGA GTATCTACAG ATTTCGCTTC CAATGGTATT GACCTCAAAT
ATAGTCGGTA TGCCTTTCGC GGGAGCCGAT GTCGGAGGAT TTTTTGGAAA CCCCTCGAAG
GAATTGCTTA CCAGATGGTA CCAGGCTGGA ATCTGGTACC CTTTCTTCAG AGCACACGCG
CACATAGATT CAAGGAGAAG AGAACCCTGG GTGGCAGGGG AACCTTACAC TTCTATCATG
ACAGATGCTG TCAAGTTGAG ATACTCGTTA TTGCCCATGT TGTATACTGC GTTTTACGAA
TCGTCAGTTT CAGGCATTCC AATTATGAAG CCTGTTTTCT ATGAAGCTCT TGACAATTTG
GAAAGCTACT CGATTGAAGA TCAGTTTTTC GTAGGAAATT CCGGTTTGTT GGTTAAACCC
GTTGTAGAGA AGGAAGCAGA TGACATCGAA ATCTATCTTC CGGATTCTGA GGTCTATTAC
GATTTCACCA ATGGAAACAT CACCGGCGAT ATAACTAAGT TTCAATTGAA CAAACCTGGA
TATGTCAAGA GGGCAGTAAC TTTGAATGAC ATTCCAGTTT TCTTAAAAGG TGGTTCCATC
ATTGCACAAA AAAACAGATA CCGTAGATCT TCCAAGTTGA TGGTCAATGA TCCATACACA
TTGATTGTTG CACCAGACTC GAACGGAAAC GCTAATGGAA AGTTGTATAT CGACGATGGT
GAATCATTTG GCTATACCAA GGGTGAGAGC ATAATCATTG AGTTCCAGTT TTCAAAGAAA
CTAGGATTGT CAGCCAAGGT TTCAAGTATA GACGTGAACT ATGTTGGTTC GTTGTCGAGT
ATTGAAATTG AAAAGATTGT TATTATTTCC CAACCACAGT CGCAGATTAG TGAGGTCGAA
CTCAGACAAT CTCTGAACTC CTGGAAGGCC AGATTCTCGA CCTCTAGAGA CAAGTTGATT
ATTCATAACC CAAAGTTGAA GGTTGCTGCA GATTGGAGTG CTACTTTCGC TACTGATGTT
GAGCATGACG AGTTGTGA
 
Protein sequence
MRLQSVLLWI FCFSSVLAVK EYLFKNCDNS GFCHRNKHYA NQIKSLGSAF VPHYAIDPSS 
VHLQESGQDF HIAGTIIKKV PNIPDLQVEL PITVSLLEGN NVRVQIDESG RNQITVKNKY
VNHRRYNETG QWAFASEELP YISKKDVKLD ISSDKLSFTY GPAQEYTAEL QFLPVKLTIS
YKDEPQVVVN DQNFLNLEHW RVRDANAEHL SDEQVDFDMF TDSFGDSKED KLPLGPESIG
LDFTFKNYKN LYGIPEHADS LNLKDTTGSN QPYRLFNVDI FEYETDSRLP MYGAIPLLLA
VRPELSVGLF WINSADTFVD LDKNSDSGDS RTHWISENGV IDFMIIVDKT PAAINKNYGL
ITGYVQLPPL FSLGYHQCRW NYNDEKDVLE INSLMDKHRI PYDTIWLDIE YTDSKKYFTW
QNDVFPDPEG MMKELDATGR NLVVIIDPHI KTGYPVSDQF RKQKICINDA TNTSYLGHCW
PGESVWIDTL NPNAQALWDS QFVWDKKNKF TGGLSTNLHI WNDMNEPSVF NGPETTSPRD
NLHYGGWEHR SVHNIYGLSY HEATYNSLKK RQSHTTRERP FILTRSYYSG SQRTAAMWTG
DNMSKWEYLQ ISLPMVLTSN IVGMPFAGAD VGGFFGNPSK ELLTRWYQAG IWYPFFRAHA
HIDSRRREPW VAGEPYTSIM TDAVKLRYSL LPMLYTAFYE SSVSGIPIMK PVFYEALDNL
ESYSIEDQFF VGNSGLLVKP VVEKEADDIE IYLPDSEVYY DFTNGNITGD ITKFQLNKPG
YVKRAVTLND IPVFLKGGSI IAQKNRYRRS SKLMVNDPYT LIVAPDSNGN ANGKLYIDDG
ESFGYTKGLS AKVSSIDVNY VGSLSSIEIE KIVIISQPQS QINKLIIHNP KLKVAADWSA
TFATDVEHDE L
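
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 60 bases of the gene sequence above. It is a minimal, self-contained example, not the database's own tooling: the helper names `make_table` and `translate` are hypothetical, and the code does no splicing, so it only applies to a stretch of sequence that is uninterrupted coding sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def make_table(ctg_as_serine):
    """Build a codon -> amino acid map; optionally read CTG as Ser (table 12)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna, table):
    """Translate complete codons in frame 1, ignoring spaces in the input."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as listed in the record.
first_60 = "ATGAGGCTTC AATCGGTGCT ACTCTGGATT TTCTGCTTCA GCCTGGTTCT AGCAGTGAAA"

table_12 = translate(first_60, make_table(ctg_as_serine=True))
standard = translate(first_60, make_table(ctg_as_serine=False))

print("table 12:", table_12)
print("standard:", standard)
```

With CTG read as serine, the result matches the first 20 residues of the listed protein sequence (MRLQSVLLWI FCFSSVLAVK). The standard code gives leucine at position 15 instead.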