Gene PICST_80310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80310 
Symbol 
ID4851454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1847178 
End bp1850637 
Gene Length3460 bp 
Protein Length938 aa 
Translation table 
GC content43% 
IMG OID640393162 
Productpredicted protein 
Protein accessionXP_001387988 
Protein GI126274582 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5215] Karyopherin (importin) beta 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACTTTAAAG TGCTCTGGAA ACAGAAAGAC ATACTTGTAT AGGTGCTCGC GTCCGAGATC 
TGTAGCACAG CTAGACACAC TTAGTCGAGA ATTGCGAACT GGTCCAGGTG CAAAGGAGAC
GCGTTGCCAG ATTCAGCCGC CAGTTATACT CATTGTTAAT CGATCTCGAT CAGAGACTCA
TTGGATCTAC AGAATTCCAG TGAAATTTGC CACAGGTATT GGTTTTTGAG AAAAGCCTGT
CATTAAGATC TATACAACAC CTATTGAATC AATAGTCTGA ATCGTCTAGT GAAGTATTGA
ACCCCAACAA AGTATTTATC CATAGAGATT CGGTTTTCAT TCACAACTCG TTCTTGGAAT
TTCATCTACA AATCGGAGTT AATAACTCAT AAAATCATTA AGACCTTCAA ATTTTCCTTT
TTATTTCAGA ATCTTGAATT CTCACATATT CTTTATTGAA TTCCCATAGT CATCTTCGCC
ATGTCCTGGA CTCCTGAACC GCAAGCTCTC GAGCAGCTTA GGCACATATT CAGAGGTACG
TTATCGTCCA ATAATAACGA ACGAAAGCTT GCAAACGAAG CATTGGACCA GGCGAAATTA
CAGCCAGAGA TCGAGAACTA TTTGCTAGAG CTTTTGGTCG TGGATGATTC TGCAAAATCA
GATATCAGAG CTGCCGCAGG TATCAACTTG AAAAACAGCA TATTGAACAG GAGGCATCAG
AAGGCGCCGC CTAACAGACT GTACTTGTTG GAGAACATTT TGAAGGGTTT GATGTCGAAA
GATAATATGG TGAGAAACAT TACAGGGAAC GTAATAACGT CACTTTTCTC CATCTATGGC
TTGGAAGGAT GGCCGCAAGC CCTTCCGCAG CTTCTTGAAT TGGTTAACCA CACCAGCACG
GATGGTTCCA TGACGTCTCA GGAAGCTGCC AGTGGGGCCT TGAGCAAGAT CTGCGAAGAT
AGCTTCTATT CTCTAGATGT AGAGTTCAAC GGAGAAAGAC CTCTCAATTT CATGATCTCC
AATTTCTTGA AGTTGATGAA CCATCCTGGT AGCGGCAAAA TCAGGGCCAA CTCCATCCAT
TGCATTGCTC AGTTTATCCC CTTAAAGACG CAGTCGTTCT TGGTACACAT TGACGAATAT
CTCCAGAAGT TGTTTGAGTT GGCACACGAT CCCAGCCGAG AAGTTAGAAA GAACATTTGT
TCGTCATTTG CATTAATCTT GGAAACCAGA CCAGACAAAT TGATGCCCCA TTTGGATGGA
GTTATAAACT ACTGTTTGCA CTTGATGCAA GATCCAAGCG AAGAAGTAGC TTTGGAAGCT
TGTGAGTTCT TATTGGCTTT ATCTACTGCT CCAGAGACAG AATCTGACAA GGAAATCTTC
AGTCCCAAGT TGAAAATGAT CTTGCCGACT CTCTTGGACA AGATGGTCTA TTCTGAGGAA
GATATCTTCT TGATGGAAAT CGCTGATTCA AAAGACGATG CCACTATTGC AGACAAGGAT
GAAGATATCA AGCCACTTAA TGCCAAATCA AAGGATATCC ACTCTGTAGC CAATACCAAT
TCTGCCAGTA ACGGCTCTAC TAAGAAAAAA GCTGCTGGCG ACGATTCTGA TTCCGATTTC
GATGACGACG AAGATGAAGA TGATGAAGAC CTGGAGTTGG ATCAATGGAG TTTGAGAAAG
TGCTCAGCTG CCACCTTGGA CATTCTATCC CTCAACCTTC CTGGTGAGGT TTTAAATGTG
ACATTACCCA TATTACAAGA TAGGATTGTA TCTCAGGAAT GGCCTGTTCG TGAAGCAGCA
ATTCTTGCCT TTGGAGCTAT AAGTAAAAGT TGCCTAGAAT TAGCCAGAGA AAAATTACCC
ACTTTGGTGC CATTCCTAGT CGATAGATTG AAAGATAGCG AACCCAGAGT GAGACAGATT
GCTTGTTGGA CGTTGTCCAG GTTTGCTACC TGGATCGCTG AGGAAGCGCA TGAAGGGGGA
CAGTATGCAA ATTACTTCCA ACCTACGTTT CAGTCGATTG TAGCCTGTTC CATGGACCAG
AAAAAGGTGG TTCAAGAAGC TGCATGTTCT GCATTATCGT CATTTATAGA AGAGTCCGAT
TCCACCTTGA TAGAGTACTA CTTAGGACCA TTGTTGGACC ATTTTGCAAA ATGCTTCCAG
ACATACCAGC GTAAGAACTT GATCATCTTG TACGATTGCG TTCAGACATT TGTAGAAAAA
ATGGGATACG ACAACTTGGC GTCTAAACCA GAATACGTCA ACACCTTGTT GCCTCCGTTG
TTGCATAAAT GGCAAATCCT AGACGACAAT GATACGGGAT TGTGGCCCTT GTTGGAATGT
ATGGCCTCTA TTGCAGCTAC CTTGGGTGAA CTCTTTGCTC CTTATGCTGT TCCAGTGTAC
GAGAGAGCTA TCAATATTTT GTCCAATTGT ATCCAGCTAG ACTTGCAAAC TCATACGGAT
CCCTCTATCG AGGCTCCTGA AAAGGACTTC ATTGTCACCT CTTTGGACTT GGTTGACGGA
TTGATCCAAG GGTTTGGTCA TCATTCCGCT GATTTGATCA GACAGCATAA CACAAACTTG
ATGGAATTGT TGATGCTTTG CTTCGAGGAC CACTCAGCAG ACGTCAGACA ATCGGCATAT
GCCTTGCTTG GAGACCTTTC CATTTTCACT TTGGACCCAA TAGTGAAACC ATACTTGCAA
TCAATATTCT TGAGCATTGG TAACGAGATC AATAACCGTT CTTACTCTAC TTTCCCAGTC
TACAACAATG CTATCTGGGC TCTAGGCGAA ATCGCCATGA GATTGCCGTA TGAGGAAATG
AAACATTACC TAGCTAACTT GGTCAACTTG TTAATTCCTG TGTTAAACGG CAGTGACATT
CAGCAGACAG TTTTAGAAAA TGCGGCCATC TGCTTGGGCA GAATGGGATT GAATGGAGGA
GCTGAGGTGA TTTCGCCCAG ATTGCCAGAA TTCATCGTTC AATGGTGCGC CCAGATGTTG
TATTTGGTAG ATAACAGCGA GAAGGAAACC GGGTTCCAAG GGATGTTGAA TATTATACAT
GGCAATCCAG ACCAGGGCTT TGGCGGATTA TCAAACCAAC AGGGAAAAAA GAACTTGAGT
CTTTTCGTGG TGTGCATTGG AAATTACATG GAACCACCAG AACACTTGAA GCAATTGTTT
GGTCAATTCT TGGTGTCGTA CAAACAGCTT CTTGGAGGTG ACATCTGGGA CCACCAGATC
TTGGCTGGGA TCGATGGCGA GTCGAGGATG ATGCTCTCGC AAGTCTACGG AGTATAAATC
ACGGATCGTT TGGTCCTTTG CATTGTTTAA TTGCTTCTTG TCAGATAAAT ATTCATCATC
ATTACTTATT AGTTAATTAT ATAGATTTCA TTTTCAAGGG GTGGTTAGCC CTGTATGAGC
ATTATATAGA TCGTTATTCA TAATACATTC TGAGGAATGG
 
Protein sequence
MSWTPEPQAL EQLRHIFRGT LSSNNNERKL ANEALDQAKL QPEIENYLLE LLVVDDSAKS 
DIRAAAGINL KNSILNRRHQ KAPPNRLYLL ENILKGLMSK DNMVRNITGN VITSLFSIYG
LEGWPQALPQ LLELVNHTST DGSMTSQEAA SGALSKICED SFYSLDVEFN GERPLNFMIS
NFLKLMNHPG SGKIRANSIH CIAQFIPLKT QSFLVHIDEY LQKLFELAHD PSREVRKNIC
SSFALILETR PDKLMPHLDG VINYCLHLMQ DPSEEVALEA CEFLLALSTA PETESDKEIF
SPKLKMILPT LLDKMVYSEE DIFLMEIADS KDDATIADKD EDIKPLNAKS KDIHSVANTN
SASNGSTKKK AAGDDSDSDF DDDEDEDDED LELDQWSLRK CSAATLDILS LNLPGEVLNV
TLPILQDRIV SQEWPVREAA ILAFGAISKS CLELAREKLP TLVPFLVDRL KDSEPRVRQI
ACWTLSRFAT WIAEEAHEGG QYANYFQPTF QSIVACSMDQ KKVVQEAACS ALSSFIEESD
STLIEYYLGP LLDHFAKCFQ TYQRKNLIIL YDCVQTFVEK MGYDNLASKP EYVNTLLPPL
LHKWQILDDN DTGLWPLLEC MASIAATLGE LFAPYAVPVY ERAINILSNC IQLDLQTHTD
PSIEAPEKDF IVTSLDLVDG LIQGFGHHSA DLIRQHNTNL MELLMLCFED HSADVRQSAY
ALLGDLSIFT LDPIVKPYLQ SIFLSIGNEI NNRSYSTFPV YNNAIWALGE IAMRLPYEEM
KHYLANLVNL LIPVLNGSDI QQTVLENAAI CLGRMGLNGG AEVISPRLPE FIVQWCAQML
YLVDNSEKET GFQGMLNIIH GNPDQGFGGL SNQQGKKNLS LFVVCIGNYM EPPEHLKQLF
GQFLVSYKQL LGGDIWDHQI LAGIDGESRM MLSQVYGV