Gene PICST_69889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_69889 
SymbolSMX1 
ID4837024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp552507 
End bp555688 
Gene Length3182 bp 
Protein Length1004 aa 
Translation table12 
GC content43% 
IMG OID640388339 
ProductImportin-beta like gene 
Protein accessionXP_001382866 
Protein GI150864152 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5656] Importin, protein involved in nuclear import 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.574805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.668272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGAAATCGC TAGCACTTCG CTGAAAAATC ACCAATTTTT CATTCATATT CAGCACTATA 
CAGCTATAGC ATAATTTCAC TATGGACAAA CCCAGTTTAC TCAAGGCTTT GGCCGGCACA
TTGGATGCAG ACTTCCACAC CCGTAAACTG AGCGAAAGAC AGTTGAACGT CTATGAACAG
CAACCTGGCT TCACAGCCTA CTTGCTAGAG CTCATTACAG ACCCCGAGGC CCAATTGGGC
ATCCAGATTT CAGCAGCCAT CCTCTTCAAA AACAGAGTCA TGACCTATTG GTTGACTCCG
GAAAACAAGG CTCCCAGTCC GTTGACTATC AGAGACAACG AAAAGCCTCA GATCAAGGAG
AAATTGATCC AGACTTTGAT AAAGACCTAC AAGAACACCC AGCTCAAATT GCAATTGTCT
ACAGCCTTGC ACAATATCTT GAGTTCGGAA AAATGGGATG AAATACTTGC TATCATCAAG
AACTTGTTGA ACGACCTGTC CAATATCGAC CACGTGTACG TTGGCTTGAT CTGCTTGTAC
GAATACACTA AAAACTACAG ATGGTCTAGC TTTGAACATG CAAATTCTTC TAATCCAGTG
TTGGAAGATG TAGCCAATGA AGTATTCCCA CAATTGCAAA CTCTTATTCA CAATTTGATA
AACAGCGACT CAGCTACTGC TGACGAGATG ACGTACTTGA TAGTGAAGAT CTTCAAGTTT
ACCACTTTCT CATCTTTACC ATCGTACTTC TTGAACACGG AAAATTTGGG CAACTGGTGT
CAGATCCACA TCATGATCAT CAACAAGCCA TTGCCAGCAT CTGTGTTGAA CGAGGACTCA
ATTGAGCTTA GAAACCAGAA CCCTAGGATT AAAGCTGTGA AGTGGTGTTT CGGAAACTTG
CACCGTTTGT TGAGTCGTCA TGGTGGAGGT ATCACAACGA AAGATAAAAC TAACAACCAG
TTTGCTACAG CTTTTTTGGA GAACTTTGTT CCAGTCATCT TGAACGCGTT TTGGAAGATC
ATCGAAGAGT GGTCCACCAA ACAAATCTGG TTGAGTGAAT CTTCTTTGTA CCATATCATT
TCGTTCTTGG AACAGATTGT AGACACGCCA GCCTGGAACT TGATCAATGA CAAGATCGAT
GCTATCATCA AGCACGTTAT ATTGCCCACA TTGAATGCAA CAGAAGAAAC CATAGAGTTA
TACGAAGACG ATTCTGACGA GTACATCAGA AGATTCTTTG ATACCAACCG AGAAAGCAAC
ACGGCAGATG TGGCCTCCAT CAACTTCATC TATCGTTTGT CTGTCAAGCG GTTTACTGCC
AGCATAAACA CCGTGTTAGC CATCGTAAAT GATATCTTCA ACCGTAGAGC CGGCGATCGT
GGCAATGTAG ACGTGGCCAA GGAGACTGAA GGTGCATTCA GAGTATTATC TACTCTTTCC
CATAAGTTAG ACAACAAGAA CTCGCCAGTC CACGGACAAG TAGATAAGGT GTTACATACA
TTCATCTACC CCGAGTTGGC CGAACCTGTT ATTGCTTCGA CTCCTTGGTT GACAGCTAGA
GCCTGTGACA CTTTGGCCAT GTTTAGACAC AATTACAAGG ACCAAGAAGT GTTGAGAGAC
ATTTTCCAGG GCGTGGTCAA CTGTTTCCAG AAGGAAGACC AGTTTCCCAT TCAGCTTACC
GCTGTGGATG CCTTGTGCAC TTTGGTAGAA GAAGACACCG TAGCTGAACA TGTAGGAGAA
CAGGCTCCTC AGTTAATGGG GACCTTATTG GAGATGTCTA AGAAGTTCGA GAGTGACATT
TTGACAAGCG TAATGGAAAC ATTTGTCGAA AAGTTCGCTA AGAACTTGGA ACCATATGCA
ACAGAGTTGG CCAGAAAGTT GATGGAACAG TTCCTCAGAA CGGTATCCGA GTTGATGGAA
CAACAGTCTG CTGACTATAA CAACGTAGAC GTGGACAAGG AATACAAAGC AGCAGGTGTA
TTGGGTACCT TAACTTCGTT GGTGATTGCC ATGGGTACTT CTCCTGAAGT GTCTGTAGCC
TTGGAAGGAG TATTACTGGA AATGATTATC TTTATTCTCG AGAATGCACA AGTTTCCTTT
TTGTGTGAAA CCATCGAGAT TCTCGAATCA TTGATTTTCA GCTCTCGTAA CGTTTCTCCT
GTCATGTGGA ACATTTACCA GGTTGTCATT GATTCATTTG ATACATATGC ACACGAGTAC
TTCGACAGTT TCCAGCCGTT CTTCGAAGGT ATTATTAACC ATGGCTTTAC TCAACCTGTA
ATCACTGTGG AGAGTCCTCA AATCCAGCAA TTACTTTCAG TTTGTTTCAA GCTCTTGAAG
AGCGACAGCT TGGATCCCGT ATTTGCTCAC TCTACTTTTG AAATTATGGA GTTGACCATC
TTGGCTTTGA ACACGAGATT CGTGCCAATC TTGCCTCAGT TTTTACCTGA GATTTTCGAA
ACCTTCAGTT CATTGGAGAG CCAGGATGCT TTCGATGGCT ATATGTTGCA CCACTTGTCA
ATTTTGAGAG TTTTCTTTGC TGCATTCTAC GTAGACCCAG TCACGACCAT TCAATTCTTG
AACGAAAAGG GATTCACGCC AGCCTTGTTC CAGTTATGGA TCAAGCACTC AAGTGACTTC
CAAAGTGTAT ATGGATGCAA GTTGCAGATT TTGGCCAGTA TTTCCATCAT CAGAAGCCAG
GCTTTGACCT TGATTCCCGA AGATCTTATT GGCGAAACCG TTGATTTAAT GGTAGACAAC
ATTTCCACCT TGCCATCAGC CATCAAGGCC AAAAACGATA TCTTGCAAAA AGAAAGCCTG
AAACCTTTTG GTAATGCTGG TAATGAAGAA GAAGATGACG AATACAATGC TGCTTACTAT
GAAGACGAAT TGGAGGCAGA CGAAGCCGAA TTGGAAGCAT TGAAGCAAAC TCCTATCGAC
GAAATCAACG TTTTCCAGGT AATTGCTGAC AACTTGCAAA CCATGATCCA TCAGGATCCA
GGCAAATACG AGGCATTGTT CGGAGGTGTT AGCGACAACA AGAAGGAGAT GCTCCAGCAG
ATCTTGCACA TTGTGCACGA AAAGGCCAAA AACTAGAGCC AAAGAATACG TAGTTTATTT
CCTCATACTT CTGTTTAAGC ATTTAGCCCT GCTAATAAAA GTCGCATTAT AATTTTGCAC
CC
 
Protein sequence
MDKPSLLKAL AGTLDADFHT RKSSERQLNV YEQQPGFTAY LLELITDPEA QLGIQISAAI 
LFKNRVMTYW LTPENKAPSP LTIRDNEKPQ IKEKLIQTLI KTYKNTQLKL QLSTALHNIL
SSEKWDEILA IIKNLLNDSS NIDHVYVGLI CLYEYTKNYR WSSFEHANSS NPVLEDVANE
VFPQLQTLIH NLINSDSATA DEMTYLIVKI FKFTTFSSLP SYFLNTENLG NWCQIHIMII
NKPLPASVLN EDSIELRNQN PRIKAVKWCF GNLHRLLSRH GGGITTKDKT NNQFATAFLE
NFVPVILNAF WKIIEEWSTK QIWLSESSLY HIISFLEQIV DTPAWNLIND KIDAIIKHVI
LPTLNATEET IELYEDDSDE YIRRFFDTNR ESNTADVASI NFIYRLSVKR FTASINTVLA
IVNDIFNRRA GDRGNVDVAK ETEGAFRVLS TLSHKLDNKN SPVHGQVDKV LHTFIYPELA
EPVIASTPWL TARACDTLAM FRHNYKDQEV LRDIFQGVVN CFQKEDQFPI QLTAVDALCT
LVEEDTVAEH VGEQAPQLMG TLLEMSKKFE SDILTSVMET FVEKFAKNLE PYATELARKL
MEQFLRTVSE LMEQQSADYN NVDVDKEYKA AGVLGTLTSL VIAMGTSPEV SVALEGVLSE
MIIFILENAQ VSFLCETIEI LESLIFSSRN VSPVMWNIYQ VVIDSFDTYA HEYFDSFQPF
FEGIINHGFT QPVITVESPQ IQQLLSVCFK LLKSDSLDPV FAHSTFEIME LTILALNTRF
VPILPQFLPE IFETFSSLES QDAFDGYMLH HLSILRVFFA AFYVDPVTTI QFLNEKGFTP
ALFQLWIKHS SDFQSVYGCK LQILASISII RSQALTLIPE DLIGETVDLM VDNISTLPSA
IKAKNDILQK ESSKPFGNAG NEEEDDEYNA AYYEDELEAD EAELEALKQT PIDEINVFQV
IADNLQTMIH QDPGKYEALF GGVSDNKKEM LQQILHIVHE KAKN