Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_69889 |
Symbol | SMX1 |
ID | 4837024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 552507 |
End bp | 555688 |
Gene Length | 3182 bp |
Protein Length | 1004 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388339 |
Product | Importin-beta like gene |
Protein accession | XP_001382866 |
Protein GI | 150864152 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5656] Importin, protein involved in nuclear import |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.574805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.668272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCGAAATCGC TAGCACTTCG CTGAAAAATC ACCAATTTTT CATTCATATT CAGCACTATA CAGCTATAGC ATAATTTCAC TATGGACAAA CCCAGTTTAC TCAAGGCTTT GGCCGGCACA TTGGATGCAG ACTTCCACAC CCGTAAACTG AGCGAAAGAC AGTTGAACGT CTATGAACAG CAACCTGGCT TCACAGCCTA CTTGCTAGAG CTCATTACAG ACCCCGAGGC CCAATTGGGC ATCCAGATTT CAGCAGCCAT CCTCTTCAAA AACAGAGTCA TGACCTATTG GTTGACTCCG GAAAACAAGG CTCCCAGTCC GTTGACTATC AGAGACAACG AAAAGCCTCA GATCAAGGAG AAATTGATCC AGACTTTGAT AAAGACCTAC AAGAACACCC AGCTCAAATT GCAATTGTCT ACAGCCTTGC ACAATATCTT GAGTTCGGAA AAATGGGATG AAATACTTGC TATCATCAAG AACTTGTTGA ACGACCTGTC CAATATCGAC CACGTGTACG TTGGCTTGAT CTGCTTGTAC GAATACACTA AAAACTACAG ATGGTCTAGC TTTGAACATG CAAATTCTTC TAATCCAGTG TTGGAAGATG TAGCCAATGA AGTATTCCCA CAATTGCAAA CTCTTATTCA CAATTTGATA AACAGCGACT CAGCTACTGC TGACGAGATG ACGTACTTGA TAGTGAAGAT CTTCAAGTTT ACCACTTTCT CATCTTTACC ATCGTACTTC TTGAACACGG AAAATTTGGG CAACTGGTGT CAGATCCACA TCATGATCAT CAACAAGCCA TTGCCAGCAT CTGTGTTGAA CGAGGACTCA ATTGAGCTTA GAAACCAGAA CCCTAGGATT AAAGCTGTGA AGTGGTGTTT CGGAAACTTG CACCGTTTGT TGAGTCGTCA TGGTGGAGGT ATCACAACGA AAGATAAAAC TAACAACCAG TTTGCTACAG CTTTTTTGGA GAACTTTGTT CCAGTCATCT TGAACGCGTT TTGGAAGATC ATCGAAGAGT GGTCCACCAA ACAAATCTGG TTGAGTGAAT CTTCTTTGTA CCATATCATT TCGTTCTTGG AACAGATTGT AGACACGCCA GCCTGGAACT TGATCAATGA CAAGATCGAT GCTATCATCA AGCACGTTAT ATTGCCCACA TTGAATGCAA CAGAAGAAAC CATAGAGTTA TACGAAGACG ATTCTGACGA GTACATCAGA AGATTCTTTG ATACCAACCG AGAAAGCAAC ACGGCAGATG TGGCCTCCAT CAACTTCATC TATCGTTTGT CTGTCAAGCG GTTTACTGCC AGCATAAACA CCGTGTTAGC CATCGTAAAT GATATCTTCA ACCGTAGAGC CGGCGATCGT GGCAATGTAG ACGTGGCCAA GGAGACTGAA GGTGCATTCA GAGTATTATC TACTCTTTCC CATAAGTTAG ACAACAAGAA CTCGCCAGTC CACGGACAAG TAGATAAGGT GTTACATACA TTCATCTACC CCGAGTTGGC CGAACCTGTT ATTGCTTCGA CTCCTTGGTT GACAGCTAGA GCCTGTGACA CTTTGGCCAT GTTTAGACAC AATTACAAGG ACCAAGAAGT GTTGAGAGAC ATTTTCCAGG GCGTGGTCAA CTGTTTCCAG AAGGAAGACC AGTTTCCCAT TCAGCTTACC GCTGTGGATG CCTTGTGCAC TTTGGTAGAA GAAGACACCG TAGCTGAACA TGTAGGAGAA CAGGCTCCTC AGTTAATGGG GACCTTATTG GAGATGTCTA AGAAGTTCGA GAGTGACATT TTGACAAGCG TAATGGAAAC ATTTGTCGAA AAGTTCGCTA AGAACTTGGA ACCATATGCA ACAGAGTTGG CCAGAAAGTT GATGGAACAG TTCCTCAGAA CGGTATCCGA GTTGATGGAA CAACAGTCTG CTGACTATAA CAACGTAGAC GTGGACAAGG AATACAAAGC AGCAGGTGTA TTGGGTACCT TAACTTCGTT GGTGATTGCC ATGGGTACTT CTCCTGAAGT GTCTGTAGCC TTGGAAGGAG TATTACTGGA AATGATTATC TTTATTCTCG AGAATGCACA AGTTTCCTTT TTGTGTGAAA CCATCGAGAT TCTCGAATCA TTGATTTTCA GCTCTCGTAA CGTTTCTCCT GTCATGTGGA ACATTTACCA GGTTGTCATT GATTCATTTG ATACATATGC ACACGAGTAC TTCGACAGTT TCCAGCCGTT CTTCGAAGGT ATTATTAACC ATGGCTTTAC TCAACCTGTA ATCACTGTGG AGAGTCCTCA AATCCAGCAA TTACTTTCAG TTTGTTTCAA GCTCTTGAAG AGCGACAGCT TGGATCCCGT ATTTGCTCAC TCTACTTTTG AAATTATGGA GTTGACCATC TTGGCTTTGA ACACGAGATT CGTGCCAATC TTGCCTCAGT TTTTACCTGA GATTTTCGAA ACCTTCAGTT CATTGGAGAG CCAGGATGCT TTCGATGGCT ATATGTTGCA CCACTTGTCA ATTTTGAGAG TTTTCTTTGC TGCATTCTAC GTAGACCCAG TCACGACCAT TCAATTCTTG AACGAAAAGG GATTCACGCC AGCCTTGTTC CAGTTATGGA TCAAGCACTC AAGTGACTTC CAAAGTGTAT ATGGATGCAA GTTGCAGATT TTGGCCAGTA TTTCCATCAT CAGAAGCCAG GCTTTGACCT TGATTCCCGA AGATCTTATT GGCGAAACCG TTGATTTAAT GGTAGACAAC ATTTCCACCT TGCCATCAGC CATCAAGGCC AAAAACGATA TCTTGCAAAA AGAAAGCCTG AAACCTTTTG GTAATGCTGG TAATGAAGAA GAAGATGACG AATACAATGC TGCTTACTAT GAAGACGAAT TGGAGGCAGA CGAAGCCGAA TTGGAAGCAT TGAAGCAAAC TCCTATCGAC GAAATCAACG TTTTCCAGGT AATTGCTGAC AACTTGCAAA CCATGATCCA TCAGGATCCA GGCAAATACG AGGCATTGTT CGGAGGTGTT AGCGACAACA AGAAGGAGAT GCTCCAGCAG ATCTTGCACA TTGTGCACGA AAAGGCCAAA AACTAGAGCC AAAGAATACG TAGTTTATTT CCTCATACTT CTGTTTAAGC ATTTAGCCCT GCTAATAAAA GTCGCATTAT AATTTTGCAC CC
|
Protein sequence | MDKPSLLKAL AGTLDADFHT RKSSERQLNV YEQQPGFTAY LLELITDPEA QLGIQISAAI LFKNRVMTYW LTPENKAPSP LTIRDNEKPQ IKEKLIQTLI KTYKNTQLKL QLSTALHNIL SSEKWDEILA IIKNLLNDSS NIDHVYVGLI CLYEYTKNYR WSSFEHANSS NPVLEDVANE VFPQLQTLIH NLINSDSATA DEMTYLIVKI FKFTTFSSLP SYFLNTENLG NWCQIHIMII NKPLPASVLN EDSIELRNQN PRIKAVKWCF GNLHRLLSRH GGGITTKDKT NNQFATAFLE NFVPVILNAF WKIIEEWSTK QIWLSESSLY HIISFLEQIV DTPAWNLIND KIDAIIKHVI LPTLNATEET IELYEDDSDE YIRRFFDTNR ESNTADVASI NFIYRLSVKR FTASINTVLA IVNDIFNRRA GDRGNVDVAK ETEGAFRVLS TLSHKLDNKN SPVHGQVDKV LHTFIYPELA EPVIASTPWL TARACDTLAM FRHNYKDQEV LRDIFQGVVN CFQKEDQFPI QLTAVDALCT LVEEDTVAEH VGEQAPQLMG TLLEMSKKFE SDILTSVMET FVEKFAKNLE PYATELARKL MEQFLRTVSE LMEQQSADYN NVDVDKEYKA AGVLGTLTSL VIAMGTSPEV SVALEGVLSE MIIFILENAQ VSFLCETIEI LESLIFSSRN VSPVMWNIYQ VVIDSFDTYA HEYFDSFQPF FEGIINHGFT QPVITVESPQ IQQLLSVCFK LLKSDSLDPV FAHSTFEIME LTILALNTRF VPILPQFLPE IFETFSSLES QDAFDGYMLH HLSILRVFFA AFYVDPVTTI QFLNEKGFTP ALFQLWIKHS SDFQSVYGCK LQILASISII RSQALTLIPE DLIGETVDLM VDNISTLPSA IKAKNDILQK ESSKPFGNAG NEEEDDEYNA AYYEDELEAD EAELEALKQT PIDEINVFQV IADNLQTMIH QDPGKYEALF GGVSDNKKEM LQQILHIVHE KAKN
|
| |