Gene Information |
Locus tag | PICST_42583 |
Symbol | NUP116 |
ID | 4836889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2106428 |
End bp | 2109283 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640388204 |
Product | nuclear pore protein |
Protein accession | XP_001382631 |
Protein GI | 150863970 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
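The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, which the record does not state but which matches the listed gene length, and it assumes the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2106428
end_bp = 2109283

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2856, matching "Gene Length | 2856 bp"
print(protein_length)   # 951, matching "Protein Length | 951 aa"
```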
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTTTTGGCT CGTCAAATAC AGCTGGATCA CCGTTTGGGG GTTCTGGAGC CACTTCCACA TTCGGGACCG GTACTGGTAC AAATTCTACT TCATTTGGTG CTTCCAATGC GGCTGCAGGT GGTATTTTCG GCGGTGCTAA CAAATCGGGC TTTGGAGCCT CTACGGGAGC GTTTGGTTCT GGTGGCACTT CTCCATTTGG AGCTACTTCA GGTACTGGCT TCGGTGGAGT AGGAGGCACT GTAGATTCTA ATACCAACAA CGGAACAGCC GTAAAGCCGT TTGCAGCTGT TAGCGAAAAG GACAATACTG GAACTTTGAA CGTTTTCCAG AACATCTGTG CTATGCCTGA GTACAAAAAT TTTTCGTTTG AAGAGTTGAG ATTGAAAGAT TACGAACAAA ACAGGCGTTA TGGTTCTCAG AATGCTGCTG GTGCTGCTGG TTCTACAACC GGCGGATTCA GCTTCGGGGC TTCCACCAAT GCTCCTTCCT CTACTTCTGC ATTTGGTTCT GGTGGACAAA CTGGCTTCGG AGCACAACCT AACACAGGCC TGGCCTTTGG GTCTTCAGGA GCATTTGGAG CAGGTCAGCC ATCAAGTTCT CCATTTGCAG CACAATCCAA TACTACAAGC ACCTTTGGCC AGCAACAGCA GAGCTCACCT TTTGGAGGAG CTGCAGCTGG TGGTTCTGGA TTTGGAACTT CTAATGCAGC TAGTGTTTTT GGAAGTTCCA ACACTGCTTT TGGTGCCAAT AAGCCGGCTA CAGGGTTTGG AGCCACTGGT TCTTCATTTG GAAATGCTGG CACTGGCGGA GGTCTCTTTG GTTCTAACAA TGCCAATTCT AATACTAACG CCAATGCATC TCCTTTTGGA GGTACTGGCA ATACAGCTGG TGGTTTTGGC CAGACCAGCA ACTCGCCATT TGGCCAAAGC AATACTGGAA ACGCCTTTGG CCAGAGCAAC ACTGGCAGTT CACCATTTGG AACTCAGAAC AATGCTGGTG GAGGTTTGTT TGGAGGAGCT CGGAATAACC TATCAAATAG TAATGCTTTT GGAGCCTCCA ACACCAACGC GGGTTTTGGT GCCTCTTCTA CTGGTGGAGG CTTGTTTGGT GGAGCTCAGA ATACGGCCAG CACCTTTGGT GCCAATAAAC CTGCTACAGG AGGATTGTTT GGAGGAGCTG GTCAGAGTAA TACTCAAGCA GGAGGATTAT TTGGACAAAA TAACCAGCAA CAAAATACTT CTGGTGGATT GTTTGGCCAA AATAACCAAC AACAACAACA ACAAGCATCA GGAGGCTTAT TTGGAGCAAA ACCTGCTGGA AATACTGGAG GTTTATTTGG TAGTAGTCAG ACTTCTAACA CAGCTACTGG CGGAGGTTTG TTTGGTGGTA ATACCAATCA ATCTGGAGCT GGTGGTTTAT TTGGAGCCAA AACTAATACA AGTGGTGGTT TATTTGGAGC ATCGCAGCCT GCCTCCAGTG GTAATGCAGC CTTTGGCCAA GGTGCTGCCA ATACACAGCA ATCGGGTGGA TTGTTTGGGA ACAAACCAGC CGGAGCTACT GGTGGTGGCT TGTTTGGTGG AAGCAATACT GTCAACGCTG CAGCTGGTGC TAGTGGCGGA TTGTTTGGAG GTAGTTCTAC CAATCAAGGT GGAGGTCTTT TTGGTTCAAG CAACAATGCT GGTGGATTAG GGACCAATAC ACAACAATCT GGCGGTTTAT TTGGTGCCAA ACCTGGTAAT ACAGGAACAG GTGGAGGTTT GTTTGGTGGT GGATCGTCGG GAACAAACAC TCTTGGAAGT 
TCGACTCTTG GAACCAACGT GTTGCAGAAT AATGCCCAGC AATCGCAACA GCCATTGGTC AACATAAACT CTCAAAATCC ATATGGGAAC AACCCGATTG TTATGTTCCA GAGTATTACC GGAAGTCAGC AAACTAATGC TGCTGGTCCA TCTATCACAC CAGTAAAGAC TGTCAACAAG AAACCCGTAA CATTGGGCAA TCCTCACAGA GTTGCTCCAA TATTCAGGGT GAGTGCATTG CCAAAGAAGA AGTCTACATC TCTGAACGAG AAGGCTCTTG TAACGAAGAA GCCCGACGGT GTATTCACGG CCAGTACAGA TGCGGCTATT ATATCATCGG ATATTTTTGC ACCAAAGGAC AATTTCAAGA AGCTTGTCTT GGATAAGTCT CAGGAAAGCA ATACCAAGTT GCTTCCAAGT ACCGAGTCCC AGAATGGTCC TACGAAACAA GTTACTTTCC AATTGGAGAA ATCCAGCACT TCCGCTGCTA CTGAAGCTAT TGAAAATGGC CAAGACAAAA CCAAGTCCAA GCATAAAAAG AAAGAAGTTG AATCAGAAAA GATCGTTCCT CCAATCACTT CGTCTACACC AGAACATACT CTTACAGATG AAGTTGACGA TAATGGCTAC TGGACATCGC CCCCATTGTC TCAATTGAAG AGCAAATCGT TGTCAGAGTT ACGTTCTATC AAGGGGTTCA AGATTGGACG CAAGCACTAC GGACAAGTCG AATTCTTGGA TCCAGTCGAC TTGTCTAGCA TCGTCAACTT GGATGATATT GCCGGTAATG TGATTATCTT TGGATCCAAG TCGTGTCTCG CTTACCCAGA CGAAAACAGA CCCAAGAAGG GAGAAGGGTT GAATTTGCCA GCGCGAATCA CTTTGGAAGG ATGCTATCCT TTGAACAAGG CTGATAAGAT GCCAATTCTT GATCCTAAGA GCGAGGTCGT AAAATTGCAT ATTGAGAATT TGAAGAAGTT GCCATACATG AACTTCGAGT CATACGACGC AGCCACCGGC AACTGGACAT TCACAGTAGA GTCGATGGAT GCTTGA
|
Protein sequence | LFGSSNTAGS PFGGSGATST FGTGTGTNST SFGASNAAAG GIFGGANKSG FGASTGAFGS GGTSPFGATS GTGFGGVGGT VDSNTNNGTA VKPFAAVSEK DNTGTLNVFQ NICAMPEYKN FSFEELRLKD YEQNRRYGSQ NAAGAAGSTT GGFSFGASTN APSSTSAFGS GGQTGFGAQP NTGSAFGSSG AFGAGQPSSS PFAAQSNTTS TFGQQQQSSP FGGAAAGGSG FGTSNAASVF GSSNTAFGAN KPATGFGATG SSFGNAGTGG GLFGSNNANS NTNANASPFG GTGNTAGGFG QTSNSPFGQS NTGNAFGQSN TGSSPFGTQN NAGGGLFGGA RNNLSNSNAF GASNTNAGFG ASSTGGGLFG GAQNTASTFG ANKPATGGLF GGAGQSNTQA GGLFGQNNQQ QNTSGGLFGQ NNQQQQQQAS GGLFGAKPAG NTGGLFGSSQ TSNTATGGGL FGGNTNQSGA GGLFGAKTNT SGGLFGASQP ASSGNAAFGQ GAANTQQSGG LFGNKPAGAT GGGLFGGSNT VNAAAGASGG LFGGSSTNQG GGLFGSSNNA GGLGTNTQQS GGLFGAKPGN TGTGGGLFGG GSSGTNTLGS STLGTNVLQN NAQQSQQPLV NINSQNPYGN NPIVMFQSIT GSQQTNAAGP SITPVKTVNK KPVTLGNPHR VAPIFRVSAL PKKKSTSSNE KALVTKKPDG VFTASTDAAI ISSDIFAPKD NFKKLVLDKS QESNTKLLPS TESQNGPTKQ VTFQLEKSST SAATEAIENG QDKTKSKHKK KEVESEKIVP PITSSTPEHT LTDEVDDNGY WTSPPLSQLK SKSLSELRSI KGFKIGRKHY GQVEFLDPVD LSSIVNLDDI AGNVIIFGSK SCLAYPDENR PKKGEGLNLP ARITLEGCYP LNKADKMPIL DPKSEVVKLH IENLKKLPYM NFESYDAATG NWTFTVESMD A
|
| |
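The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a 30 bp in-frame fragment taken from the gene sequence above. It builds the codon table by hand instead of relying on a library; `codon_table` and `translate` are hypothetical helper names introduced only for this example.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base slowest).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool) -> dict:
    """Return a codon-to-residue map; optionally apply the table 12 change (CTG -> S)."""
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment copied from the gene sequence in this record.
fragment = "AACACAGGCC TGGCCTTTGG GTCTTCAGGA"

print(translate(fragment, codon_table(ctg_as_serine=True)))   # NTGSAFGSSG
print(translate(fragment, codon_table(ctg_as_serine=False)))  # NTGLAFGSSG
```

Only the table 12 reading reproduces the corresponding stretch of the listed protein sequence (NTGSAFGSSG); the standard code would place a leucine at the fourth position.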