Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80670 |
Symbol | NUP159 |
ID | 4850797 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 75713 |
End bp | 79820 |
Gene Length | 4108 bp |
Protein Length | 1203 aa |
Translation table | |
GC content | 40% |
IMG OID | 640392505 |
Product | nuclear pore protein |
Protein accession | XP_001387254 |
Protein GI | 126273541 |
COG category | [N] Cell motility |
COG ID | [COG5651] PPE-repeat proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.381762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTTGAAATCT GGAGATATCA GGGTGAAGAA TTTATGGTGT TGTATATAAG AGAATTTTCA AAAAGTACAC TTTTGCAATG AGAACCACTA GAAACCAGAG TCAAATAAGA AGTTATCCTA TAGTGTTGTA AAATAGAAAA TTTATCCGAC TTTAAGTCAC AGTCCGTGGA ATTAAGCCGA TGCCAAGTTC ATTCAATTAT TCACTCGTTC GCATACTTAA TTATTCATAA GGAGCTTTAC GTAACTGATT ACGTAATCTA ATACTCAAAA CAGAGAATGG TAATTGACAA TATATGATCT CTATTCTCTT TCTTTGTTAG GTCGTCCATC AAAATCGCCA TGGATTCAAT TGAGGAAGTC AATACAGACG ATTTTGGGTT CAGTCTTGCT ACGGATGAGA AGGGATTCTC GGTGTTCGGC TCCACTATAG ACTTTGAGGT TCACGAAGAG CCTTTGAATT TGCTTGCAAT AGGAAACAAA TCAGGAATCA TTGCCATTTC CAACGTTACC ACCTTGTGTT TAGACACTCT AAGTGCAGTA GACAAGTTAA TTGAAGAGAG CCACAATAGT CCGGAAGCTG AGGAAGCCAA AGAGACTCTT TTGCGACTGG TAGCTGGAGA TGACCTAGAG ATCAAACAAG TTTTTTTCAG TCCTGATGAA CAAACTCTAT TTGTTGTGAA TAACAACAAG TTGCAAAAGT TAAGCGCAAA GAAATTTATT GCTGGCGAGG GTTCCAGTTT GGCTGATTAC GAAATTGGGG TAGCAATTAT CAAAAATGTG ATCCCTTCAC CTGTTCACAA TGAGAAGTTC CTCGTTTTGG ACGATTCCAA TACGCTTTTC TTATGGGAGA AAAGCAGCTT GGAAACTGTA GGCTCAAACA TTGCTTCAGC GTGCTGGTCT AGAGACGGAA GTTCGTATTC GTACATTATA AACAATACAA CGACTATCAC AAATAGTGCA GATTCGACGC ACATACCAAT AGACGTGTCT CAGGACGAAG AATACGAGTC GGACAATTAT CATATTAAGC AGATCTTCGA CCTTAAAACC AAATTTATAG CTATCTTTGA GCTGAATGAC TCGGAATTGG ACTACCATAC TACCAAGGCA TACTATATAG AAAAGACAAG TTCTGGCTAT GCTACTCAGA AACTAGAGTT AGAACCCCCG TCGAATACCG TTTCTCGTCA CTTGACTTAC TACACGACCT GGATATCCGA ATGGAAGAGA AACGAGACTC TTTTCTTCTT TTCTTCAGGC TCATCCACAG ATGTTGATTC ACTAGTTTCA GACGCTTCTG GATTCAAAAT CTTGCAATTA GAGGATACAA ACAGAGCGCT GTTCCCCATT GATGATGAGG CAGATGTAGA CACTTCACCT GTAGGTATGG CCGTTGATTT GCGAGCCTTT GAAGCTGTCG TCAAGGAACC TTGTTCAGGT GTAGAAGAAG TCAAAGGAAA GTTGCCTAAG GTATATACAT TATTGCACAC GGGAAAGTTA ATTGCCTGGT GGGTGTTTGA CAAACACGGC GTATTAGATG GGACCGTTGA CTTGCAGAGA GCATATTCTG CTTTTCCTGC TGTTGAAAGG GACATCTCTT CATCTATTTC TGTAATTAAG GAGACACCAT CCTCTCAATC AGAAGATTCT ACAACTAAAG GCACCTTTGA TGGTTCCATT GGATTTGGAA GTAGTTCTAC TAGATTTGGC AGTAGCACTG GAGCAAATGA AGTTGCTGGT GGTGCTTTTG GTAGTACCTC TGGAGCATTT GGAAACTTGC TGACATCTGA TAAAACTCCT AGTAAAACGG CAGAAGAAGG ACCAACACTG TTTGGAAGTG TTGCCTTTGG ATCCGCAGCT CCAACTTCAC AACCAGCTGC CTTTGGATCA TCACCGCAGG CTTCGAAACC AGTTGCTTCT GGTATTGGGG CTACACCTGC ATTTGGATCT GTAGGCTTTG GAGCACCTTC TTTTGGATCT ACCAGTTTTG TTAATGCAAA TTCCAGTGTT ATTCCAACAG CTTCTCTGAA ATCAACCTTT TCTAACTACG CCAATACTCC CTCTGGGTTT GGAGCATCCA CTGCGAAAGC ATCACCATTT GGAAACCTTA CTTCAGGTTC AGGAAGTTCT CCATTTGCCA GTATAGGTAG TGGTGACAAG GAGTCACCAT TTGGCAAACT CCAAGAATCA AAATCAATAT TCGGACAGGA GAGTAACGAA GGTAAGCCTT CTAGTTTTGG CAGCACCGCG AATGTGGAAT CCCCATTTGC TAAGCTTGGT AATTTGGACA TAAACAAAGA TGGTTTTGGT GAACCAAAAG AATCTCCATT CGCTTCACTT GCACAGGATA CAAAACCAAA AGAATCTCCA TTCGCTTCTT TTGCACAAGA TACAAAACAG GTGTCTCCAT TTTCTTCCCT TGCACAAGAC AAAAAACCAA GCGATTCTCC GTTTTCTTCT CTTGGACAGG AAAGAGAATC AAAGGAATCT CCATTCGCTT CTTTAAAGCA GACTACTAAA TTCGAATCAC CATTTGCGAA GTTAGAACCT TTTAAACGAG GTGAAAAACA ACTCACAGAA TTGGAATCCT TACGTAAACA CGATTCTGCG GAGAGTGAGT CAACCATTGG AAGTTATGTC AATGTTGAAG GATCCAATGA GCCACTGTTT GGAGATCTTG CGATAATTGA CAACTCAGAT AAAGCTGCCT TTGGCAGCAA TAAATCAGCG TTTGGAGGCT TCGGTGGTTC GTCAACAAAA TCTTTTGGCT CCGCCTTTGG TTCATCCTTG ACAACTACAC AACAAGCAGT ACAGGATTCT CTGGATTCAG ATGAGACATA CGACAGTGAG GAATCATCAC AAGAAGAAAA TCTGCAAGCT GTGGATGTAA GAGCTGAAGA AGTTCGTGAA CATTCATATA CTATAGCAGG AGGAAGAGGA AATTACATAA ATGATTCGAC TTCTTACGAT GAACAAATTC CTGACAAGCT GACGCTCAAC AAGGAAGCAA ACAAGCTCAA TGAATTCATG TTGTCCAGAG GTACCAGAGA AAAAATTGTT GAAGATTCGT CTGAAAGCTA CGAAGAAATG GCTCAACATC ATTCAGGCGG ATATATTAGC GACGAGAGTT ATGAAGATTT GGAAGAAGAA GACGAAATTG AAAAATTACT AGCAAATAGA CCACCGGCAA ATCCTGAACT TTTGAAGTTT GATGGCCTTG AAAAAGGTAT TAAAGCTACG AAGAACCCAA TTGAAGATAT GATATCAACC ATCTTCCAGA ATACTACTGG CCAATTAAAG ATATTGGAGA GAAACAGTGA CAAGATCATT GGTTTTATTG ATGAACACGA CTACGAGACC TCTTATTCTG ATGCTGCTCT CAAATACCCT GACTACTGGC ATTTGGCCTC CTCACACAAC ATTGGAATTC TTGCAAAAGA AGAGATTCAG GATATCACTG CAATCATAGA ACAAGCAGAA TTGCAAGAAA CAAAGTCTAA GAAGTTGGAA GATGAGGTGA AACTATTGCA ACAGAAGAGA ATCCAATTGG ACAAGCTTAT CAGCCACTTA TCTATAATCA GCAAGTCTGA AACTGATCCG TTGTTGAAGA GTAGGCCCTT GGATCTTGCC AACGAAGCAC TTCAAGTCAG CATTCGTAAG AAGTTGACTA GAGTCAAGCT GTTGGAAAGG GAGTTGATAT CCAAGATGAT GCCTTTAAAG GCTAGATGTT CCGTTAATGA AGGAATTGCT CTGAACCTTG AGAAGGTTAC TCTTAAACTC CACAGCAACG TTGCTGATCA AAGGGCCAGG ATCGACGTTT TGTTGAAAGA AGTAGAGGAG TTGTCAGTTA ATGAAAAGAA GGAGATTCCT CTCATAGAAG CATCGTATAA CACTGGTTCG ATCAAAGCCA TTGCGAAAAC GAGATTGAGT AATCGTTTGA AAGATTCTTC TAAAGTCACA AAAGTGAAGT TCTAATACAG AAATAATTCA ATTTTGTATT CTAGCAACGC GAGCCAGCGA CTATAACATA TAGTGCTCTT ATATATATAA TTCCGTATTA ATGTTTCAAG AGGAGCTCTC TAAATGGCCA AATATACCAT ATATATTC
|
Protein sequence | MDSIEEVNTD DFGFSLATDE KGFSVFGSTI DFEVHEEPLN LLAIGNKSGI IAISNVTTLC LDTLSAVDNP EAEEAKETLL RLVAGDDLEI KQVFFSPDEQ TLFVVNNNKL QKLSAKKFIA GEGSSLADYE IGVAIIKNVI PSPVHNEKFL VLDDSNTLFL WEKSSLETVG SNIASACWSR DGSSYSYIIN NTTTITNSAD STHIPIDVSQ DEEYESDNYH IKQIFDLKTK FIAIFELNDS ELDYHTTKAY YIEKTSSGYA TQKLELEPPS NTVSRHLTYY TTWISEWKRN ETLFFFSSGS STDVDSLVSD ASGFKILQLE DTNRALFPID DEADVDTSPV GMAVDLRAFE AVVKEPCSGV EEVKGKLPKV YTLLHTGKLI AWWVFDKHGV LDGTVDLQRA YSAFPAVERD ISSSISVIKE TPSSQSEDST TKGTFDGSIG FGSSSTRFGS STGANEVAGG AFGSTSGAFG NLLTSDKTPS KTAEEGPTLF GSVAFGSAAP TSQPAAFGSS PQASKPVASG IGATPAFGSV GFGAPSFGST SFVNANSSVI PTASLKSTFS NYANTPSGFG ASTAKASPFG NLTSGSGSSP FASIGSGDKE SPFGKLQESK SIFGQESNEG KPSSFGSTAN VESPFAKLGN LDINKDGFGE PKESPFASLA QDTKPKESPF ASFAQDTKQV SPFSSLAQDK KPSDSPFSSL GQERESKESP FASLKQTTKF ESPFAKLEPF KRGEKQLTEL ESLRKHDSAE SESTIGSYVN VEGSNEPLFG DLAIIDNSDK AAFGSNKSAF GGFGGSSTKS FGSAFGSSLT TTQQAVQDSL DSDETYDSEE SSQEENLQAV DVRAEEVREH SYTIAGGRGN YINDSTSYDE QIPDKLTLNK EANKLNEFML SRGTREKIVE DSSESYEEMA QHHSGGYISD ESYEDLEEED EIEKLLANRP PANPELLKFD GLEKGIKATK NPIEDMISTI FQNTTGQLKI LERNSDKIIG FIDEHDYETS YSDAALKYPD YWHLASSHNI GILAKEEIQD ITAIIEQAEL QETKSKKLED EVKLLQQKRI QLDKLISHLS IISKSETDPL LKSRPLDLAN EALQVSIRKK LTRVKLLERE LISKMMPLKA RCSVNEGIAL NLEKVTLKLH SNVADQRARI DVLLKEVEEL SVNEKKEIPL IEASYNTGSI KAIAKTRLSN RLKDSSKVTK VKF
|
| |