Gene PICST_80670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_80670 
SymbolNUP159 
ID4850797 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp75713 
End bp79820 
Gene Length4108 bp 
Protein Length1203 aa 
Translation table 
GC content40% 
IMG OID640392505 
Productnuclear pore protein 
Protein accessionXP_001387254 
Protein GI126273541 
COG category[N] Cell motility 
COG ID[COG5651] PPE-repeat proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.381762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTTGAAATCT GGAGATATCA GGGTGAAGAA TTTATGGTGT TGTATATAAG AGAATTTTCA 
AAAAGTACAC TTTTGCAATG AGAACCACTA GAAACCAGAG TCAAATAAGA AGTTATCCTA
TAGTGTTGTA AAATAGAAAA TTTATCCGAC TTTAAGTCAC AGTCCGTGGA ATTAAGCCGA
TGCCAAGTTC ATTCAATTAT TCACTCGTTC GCATACTTAA TTATTCATAA GGAGCTTTAC
GTAACTGATT ACGTAATCTA ATACTCAAAA CAGAGAATGG TAATTGACAA TATATGATCT
CTATTCTCTT TCTTTGTTAG GTCGTCCATC AAAATCGCCA TGGATTCAAT TGAGGAAGTC
AATACAGACG ATTTTGGGTT CAGTCTTGCT ACGGATGAGA AGGGATTCTC GGTGTTCGGC
TCCACTATAG ACTTTGAGGT TCACGAAGAG CCTTTGAATT TGCTTGCAAT AGGAAACAAA
TCAGGAATCA TTGCCATTTC CAACGTTACC ACCTTGTGTT TAGACACTCT AAGTGCAGTA
GACAAGTTAA TTGAAGAGAG CCACAATAGT CCGGAAGCTG AGGAAGCCAA AGAGACTCTT
TTGCGACTGG TAGCTGGAGA TGACCTAGAG ATCAAACAAG TTTTTTTCAG TCCTGATGAA
CAAACTCTAT TTGTTGTGAA TAACAACAAG TTGCAAAAGT TAAGCGCAAA GAAATTTATT
GCTGGCGAGG GTTCCAGTTT GGCTGATTAC GAAATTGGGG TAGCAATTAT CAAAAATGTG
ATCCCTTCAC CTGTTCACAA TGAGAAGTTC CTCGTTTTGG ACGATTCCAA TACGCTTTTC
TTATGGGAGA AAAGCAGCTT GGAAACTGTA GGCTCAAACA TTGCTTCAGC GTGCTGGTCT
AGAGACGGAA GTTCGTATTC GTACATTATA AACAATACAA CGACTATCAC AAATAGTGCA
GATTCGACGC ACATACCAAT AGACGTGTCT CAGGACGAAG AATACGAGTC GGACAATTAT
CATATTAAGC AGATCTTCGA CCTTAAAACC AAATTTATAG CTATCTTTGA GCTGAATGAC
TCGGAATTGG ACTACCATAC TACCAAGGCA TACTATATAG AAAAGACAAG TTCTGGCTAT
GCTACTCAGA AACTAGAGTT AGAACCCCCG TCGAATACCG TTTCTCGTCA CTTGACTTAC
TACACGACCT GGATATCCGA ATGGAAGAGA AACGAGACTC TTTTCTTCTT TTCTTCAGGC
TCATCCACAG ATGTTGATTC ACTAGTTTCA GACGCTTCTG GATTCAAAAT CTTGCAATTA
GAGGATACAA ACAGAGCGCT GTTCCCCATT GATGATGAGG CAGATGTAGA CACTTCACCT
GTAGGTATGG CCGTTGATTT GCGAGCCTTT GAAGCTGTCG TCAAGGAACC TTGTTCAGGT
GTAGAAGAAG TCAAAGGAAA GTTGCCTAAG GTATATACAT TATTGCACAC GGGAAAGTTA
ATTGCCTGGT GGGTGTTTGA CAAACACGGC GTATTAGATG GGACCGTTGA CTTGCAGAGA
GCATATTCTG CTTTTCCTGC TGTTGAAAGG GACATCTCTT CATCTATTTC TGTAATTAAG
GAGACACCAT CCTCTCAATC AGAAGATTCT ACAACTAAAG GCACCTTTGA TGGTTCCATT
GGATTTGGAA GTAGTTCTAC TAGATTTGGC AGTAGCACTG GAGCAAATGA AGTTGCTGGT
GGTGCTTTTG GTAGTACCTC TGGAGCATTT GGAAACTTGC TGACATCTGA TAAAACTCCT
AGTAAAACGG CAGAAGAAGG ACCAACACTG TTTGGAAGTG TTGCCTTTGG ATCCGCAGCT
CCAACTTCAC AACCAGCTGC CTTTGGATCA TCACCGCAGG CTTCGAAACC AGTTGCTTCT
GGTATTGGGG CTACACCTGC ATTTGGATCT GTAGGCTTTG GAGCACCTTC TTTTGGATCT
ACCAGTTTTG TTAATGCAAA TTCCAGTGTT ATTCCAACAG CTTCTCTGAA ATCAACCTTT
TCTAACTACG CCAATACTCC CTCTGGGTTT GGAGCATCCA CTGCGAAAGC ATCACCATTT
GGAAACCTTA CTTCAGGTTC AGGAAGTTCT CCATTTGCCA GTATAGGTAG TGGTGACAAG
GAGTCACCAT TTGGCAAACT CCAAGAATCA AAATCAATAT TCGGACAGGA GAGTAACGAA
GGTAAGCCTT CTAGTTTTGG CAGCACCGCG AATGTGGAAT CCCCATTTGC TAAGCTTGGT
AATTTGGACA TAAACAAAGA TGGTTTTGGT GAACCAAAAG AATCTCCATT CGCTTCACTT
GCACAGGATA CAAAACCAAA AGAATCTCCA TTCGCTTCTT TTGCACAAGA TACAAAACAG
GTGTCTCCAT TTTCTTCCCT TGCACAAGAC AAAAAACCAA GCGATTCTCC GTTTTCTTCT
CTTGGACAGG AAAGAGAATC AAAGGAATCT CCATTCGCTT CTTTAAAGCA GACTACTAAA
TTCGAATCAC CATTTGCGAA GTTAGAACCT TTTAAACGAG GTGAAAAACA ACTCACAGAA
TTGGAATCCT TACGTAAACA CGATTCTGCG GAGAGTGAGT CAACCATTGG AAGTTATGTC
AATGTTGAAG GATCCAATGA GCCACTGTTT GGAGATCTTG CGATAATTGA CAACTCAGAT
AAAGCTGCCT TTGGCAGCAA TAAATCAGCG TTTGGAGGCT TCGGTGGTTC GTCAACAAAA
TCTTTTGGCT CCGCCTTTGG TTCATCCTTG ACAACTACAC AACAAGCAGT ACAGGATTCT
CTGGATTCAG ATGAGACATA CGACAGTGAG GAATCATCAC AAGAAGAAAA TCTGCAAGCT
GTGGATGTAA GAGCTGAAGA AGTTCGTGAA CATTCATATA CTATAGCAGG AGGAAGAGGA
AATTACATAA ATGATTCGAC TTCTTACGAT GAACAAATTC CTGACAAGCT GACGCTCAAC
AAGGAAGCAA ACAAGCTCAA TGAATTCATG TTGTCCAGAG GTACCAGAGA AAAAATTGTT
GAAGATTCGT CTGAAAGCTA CGAAGAAATG GCTCAACATC ATTCAGGCGG ATATATTAGC
GACGAGAGTT ATGAAGATTT GGAAGAAGAA GACGAAATTG AAAAATTACT AGCAAATAGA
CCACCGGCAA ATCCTGAACT TTTGAAGTTT GATGGCCTTG AAAAAGGTAT TAAAGCTACG
AAGAACCCAA TTGAAGATAT GATATCAACC ATCTTCCAGA ATACTACTGG CCAATTAAAG
ATATTGGAGA GAAACAGTGA CAAGATCATT GGTTTTATTG ATGAACACGA CTACGAGACC
TCTTATTCTG ATGCTGCTCT CAAATACCCT GACTACTGGC ATTTGGCCTC CTCACACAAC
ATTGGAATTC TTGCAAAAGA AGAGATTCAG GATATCACTG CAATCATAGA ACAAGCAGAA
TTGCAAGAAA CAAAGTCTAA GAAGTTGGAA GATGAGGTGA AACTATTGCA ACAGAAGAGA
ATCCAATTGG ACAAGCTTAT CAGCCACTTA TCTATAATCA GCAAGTCTGA AACTGATCCG
TTGTTGAAGA GTAGGCCCTT GGATCTTGCC AACGAAGCAC TTCAAGTCAG CATTCGTAAG
AAGTTGACTA GAGTCAAGCT GTTGGAAAGG GAGTTGATAT CCAAGATGAT GCCTTTAAAG
GCTAGATGTT CCGTTAATGA AGGAATTGCT CTGAACCTTG AGAAGGTTAC TCTTAAACTC
CACAGCAACG TTGCTGATCA AAGGGCCAGG ATCGACGTTT TGTTGAAAGA AGTAGAGGAG
TTGTCAGTTA ATGAAAAGAA GGAGATTCCT CTCATAGAAG CATCGTATAA CACTGGTTCG
ATCAAAGCCA TTGCGAAAAC GAGATTGAGT AATCGTTTGA AAGATTCTTC TAAAGTCACA
AAAGTGAAGT TCTAATACAG AAATAATTCA ATTTTGTATT CTAGCAACGC GAGCCAGCGA
CTATAACATA TAGTGCTCTT ATATATATAA TTCCGTATTA ATGTTTCAAG AGGAGCTCTC
TAAATGGCCA AATATACCAT ATATATTC
 
Protein sequence
MDSIEEVNTD DFGFSLATDE KGFSVFGSTI DFEVHEEPLN LLAIGNKSGI IAISNVTTLC 
LDTLSAVDNP EAEEAKETLL RLVAGDDLEI KQVFFSPDEQ TLFVVNNNKL QKLSAKKFIA
GEGSSLADYE IGVAIIKNVI PSPVHNEKFL VLDDSNTLFL WEKSSLETVG SNIASACWSR
DGSSYSYIIN NTTTITNSAD STHIPIDVSQ DEEYESDNYH IKQIFDLKTK FIAIFELNDS
ELDYHTTKAY YIEKTSSGYA TQKLELEPPS NTVSRHLTYY TTWISEWKRN ETLFFFSSGS
STDVDSLVSD ASGFKILQLE DTNRALFPID DEADVDTSPV GMAVDLRAFE AVVKEPCSGV
EEVKGKLPKV YTLLHTGKLI AWWVFDKHGV LDGTVDLQRA YSAFPAVERD ISSSISVIKE
TPSSQSEDST TKGTFDGSIG FGSSSTRFGS STGANEVAGG AFGSTSGAFG NLLTSDKTPS
KTAEEGPTLF GSVAFGSAAP TSQPAAFGSS PQASKPVASG IGATPAFGSV GFGAPSFGST
SFVNANSSVI PTASLKSTFS NYANTPSGFG ASTAKASPFG NLTSGSGSSP FASIGSGDKE
SPFGKLQESK SIFGQESNEG KPSSFGSTAN VESPFAKLGN LDINKDGFGE PKESPFASLA
QDTKPKESPF ASFAQDTKQV SPFSSLAQDK KPSDSPFSSL GQERESKESP FASLKQTTKF
ESPFAKLEPF KRGEKQLTEL ESLRKHDSAE SESTIGSYVN VEGSNEPLFG DLAIIDNSDK
AAFGSNKSAF GGFGGSSTKS FGSAFGSSLT TTQQAVQDSL DSDETYDSEE SSQEENLQAV
DVRAEEVREH SYTIAGGRGN YINDSTSYDE QIPDKLTLNK EANKLNEFML SRGTREKIVE
DSSESYEEMA QHHSGGYISD ESYEDLEEED EIEKLLANRP PANPELLKFD GLEKGIKATK
NPIEDMISTI FQNTTGQLKI LERNSDKIIG FIDEHDYETS YSDAALKYPD YWHLASSHNI
GILAKEEIQD ITAIIEQAEL QETKSKKLED EVKLLQQKRI QLDKLISHLS IISKSETDPL
LKSRPLDLAN EALQVSIRKK LTRVKLLERE LISKMMPLKA RCSVNEGIAL NLEKVTLKLH
SNVADQRARI DVLLKEVEEL SVNEKKEIPL IEASYNTGSI KAIAKTRLSN RLKDSSKVTK
VKF