Gene PICST_66703 details

Gene Information

Locus tag: PICST_66703
Symbol: NUP49
ID: 4851921
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3189175
End bp: 3190637
Gene Length: 1463 bp
Protein Length: 402 aa
Translation table:
GC content: 44%
IMG OID: 640393629
Product: Nucleoporin NUP49/NSP49 (Nuclear pore protein NUP49/NSP49)
Protein accession: XP_001386936
Protein GI: 126276026
COG category:
COG ID:
TIGRFAM ID:
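
The start, end, and length values in this record are consistent with 1-based, inclusive coordinates. The short Python sketch below checks that arithmetic; the function name `gene_length` is illustrative and not part of the database, and the inclusive-coordinate convention is an assumption inferred from the numbers listed here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 3189175, 3190637
print(gene_length(start_bp, end_bp))  # 1463, matching the listed Gene Length
```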


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTTTGGTTTA GGAAAAGAAC TCACAAGGAT AGAATATCAT CATATGTTTG GAACTGCCAA 
TAATTCACAG GCTCCAACGT CCGGTTTCGG TTTTGGTGGT GCCAATTCTA CCGGTTCAGG
CTTCGGAGCC AAACCCAGTG GAGGTCTTTT TGGCGCCAAT CAAACAACAA ATACCGGCCC
AGGCACTTTT GGCAGTGGAA ATGCCTTTGG AAACAATGCC AATAACCAAC AGACAGCAGG
ATCTGGTGGC TTGTTTGGCG CTTCTGGCCA GAATCAACAG CAGCCGACAC AGAACCAGAA
CCAACAAGGA GGAGGATTGT TTGGAAGCAA TAGCAATACG GCCGGAACTA GTAGTGGAGG
TCTTTTCGGA AGCAAACCTG CCGCTGGTGG ATTATTTGGT GGTTCCACTG GGGCTGCTAC
GACAGGTCTT TTTGGAGGTC AAAACCAAAC TCAAAATCCA CAAAACCAGC AAAACCAACA
GAATACGGGA CTTTTTGGTA GTAAACCTGC TGTTGGCGGT GGATTATTTG GAGCAAGTAC
AAGTGGACAG ACTCCTGCTG CGACCGGCGG CTTATTTGGT GGAAATACGG CTAACACGGC
CTCTTCTACC ATGGGTGGAG GGTTATTTGG AGGATCTGCA GTAGGAAATA CACAGCAGAA
CAAACCACTT TTTGGAGGAT TGGGCGCTTC TGGTAGCTCT GGAACAACTG GAGGTTTGTT
CGGAGGTTCG ACTGCAAATC CTGGTGGCTT GTTCTCTCAA CAGAATCAAA ATCAACAGAA
TCAATTTCAA CAACAACAAC AGAATCAGCA ACAAAATCAA CAGCAGTTGA CTGCAATGAC
GAGAGTGGGC GATTTGCCTC CGGCTATAAG GCAAGAGCTC GAGGAGTTTG ATCGTTACAT
TAACAAGCAA CATCTTGTAG CGACTACTTT ACAAGCTGAC TATGGCAAAC ACGACCAGCT
CATCAATACT ATTCCCAAGG ATATCAATTA TCTTCATAAC AAGCTTATGT CGACAAAACA
GGCGCTTAAA TTCGACTCTG GACAACTAGT TCATCTCAAG GAGCTCAATA ACGAAATCAC
AGACGACATC TCAAAGATAA TGCAACTCAT ATTACAGTTA TCTACACCTG GAACACGTCT
TTCTTCTTCT TTCCAGTTAA ATGAATTCTT TGTCAAGAAG ATCAAGAAGT ACTACGAGAT
TTTGCGTCAG TACGAGGGAG TCGTCGCTGA ACTAGATTCA ATTCTCGGTG GCTTGGAAAG
ACTGTGTACG GAAGGTTTTG GTAACTTGTT TAATATAGTA GAGGTTATCA AGTCGCAGTA
CCATTTGTTC ATGGAGTTGT GTGAAACGAT GGCTCAACTT CATAATGAGG TGAACAAGTT
GTCGAAGTAG GATCACGATG TATTATAATA AAGCAGCATA GATAGCAGTA GACTATCATA
AAAAGTGAGA TATTCTGTTA TGG
 
Protein sequence
MFGTANNSQA PTSGFGFGGA NSTGSGFGAK PSGGLFGANQ TTNTGPGTFG SGNAFGNNAN 
NQQTAGSGGL FGASGQNQQQ PTQNQNQQGG GLFGSNSNTA GTSSGGLFGS KPAAGGLFGG
STGAATTGLF GGQNQTQNPQ NQQNQQNTGL FGSKPAVGGG LFGASTSGQT PAATGGLFGG
NTANTASSTM GGGLFGGSAN QFQQQQQNQQ QNQQQLTAMT RVGDLPPAIR QELEEFDRYI
NKQHLVATTL QADYGKHDQL INTIPKDINY LHNKLMSTKQ ALKFDSGQLV HLKELNNEIT
DDISKIMQLI LQLSTPGTRL SSSFQLNEFF VKKIKKYYEI LRQYEGVVAE LDSILGGLER
LCTEGFGNLF NIVEVIKSQY HLFMELCETM AQLHNEVNKL SK
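
Both sequences above are printed in space-separated groups of ten residues, so they need to be joined before any length or composition check. The following is a minimal Python sketch of that cleanup, plus a simple GC percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in ("G", "C"))
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied as displayed (groups of ten bases).
first_line = "CTTTGGTTTA GGAAAAGAAC TCACAAGGAT AGAATATCAT CATATGTTTG GAACTGCCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence block into `clean_sequence` and checking `len(...)` against the listed gene length is a quick way to confirm that nothing was lost when copying.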