Gene PICST_31476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31476 
SymbolNUP84 
ID4838420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp898505 
End bp901039 
Gene Length2535 bp 
Protein Length844 aa 
Translation table12 
GC content42% 
IMG OID640389735 
ProductNucleoporin NUP84 (Nuclear pore protein NUP84) 
Protein accessionXP_001384477 
Protein GI150865315 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.267975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAGT CACGATGTCG TGATATTTTT CACTACTTAA TTTCGATAAA CAGAACTATT 
CTTCAAGAGT TACCTAAGCT ATCTCCTTTG CCAGCGAGAA AATTGGGCTC TTACACTTGC
AAAACAGTGC ACATGACGAC TACGGAGTTA TTGCCCTTTC ATGCCGTTGG CACGTCACCA
GATGCCGCAG CCCAGGCCGA CTCCAACATA GAGACGCAGT TTGCCAATGT GCTCCATTCA
CTACAAGTGA ACAAACAGAG GGATCCATTT GATATCATTC AGGATTTCAA GCAGATTTGC
GCTGACAGGG CACTTTCTAC TCGTGACAAA CTTTCACTTG ACGAAAGCAA TACCGCACTT
GCTGAAGAGT TTGACAACTG GGACTTGGAG TTCAAACTCT GGGAACTTGT TGATCGGTTG
TTCCGTTTCC GAGCTCTATT CAGCAATAAA GCCAAAACCG AATTGCTTCG TGAATACGAC
TTTTCGTCTA TGGGAATCAA GCAGGAGAAC TTTTTGCGGA AAAATCCAGC GATCAGAGAG
CTTTCCATCA TCATCCTCTG GCTCCAGTCG CATACTCCAT CTCTTGTAGG AGACGAAACC
GAATCTTACC AGACGAAATG GAAGAATACT GCCATGGCCG TTGCAAATTC CGACTTCGAT
GTCTTAGCTA GTCGAGCTAC TGATGCTGAT TTGATTGATA AGCTCGATAT AGATGCACCT
TTGCGTAGTA ATAGATCCAT TCATCCAGCA GATGAATCCA ACGACTCTAA AGTATTTGCT
CAGATCTACA AGTTGCTTCT TCAGGATAGG GTCCAGGATG CAATTGATAT TGCCAACAAT
ACGGGTAATT ATGCTTTGGC TTTGATTCTT GTCGGCGCTA CCCAGGAGTA CTTTGATCCT
GTTCTAGACA AGCAGGATCT GGACTTTGAT TCTATTGTAG CAGAACAGAC AAAACCTTCC
GGTATCAAGC ACAAGCTTTT ATGGAAGAAA ACAGTGTACA AACTCTCGCA ACAGGCGAAC
TTGAACAAGT ACGAGAAGTT GATCTATAAT TACTTGTGCG GCGGTGACAT TTCAGGCAAC
TTACTGGTGG CTTCTGATGA CTGGGAACAG ACTTTACTTT TGTATTTGAG TCAGCTCTAC
TCCTACAACA TTGACAATTT TATTGTATCG CAGTTGTCTT CTGAACAGGA GATTTTGCCT
GTCAACATTC CCAAGCCTCA GCTAAACACG ATTGCTGAAA TCTTGAATAC CATACTGCAC
TCAGGTAATT TGTTGACTCA GCAGAGTGAA AACCCGCTCA GAGTCATCAT GGGAAGCGTG
ATGTTAGATA ATGTCCCTTC TTTATTGCAT AATTTGACCA GTTCACTGAC TGGAGAGCCA
GAAGCTCTCA AGAAACCATA CATTTCCAGA GTGCTAACTC ATTTGGCTAT ATTCCAGCTT
CTAGTAGTAG GTACAGACAA CATCAACAAC GAGGATATCA CGACTATAAT TACCTCGTAC
ACGTCCAAAT TGGCAGAGTA CAAGTTGCCT GAACTTATTC CTATCTATCT ATCGTTTGTA
CCTAACGAAA AGGACTCTCG TGAAGCGTAT GCACTTTTCT TGTCCTCGTT GACAGACTCA
TCTGATCGTA GCAAACAGAT AGAAATCTCG CGCAGAATTG CAAATTCGAT CTCAGAAGTA
GACGAGGCTA TGGAGCTCGC TGCAAATACC CAGGAAGACA AGATGATGAA TGTTTTGCGT
AGAACTGTTG AAAGGGTGAT GAAAGATACT GAATCGCACT ATAAACCACA GGCAGTAATT
GAAGTTCAGG ATGATATCAA CTCTGTAGAT GATATTGATT CCAAGTTGTA CCACTCCGTT
GAGTGGTTCT ATGAGAATAA AATGTATGAG GATGCGATTG TCGCAACTAT TGCCATCATC
AGAAGATTTC TTCTTTGCGG AAAACTAGCT CCTTTAAAAG CATTTAGTAA GGGAAAGAAC
TTCAAGTTGT TATTAGTCGA ATACGATACT CAGCTTCAAA CAAAGAGTTT AATTTTTCAA
AATGAGCCAG AGATCGTCAC AGAAGAAGAC AAAGAGGAAT TGTTAGCGTA CGCATCTCTC
ATAGAAGGTT TGATCTTGAT AGACGAGTGG AAGAAGTTTG TAAGCACTCA AATCAATAGT
CAGGGTAAGT GGCTTTCTAC TGGTGTAGAT AGCTCGATAG ATAAAACAAG CAAGACTTTA
TTGAATTTGA TTTTCAAGTG GTTCAAGAAT ACGTCATCCG AGAAAGATAG CGACTTGGCA
ATCTACTCTG AGTTCAGAAG CATTTATGTT CCGTACTTGA TTATCGAGTT GTTGAAGATC
TTCCAAAATG CAAGAGAAAA GGACTGGAAA TACATGAGAA GCGCCTTCTC GTTGATTAAT
GATGTTGCCA ATGAAGAACA AAACGACTTC TTGAGCTGTT TCTTGAAATG TGGAAGACTC
GATGAGTTTT TGGTGAAGGC CGGAGAGGTC TCGATCGTTG CTGTAGAGAA AGGAATCTCT
GGGATATTCT ACTAG
 
Protein sequence
MSQSRCRDIF HYLISINRTI LQELPKLSPL PARKLGSYTC KTVHMTTTEL LPFHAVGTSP 
DAAAQADSNI ETQFANVLHS LQVNKQRDPF DIIQDFKQIC ADRALSTRDK LSLDESNTAL
AEEFDNWDLE FKLWELVDRL FRFRALFSNK AKTELLREYD FSSMGIKQEN FLRKNPAIRE
LSIIILWLQS HTPSLVGDET ESYQTKWKNT AMAVANSDFD VLASRATDAD LIDKLDIDAP
LRSNRSIHPA DESNDSKVFA QIYKLLLQDR VQDAIDIANN TGNYALALIL VGATQEYFDP
VLDKQDSDFD SIVAEQTKPS GIKHKLLWKK TVYKLSQQAN LNKYEKLIYN YLCGGDISGN
LSVASDDWEQ TLLLYLSQLY SYNIDNFIVS QLSSEQEILP VNIPKPQLNT IAEILNTISH
SGNLLTQQSE NPLRVIMGSV MLDNVPSLLH NLTSSSTGEP EALKKPYISR VLTHLAIFQL
LVVGTDNINN EDITTIITSY TSKLAEYKLP ELIPIYLSFV PNEKDSREAY ALFLSSLTDS
SDRSKQIEIS RRIANSISEV DEAMELAANT QEDKMMNVLR RTVERVMKDT ESHYKPQAVI
EVQDDINSVD DIDSKLYHSV EWFYENKMYE DAIVATIAII RRFLLCGKLA PLKAFSKGKN
FKLLLVEYDT QLQTKSLIFQ NEPEIVTEED KEELLAYASL IEGLILIDEW KKFVSTQINS
QGKWLSTGVD SSIDKTSKTL LNLIFKWFKN TSSEKDSDLA IYSEFRSIYV PYLIIELLKI
FQNAREKDWK YMRSAFSLIN DVANEEQNDF LSCFLKCGRL DEFLVKAGEV SIVAVEKGIS
GIFY