Gene PICST_88123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88123 
SymbolNSP1 
ID4837117 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp445669 
End bp448344 
Gene Length2676 bp 
Protein Length727 aa 
Translation table12 
GC content43% 
IMG OID640388432 
ProductNucleoporin NSP1 (Nuclear pore protein NSP1) (Nucleoskeletal-like protein) (p110) 
Protein accessionXP_001382320 
Protein GI150863747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.26666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.354188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TAAAAGAACA GCAACCTATT GTATTGCAGC CTTTGTCTGT CACACATACA AAGCACAGTT 
AGTGGTAGAT CGAAGAAAAC CTACTGATTC TAACGATAGT TCCTATACTT TGATTCAAAA
GTAGTTCACT AGAAATCTTA CATAGTATAA GAATCTGTTC AACAAGCACA ATTACATTTC
AATCGCTTGA TCGTTTTCTT CATCAATTCA TTGAAGAACA TCTATTATCA TTTACGGTTG
TAGCGGATTG CCAAAAAAAA ATATAATTTA GTTATTCATT ATTCATTGAA TTCACATTTA
TACTCCTAAG ATATGTTTGG TTCCAATAAC GACCCACAAA AACCACCTGC TTTCAGTTTC
GGAAACCCAT CCACTTCCAA CACCTCTGGT GAAGCCCTGA AACCACCTGC CTTCAGTTTT
GGAGCTCCTT CCACGTCTAA TACTTCTGGT GATGCTTCCA AACCTCCAGC CTTTAGTTTT
GGCTCAGGTG CCTCCAAGCC TCTGGGATTT GGTTCCCTTA ACAGTAGCGG CACTTCTACA
CCCGCATTTG GAACATCAGG TCCCTCTTTT GGAACTTCTG GCTCTGGTTC TACTAGTGCT
TTTGGAGCTT CTGCTAAGGA CAACACTGCT GCTCCTTCTG GTGGCTTGTT TGGATCTTCT
AACGCAGGCA CTTCCACCCC TTTTGGACAA GCTTCAAACG CTCCTCCAGC TAGTGGAAGT
CTCTTTGGAA CTAGCAACAG TAATACGTCT ACAACTCTAG CACCTTCGGG AGGTTTGTTT
GGTAAAAAAG ACGACAAACC TGCGACACCT GCTGCTGATT CTGGCGCTTT GTTCAGTTCG
GCATCGACAA CTACAGGAAA TACTGGTTTT AGTTTTGGGC AGAAACCAGA AGAAAAGAAG
ACCACGGGCT CTGTAACTCC GTTCTCCACT TCTTCTACAC CTCAATTGGG TGATGGATCC
TCTACTTTTG GTGCCAGTTC CAAACCTGCT GAAGGAGGCG AGAAGCCAGC ATTCAGCTTT
GGTGCACCTA AGACGGCTGC TTCAGACAAA CCAGCAGTGT CGTTTAGTAC AACCTTAAAC
AGCAGCGACT CTAATATTCC TAAACTTGCT TTAGGAACGT CAACTATAGA TGAGACTAAA
AAAGATTCGG AAGCTTCGAA GCCTGCCTTT TCTTTTGGTG GTGCAGCCAA ATCTACGACA
GCTCCTGGTG GTTTATTCGG TGGAAGTGCT CCTTCTTCAT CTTCTTCCGG TGGGGGATTT
TCATTTGGTG GTAATAAGGA CGAAGAGAAG AAAGACGACA AGCCTGCCTC TGGTGGATTA
TTTGCTAGTA GTACGTCTTC TGCAGCTCCC GCTGCCGCAG GTGGTTTTTC TTTTGGAACC
AAATCTGAAG AAAAGAAAGA TGATAAACCA GCTGCTGGTG GATTGTTTGG GGGAAGTAGT
TCTACACAAG GCGGGTTTTC ATTTGGCGCC AAGAAGGATG AAACCAAAAA AGATGACAAA
CCAGCCACTC TGGGATTTTC GTTTGGAGCA AAGGATAATG AAGGAAAGAG TGATAAGGCT
ACCTCTTCAG GATTCTCTTT TGGCAAGCAG AAGGACGACA ACAAAACCAG TGAAAAATCA
ACAACTGGAG GTTTCTCTTT TGGAACTAAG AAAGATGAAG AAAAGAAGGA TTCCGCTCCT
GCTGGACCTT CTGGAGGATT CAGTCTTGGT GCCAAAAAGT CAGATGAAAC TACTGATGCT
AAAAAGACTG AAGTTACAGG TAGCACCGCT ACTTCTACCA CAGCCACCAC TACAATTGAT
CCAAGTCTAA AGCCCACCAA AATCCAGCCT ATAGAAGTGT CGATTGACAA CAAGACTCTT
GACGAGCTTA TCATAAAATG GTCTAAACAG TTAGCTGGAA CGTCCAAGAT TTTTAACACA
TACACCGACA AGGTGAGAGA ATGGGACCAA CAATTAGTTG TGAGTGGAGA TGAGATCTTG
AGATTGAACC AGGATTCGAT TGAAGCTGAA GCATTGCAGA GTAAGATCGA TCAACACTTG
TTGTTTGTAG AAAGCCAGCA GAATGAGTTG GAAAAGGTGT TAGATAACTA TGAACAACAA
GCCGATATCT TGTTGAACAA CATAGAATTA AATTCCAGCA GCAATAGTAT CACCAATCAC
TCTGCCGATG CTGGTGGATC TGGATCTGGT GCTGATGATG GTGGCTCTTC AGCTGGCAAC
AATGGGGACA GTAGCACGCT AAGTGTCACT GATAAGTTAA GAGAGAAGGC ATACCATTCT
GCTGAATTGT TGGACGAGAG ACTCGACAAC TTGGGCGACA ACTTATCGAC ATTGATCAGC
GAGATCAATG CTGTCAGTGA AGTTTTTACC AAGAACTCCA TCAACGAGTT TGCTCTTCCA
GCACCTGCAA ATGGTTCTCA AAATGGAAGC GCCTCTGAGA GCAAGTCCGG TAACGGCGAT
GAAAATCCTA TCGAAGAGAT AGTCAAGTTG TTGAACTTGC ATTTGGACAA CTTGAAGTAC
ATCGAAGACA CCGAAGAGAA GTTGAAGACC AAGATCAACA AGATCAATAA TATTAAGAAG
ACACATTAGG AGAGAATATG CAATAGCATA GTAGATACAT AGTGTAATGT AGGTATATTA
TATTAGTAGA TATATACATA GAAGAAGAAT GTTCTC
 
Protein sequence
MFGSNNDPQK PPAFSFGNPS TSNTSGEASK PPAFSFGAPS TSNTSGDASK PPAFSFGSGA 
SKPSGFGSLN SSGTSTPAFG TSGPSFGTSG SGSTSAFGAS AKDNTAAPSG GLFGSSNAGT
STPFGQASNA PPASGSLFGT SNSNTSTTLA PSGGLFGKKD DKPATPAADS GALFSSASTT
TGNTGFSFGQ KPEEKKTTGS VTPFSTSSTP QLGDGSSTFG ASSKPAEGGE KPAFSFGAPK
TAASDKPAVS FSTTLNSSDS NIPKLALGTS TIDETKKDSE ASKPAFSFGG AAKSTTAPGG
LFGGKKKDDK PASGGLFASS TSSAAPAAAG GFSFGTKSEE KKDDKPAAGG FSFGAKKDET
KKDDKPATSG FSFGAKDNEG KSDKATSSGF SFGKQKDDNK TSEKSTTGGF SFGTKKDEEK
KDSAPAGPSG GFSLGAKKSD ETTDAKKTEV TGSTATSTTA TTTIDPSLKP TKIQPIEVSI
DNKTLDELII KWSKQLAGTS KIFNTYTDKV REWDQQLVVS GDEILRLNQD SIEAEALQSK
IDQHLLFVES QQNELEKVLD NYEQQADILL NNIELNSSSN SITNHSADAG GSGSGADDGG
SSAGNNGDSS TLSVTDKLRE KAYHSAELLD ERLDNLGDNL STLISEINAV SEVFTKNSIN
EFALPAPANG SQNGSASESK SGNGDENPIE EIVKLLNLHL DNLKYIEDTE EKLKTKINKI
NNIKKTH