Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_88123 |
Symbol | NSP1 |
ID | 4837117 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 445669 |
End bp | 448344 |
Gene Length | 2676 bp |
Protein Length | 727 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388432 |
Product | Nucleoporin NSP1 (Nuclear pore protein NSP1) (Nucleoskeletal-like protein) (p110) |
Protein accession | XP_001382320 |
Protein GI | 150863747 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.26666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.354188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TAAAAGAACA GCAACCTATT GTATTGCAGC CTTTGTCTGT CACACATACA AAGCACAGTT AGTGGTAGAT CGAAGAAAAC CTACTGATTC TAACGATAGT TCCTATACTT TGATTCAAAA GTAGTTCACT AGAAATCTTA CATAGTATAA GAATCTGTTC AACAAGCACA ATTACATTTC AATCGCTTGA TCGTTTTCTT CATCAATTCA TTGAAGAACA TCTATTATCA TTTACGGTTG TAGCGGATTG CCAAAAAAAA ATATAATTTA GTTATTCATT ATTCATTGAA TTCACATTTA TACTCCTAAG ATATGTTTGG TTCCAATAAC GACCCACAAA AACCACCTGC TTTCAGTTTC GGAAACCCAT CCACTTCCAA CACCTCTGGT GAAGCCCTGA AACCACCTGC CTTCAGTTTT GGAGCTCCTT CCACGTCTAA TACTTCTGGT GATGCTTCCA AACCTCCAGC CTTTAGTTTT GGCTCAGGTG CCTCCAAGCC TCTGGGATTT GGTTCCCTTA ACAGTAGCGG CACTTCTACA CCCGCATTTG GAACATCAGG TCCCTCTTTT GGAACTTCTG GCTCTGGTTC TACTAGTGCT TTTGGAGCTT CTGCTAAGGA CAACACTGCT GCTCCTTCTG GTGGCTTGTT TGGATCTTCT AACGCAGGCA CTTCCACCCC TTTTGGACAA GCTTCAAACG CTCCTCCAGC TAGTGGAAGT CTCTTTGGAA CTAGCAACAG TAATACGTCT ACAACTCTAG CACCTTCGGG AGGTTTGTTT GGTAAAAAAG ACGACAAACC TGCGACACCT GCTGCTGATT CTGGCGCTTT GTTCAGTTCG GCATCGACAA CTACAGGAAA TACTGGTTTT AGTTTTGGGC AGAAACCAGA AGAAAAGAAG ACCACGGGCT CTGTAACTCC GTTCTCCACT TCTTCTACAC CTCAATTGGG TGATGGATCC TCTACTTTTG GTGCCAGTTC CAAACCTGCT GAAGGAGGCG AGAAGCCAGC ATTCAGCTTT GGTGCACCTA AGACGGCTGC TTCAGACAAA CCAGCAGTGT CGTTTAGTAC AACCTTAAAC AGCAGCGACT CTAATATTCC TAAACTTGCT TTAGGAACGT CAACTATAGA TGAGACTAAA AAAGATTCGG AAGCTTCGAA GCCTGCCTTT TCTTTTGGTG GTGCAGCCAA ATCTACGACA GCTCCTGGTG GTTTATTCGG TGGAAGTGCT CCTTCTTCAT CTTCTTCCGG TGGGGGATTT TCATTTGGTG GTAATAAGGA CGAAGAGAAG AAAGACGACA AGCCTGCCTC TGGTGGATTA TTTGCTAGTA GTACGTCTTC TGCAGCTCCC GCTGCCGCAG GTGGTTTTTC TTTTGGAACC AAATCTGAAG AAAAGAAAGA TGATAAACCA GCTGCTGGTG GATTGTTTGG GGGAAGTAGT TCTACACAAG GCGGGTTTTC ATTTGGCGCC AAGAAGGATG AAACCAAAAA AGATGACAAA CCAGCCACTC TGGGATTTTC GTTTGGAGCA AAGGATAATG AAGGAAAGAG TGATAAGGCT ACCTCTTCAG GATTCTCTTT TGGCAAGCAG AAGGACGACA ACAAAACCAG TGAAAAATCA ACAACTGGAG GTTTCTCTTT TGGAACTAAG AAAGATGAAG AAAAGAAGGA TTCCGCTCCT GCTGGACCTT CTGGAGGATT CAGTCTTGGT GCCAAAAAGT CAGATGAAAC TACTGATGCT AAAAAGACTG AAGTTACAGG TAGCACCGCT ACTTCTACCA CAGCCACCAC TACAATTGAT CCAAGTCTAA AGCCCACCAA AATCCAGCCT ATAGAAGTGT CGATTGACAA CAAGACTCTT GACGAGCTTA TCATAAAATG GTCTAAACAG TTAGCTGGAA CGTCCAAGAT TTTTAACACA TACACCGACA AGGTGAGAGA ATGGGACCAA CAATTAGTTG TGAGTGGAGA TGAGATCTTG AGATTGAACC AGGATTCGAT TGAAGCTGAA GCATTGCAGA GTAAGATCGA TCAACACTTG TTGTTTGTAG AAAGCCAGCA GAATGAGTTG GAAAAGGTGT TAGATAACTA TGAACAACAA GCCGATATCT TGTTGAACAA CATAGAATTA AATTCCAGCA GCAATAGTAT CACCAATCAC TCTGCCGATG CTGGTGGATC TGGATCTGGT GCTGATGATG GTGGCTCTTC AGCTGGCAAC AATGGGGACA GTAGCACGCT AAGTGTCACT GATAAGTTAA GAGAGAAGGC ATACCATTCT GCTGAATTGT TGGACGAGAG ACTCGACAAC TTGGGCGACA ACTTATCGAC ATTGATCAGC GAGATCAATG CTGTCAGTGA AGTTTTTACC AAGAACTCCA TCAACGAGTT TGCTCTTCCA GCACCTGCAA ATGGTTCTCA AAATGGAAGC GCCTCTGAGA GCAAGTCCGG TAACGGCGAT GAAAATCCTA TCGAAGAGAT AGTCAAGTTG TTGAACTTGC ATTTGGACAA CTTGAAGTAC ATCGAAGACA CCGAAGAGAA GTTGAAGACC AAGATCAACA AGATCAATAA TATTAAGAAG ACACATTAGG AGAGAATATG CAATAGCATA GTAGATACAT AGTGTAATGT AGGTATATTA TATTAGTAGA TATATACATA GAAGAAGAAT GTTCTC
|
Protein sequence | MFGSNNDPQK PPAFSFGNPS TSNTSGEASK PPAFSFGAPS TSNTSGDASK PPAFSFGSGA SKPSGFGSLN SSGTSTPAFG TSGPSFGTSG SGSTSAFGAS AKDNTAAPSG GLFGSSNAGT STPFGQASNA PPASGSLFGT SNSNTSTTLA PSGGLFGKKD DKPATPAADS GALFSSASTT TGNTGFSFGQ KPEEKKTTGS VTPFSTSSTP QLGDGSSTFG ASSKPAEGGE KPAFSFGAPK TAASDKPAVS FSTTLNSSDS NIPKLALGTS TIDETKKDSE ASKPAFSFGG AAKSTTAPGG LFGGKKKDDK PASGGLFASS TSSAAPAAAG GFSFGTKSEE KKDDKPAAGG FSFGAKKDET KKDDKPATSG FSFGAKDNEG KSDKATSSGF SFGKQKDDNK TSEKSTTGGF SFGTKKDEEK KDSAPAGPSG GFSLGAKKSD ETTDAKKTEV TGSTATSTTA TTTIDPSLKP TKIQPIEVSI DNKTLDELII KWSKQLAGTS KIFNTYTDKV REWDQQLVVS GDEILRLNQD SIEAEALQSK IDQHLLFVES QQNELEKVLD NYEQQADILL NNIELNSSSN SITNHSADAG GSGSGADDGG SSAGNNGDSS TLSVTDKLRE KAYHSAELLD ERLDNLGDNL STLISEINAV SEVFTKNSIN EFALPAPANG SQNGSASESK SGNGDENPIE EIVKLLNLHL DNLKYIEDTE EKLKTKINKI NNIKKTH
|
| |