Gene Information |
Locus tag | PICST_66436 |
Symbol | NIP1 |
ID | 4851313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1473225 |
End bp | 1475960 |
Gene Length | 2736 bp |
Protein Length | 839 aa |
Translation table | |
GC content | 45% |
IMG OID | 640393021 |
Product | Protein required for nuclear import with some similarity to Nsr1p, another protein involved in nuclear transport |
Protein accession | XP_001387924 |
Protein GI | 126274335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
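The Gene Length value above follows from the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption about this record's convention, not something the record states. A minimal Python sketch of the arithmetic (the function name `feature_length` is hypothetical):

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above (replicon NC_009068, minus strand).
gene_length = feature_length(1473225, 1475960)
print(gene_length)  # 2736, matching the Gene Length field
```

The strand does not affect the result, because the length depends only on the span between the two coordinates.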
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGACATCTAC ACAACGTGAA ACAGCACAAC AGTGATATAT ACAAGATGTC TCGTTTCTTT GTTGCCGGCT ACAACTCGGA CTCGTCGTCT GAGGAAGAGG ACTTGTTGAG TTCTGATGAA GAATTGCTTT CTTCCTCCTC TGAAGGAGAA CAGGAAACGT CAGACGATGA CAGCTTAGAT TTCGATGACC AGTCTGATTC TGACTCAAGT GACTCTGACT CCGATGGCCG TCCAAGTGGT CCAGCATATT TCTTGAAGAA GGACTTCATG AAAGGTGGTG CTGGCGGAGA CTCTGACTCT GACTCTGAGG ATGAAGGCAG AAGAGTAGTG AAGTCTGCCA AGGACAAGTT GCTTGACGAC ATGAACGAAT CCATTGAAGC CATTAACGTA GCCAGAAGAT CCGACACTTG GACCACCGTT GTATCGGAAT TTGACAAGTT GGGACGTCTC TTGGTTAGAG CTGGTCAGCA AAGCGTGTCT ACGCCTAATG CCTACATCAG ATGTTTGGCC GATTTGGAAG ACTACATCAC TGCTACTAGT GAAAACGAGA AAACCGAAAA GTCTTTAAAC GCAGCCGAAG CCAGAGCATT TAACATGGCT AAACAGAGAG TCAGAAAGCA AATCAAAGAG TACCAGGCCC AATATGACCT CTACAGAGAG AACCCAGAAT TGTTTGAAAG GGAAGAATCC GTAGACATCG CTGCTCCTTC CAGATCTGAT GTTCCAGTAG AAGATACTAC GGGCAGAGTC TTGAGTCCTG TGTTCACCAT CTTGAAACAG ATCGCTGAGA CTCGTGGTAA GAAGAACATC GACAAGTACG AACAGATCAA AACCTTGGAA GACTTGTTGA ACGACAATTT GGCAAAGGGC TCTGTTTTCG AGTTGATTTC CATCTATCAG ATGTTGTTGT CGATCCGTTT CGACGCTTCG GCCAACCAGA ACTTCATGCC TATTGAACAA TGGAAGAACA ACGAGGCAGA CCTCACCTCT TTGATCGGAC TTTTGGAATC CAACAAGGAC ACCTACCAGT TGTCTGAGCT TGGTTCTACT ACTGATGATA TCGACATCGA ACCTGTAGCC AATGAATCTG GTGTCAAGGC TATCTTTGGC TCCATAACTT CGTTGATCGA CAGATTGGAC GATGAATTCA CCAGATCCTT GCAGAACACC GATCCTCACT CGATTGAGTA CGTGCAAAGA TTAAAGGATG AAACCACCAT CTACCAGTTG ATCGTGAGGG GCCAGTCTTA TATTGAGTCT ATAACGCCAG CTGAAGTCCA GCAGTCGGTT GAGCAGTTGT CGAGAGTCGT TCTCCGTAGA TTGGAACATA TCTACTACAA GCCTGACCAG TTGATCAAAG CTAACGAGGC TGAAGCCTGG AAGGGTATTT CTCACGAGTC TGTCATTGTA TCCAAGGACT CCACTCCTGC TGAGTTGATT GAAGGATTGT CGTCGTTCTT AACCAAGCAC AAGAATCCAG TCTACGCCAA ACACGCTTTG TTGTTCTCGG TCTACTACTA CGCCGTCAAC AACAACTACA ACAGAGCCAA GGAGTTGTTC TTGGACTCGC AAATCTTCAA CAAAATCCAC CATGCTGACT CCAGCTTGCA AGTCCAGTAC AACAGAGCCA TTGTTCAGTT GGGTTTGAGT GCATTCAGAA ACGGAGCTGT CGAAGAATCT CACAAGGTAT TGAACGAAAT CGTCAACTCG CAAAGATCAA AGGAATTGTT GGGCCAAGGG TTCAACTCTA AATATCCCAA CCAAGCTACA ACCGTGGAAA AGGCTAAGTT GTTGCCTTTC 
CACCAACACA TAAATTTGGA ATTGCTTGAG TGTGTTTACC TGACCTGCTC GTTATTGATT GAAATTCCAG CCTTGGCTGC AGCTACAAAC TCCAAGGACT CCAGACGTAA GGCAACCACC AAGTCATTCA AGAGTAAGTT AGAATTCCAC GACAGACAGT TCTTCACTGG TCCTCCAGAG AGTATCAAAG ATCACATAGT GCATGCCTCC ATAGCTTTAC TGAAGGGTGA TTGGGCTAAG GCTTACCAGT TGTTGTCGTC TATCAAGATC TGGAAGTTGT TCCCTGATAA CGATGACTTG TTGGCCATGA TGAAGAACCA GTTACAAGTC GAAGGTTTAA GAACATACAT TTTCTCCTAC AAGTCGATTT TCTCTAAGTT ATCTTTAGGC AAGTTGTCGC AGATCTTCGA GCTTGAAGCT GACAAAGTTG AGTCCATTGT TCAAAAGATG ATAGAGACAA ATGAGATTGG CGGAACATTA GACGAATCCA AGGCATTTAT CCAATTCGCC AGTACCGAGC CACAAAGATC CAGATTGCAA GAGTTAGCTA TTGTTATGAA CGAAAAGGTT GGTTTGTTGA CCGAAAAGAA TGAAAAGACT TCGTCCAACG GTTACGGTAA GAAGCAGCCT CAACAACAGC AACAACAGCA ACAACAGCAA CAGCAACAGC AGCAACAGCA GAAGGATTTG CTCCAAGAGG ACAACAGCAG ATTCAGATAC GCCAACGTTA ACACTAACAA CGATGAATTC CAAACTACTG CCTAAGCTGA GCTGGCTGTT CCAGTCACGT TTTGTACAAT AATCTCTTTA GCATGGATCT CTATTTGCAT CTAGACCAGA TAGACTCTGT CTTCCTGAAC TTATCAAGGC TGATTTGTTC CTGAGTCCCT GACTCGATAT ATATATTAGT TCTTTTTAAT ATAACGTTCT ACGTGA
|
Protein sequence | MSRFFVAGYN SDSSSEEEDL LSSDEELLSS SSEGEQETSD DDSLDFDDQS DSDSSDSDSD GRPSGPAYFL KKDFMKGGAG GDSDSDSEDE GRRVVKSAKD KLLDDMNESI EAINVARRSD TWTTVVSEFD KLGRLLVRAG QQSVSTPNAY IRCLADLEDY ITATSENEKT EKSLNAAEAR AFNMAKQRVR KQIKEYQAQY DLYRENPELF EREESVDIAA PSRSDVPVED TTGRVLSPVF TILKQIAETR GKKNIDKYEQ IKTLEDLLND NLAKGSVFEL ISIYQMLLSI RFDASANQNF MPIEQWKNNE ADLTSLIGLL ESNKDTYQLS ELGSTTDDID IEPVANESGV KAIFGSITSL IDRLDDEFTR SLQNTDPHSI EYVQRLKDET TIYQLIVRGQ SYIESITPAE VQQSVEQLSR VVLRRLEHIY YKPDQLIKAN EAEAWKGISH ESVIVSKDST PAELIEGLSS FLTKHKNPVY AKHALLFSVY YYAVNNNYNR AKELFLDSQI FNKIHHADSS LQVQYNRAIV QLGLSAFRNG AVEESHKVLN EIVNSQRSKE LLGQGFNSKY PNQATTVEKA KLLPFHQHIN LELLECVYLT CSLLIEIPAL AAATNSKDSR RKATTKSFKS KLEFHDRQFF TGPPESIKDH IVHASIALLK GDWAKAYQLL SSIKIWKLFP DNDDLLAMMK NQLQVEGLRT YIFSYKSIFS KLSLGKLSQI FELEADKVES IVQKMIETNE IGGTLDESKA FIQFASTEPQ RSRLQELAIV MNEKVGLLTE KNEKTSSNGY GKKQPQQQQQ QQQQQQQQQQ QQKDLLQEDN SRFRYANVNT NNDEFQTTA
|
| |
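The sequence fields above are printed in blocks of ten characters, so whitespace has to be stripped before any calculation. The sketch below shows, under stated assumptions, how the GC content figure and the start of the protein sequence could be re-derived from the gene sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The standard genetic code is assumed because the record's Translation table field is empty; a different table would change some codon assignments. Only the first 80 bases of the gene sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1) in TCAG order; assumed, not taken from the record.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the record's display."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored; a non-ACGT base raises KeyError.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = []
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 80 bases of the Gene sequence field, copied from the record.
fragment = clean(
    "CGACATCTAC ACAACGTGAA ACAGCACAAC AGTGATATAT "
    "ACAAGATGTC TCGTTTCTTT GTTGCCGGCT ACAACTCGGA"
)
start = fragment.find("ATG")         # 0-based offset of the first ATG
print(start)                         # 45
print(translate(fragment[start:]))   # MSRFFVAGYNS
```

The translated prefix agrees with the first eleven residues of the Protein sequence field. Calling `gc_percent` on the full gene sequence gives a value that can be compared with the GC content field.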