Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_64901 |
Symbol | NEM1 |
ID | 4851335 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1530250 |
End bp | 1532239 |
Gene Length | 1990 bp |
Protein Length | 401 aa |
Translation table | |
GC content | 42% |
IMG OID | 640393043 |
Product | Nuclear Envelope Morphology |
Protein accession | XP_001387525 |
Protein GI | 126274376 |
COG category | [K] Transcription |
COG ID | [COG5190] TFIIF-interacting CTD phosphatases, including NLI-interacting factor |
TIGRFAM ID | [TIGR02251] Dullard-like phosphatase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTTGT AGACTGCTGT GGGTCGGTAA ATAAACTCAT CTCGTCACAT CGGTGTTCGG CCATCTCGTC CTAAACTACC CATTCTATAC TACCTACTAT ATCTCAAACT CATATTGACA CAGGCATAGA TTCGTAGAAT CTCAACTATC ATTAGCCAAG TATTACTTCA ATCGCCCGCA AGTTCAACAA CCACATAAAG GCAACGCCAG CTTCCATTGC AGCCGCAACA TCTTCGGGCT GTACCTCCTT CAACTGCTTC ATTATCATCG TATAGGATCA AATCAGATCA TCCACTTGTC TTATCATACT GTGAGACAAT ATCTAGTACC TTCACTACCA TTCACGCTTC ATCCTTCCGT CTTACAACTA TTCTTTACAT CTCGCTGCAG ACCTCCATGG ACATTTCGCA TAACTATGAA CTCGCTTAAA ATAATAGTTA ACTCCTTTGA TACGCTCTAT CCGAAAAAGG ATTACGAACT CACAAGCTCG GCTCAAGACC TCGACGAAGA AGATGACATA GATGATGCTG GCGAAATTAA TTTAGCCAAA GCAGACATAG CGGAGCCCAA TGAAAGCACA ACCAGCATCA ACAGTAATGT GAACAACTCC GGCTCTTCTA CCGCAACCAT GACTATTACC GAGTCTCCCG TGACGCAGGC ACTTCCGGCT GATTACGAGA AAATCATGGC CAAACAGAGT TCTGAACAAG ACTCGATATT GCGGTCCATC GCAAACCTTT TACGCTTTGC TATCAAGACC ATTTTGTTTG TGCCCAACGT ACTCATAGTG AAACCTATCA GCTTCATGTG GCTCCTTGTG ACTTTCCCAT TCATCTACAC CTTTGAACAA CTCGGACTCG TTAACTTTGG GAAGTCCAGC CTGATTGTAG TCAAAGAAAT TCCAGAGTCG ATGGATTCTT CCTACGAAAA AATCTTACCA GAACAGACAG AGATAGAGGA AGACATCGAC TTGATAAAAG ACGACAGCAA TATCAGTAAA CTTCGTCGCA ACAATAATAA CATTAGTAGC GGTAAAATAA TGACAAGCTC GTCTTCAACT CCTGACCAGC GCTCGGAAAT AGACGATAAG ACCGAGATCC TCAAACAGGA AAACATTCTT TCCAACAACA TCAAATCTCC AACCTCATAT TCCAAATACA TAATACCGCC ACCATTACGA TTATATCCCT TATCCAGAAA TCCACAGAAG AAACGTAAGA GAAAGACTCT CATTCTCGAT CTTGACGAGA CGTTAATCCA CTCTTTATCA CGAGGTTCTC CCCGTTCTTT TAACACTTCG TCGTCTTCGG CTCCGAAAAT GATCGAAATC AAACTCAACA ATATTGCATC TCTATACTAC GTTCACAAAC GGCCCTATTG TGACTACTTC CTCAAAGAAA TCTCGAAGTG GTTTGAGCTC CAGATCTTTA CGGCTAGTGT CAAGGAATAC GCTGATCCAA TCATTGACTG GTTGGAAAGT GACATAATAG ACAACTCCCG GAAGAACTCC AAGCATGAGT CAGACTCAGA GGTTCCCAGC AAAATCTTCA CCAGAAGATA CTACAGAACC GATTGCACAT ATCGACAAGG AGTAGGATAC ATCAAGGATT TGTCTAAGTT CTTCGCCAAA GACGATGAGC TTAAGAACGT AATTATCCTC GACAATTCTC CCATAAGTTA TGCTCTTCAT GAAGATAATG CCGTCATGAT TGAAGGGTGG ATCAACGATC AGCGAGACCG CGATCTTTTG CATTTGTTGC CCATGCTTCA CAGTTTGAGT CTCTGTATAG ATGTAAGGTA CATCTTGGGC TTGCGACACG GAGAGAAGTC CTTTGAAAGG TAACCAATTA TATCATTACT ATTAATCTCA ATATTTATTG CTTGCTTTAA TGAACTAGAG TCAAAACATC AACTCCAAAA TTACACCGTT AATAGAAAAG TTTTGAATTA GATACTACAT ACATTTATGT CAATGTATAC GATTGCATTT
|
Protein sequence | MNSLKIIVNS FDTLYPKKDY ELTSSAQDLD EEDDIDDAGE INLAKADIAE PNESTTSINS NSSEQDSILR SIANLLRFAI KTILFVPNVL IVKPISFMWL LVTFPFIYTF EQLGLVNFGN KLRRNNNNIS SGKIMTSSSS TPDQRSEIDD KTEILKQENI LSNNIKSPTS YSKYIIPPPL RLYPLSRNPQ KKRKRKTLIL DLDETLIHSL SRGSPRSFNT SSSSAPKMIE IKLNNIASLY YVHKRPYCDY FLKEISKWFE LQIFTASVKE YADPIIDWLE SDIIDNSRKN SKHESDSEVP SKIFTRRYYR TDCTYRQGVG YIKDLSKFFA KDDELKNVII LDNSPISYAL HEDNAVMIEG WINDQRDRDL LHLLPMLHSL SLCIDVRYIL GLRHGEKSFE R
|
| |