Gene PICST_64901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_64901 
SymbolNEM1 
ID4851335 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1530250 
End bp1532239 
Gene Length1990 bp 
Protein Length401 aa 
Translation table 
GC content42% 
IMG OID640393043 
ProductNuclear Envelope Morphology 
Protein accessionXP_001387525 
Protein GI126274376 
COG category[K] Transcription 
COG ID[COG5190] TFIIF-interacting CTD phosphatases, including NLI-interacting factor 
TIGRFAM ID[TIGR02251] Dullard-like phosphatase domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTTGT AGACTGCTGT GGGTCGGTAA ATAAACTCAT CTCGTCACAT CGGTGTTCGG 
CCATCTCGTC CTAAACTACC CATTCTATAC TACCTACTAT ATCTCAAACT CATATTGACA
CAGGCATAGA TTCGTAGAAT CTCAACTATC ATTAGCCAAG TATTACTTCA ATCGCCCGCA
AGTTCAACAA CCACATAAAG GCAACGCCAG CTTCCATTGC AGCCGCAACA TCTTCGGGCT
GTACCTCCTT CAACTGCTTC ATTATCATCG TATAGGATCA AATCAGATCA TCCACTTGTC
TTATCATACT GTGAGACAAT ATCTAGTACC TTCACTACCA TTCACGCTTC ATCCTTCCGT
CTTACAACTA TTCTTTACAT CTCGCTGCAG ACCTCCATGG ACATTTCGCA TAACTATGAA
CTCGCTTAAA ATAATAGTTA ACTCCTTTGA TACGCTCTAT CCGAAAAAGG ATTACGAACT
CACAAGCTCG GCTCAAGACC TCGACGAAGA AGATGACATA GATGATGCTG GCGAAATTAA
TTTAGCCAAA GCAGACATAG CGGAGCCCAA TGAAAGCACA ACCAGCATCA ACAGTAATGT
GAACAACTCC GGCTCTTCTA CCGCAACCAT GACTATTACC GAGTCTCCCG TGACGCAGGC
ACTTCCGGCT GATTACGAGA AAATCATGGC CAAACAGAGT TCTGAACAAG ACTCGATATT
GCGGTCCATC GCAAACCTTT TACGCTTTGC TATCAAGACC ATTTTGTTTG TGCCCAACGT
ACTCATAGTG AAACCTATCA GCTTCATGTG GCTCCTTGTG ACTTTCCCAT TCATCTACAC
CTTTGAACAA CTCGGACTCG TTAACTTTGG GAAGTCCAGC CTGATTGTAG TCAAAGAAAT
TCCAGAGTCG ATGGATTCTT CCTACGAAAA AATCTTACCA GAACAGACAG AGATAGAGGA
AGACATCGAC TTGATAAAAG ACGACAGCAA TATCAGTAAA CTTCGTCGCA ACAATAATAA
CATTAGTAGC GGTAAAATAA TGACAAGCTC GTCTTCAACT CCTGACCAGC GCTCGGAAAT
AGACGATAAG ACCGAGATCC TCAAACAGGA AAACATTCTT TCCAACAACA TCAAATCTCC
AACCTCATAT TCCAAATACA TAATACCGCC ACCATTACGA TTATATCCCT TATCCAGAAA
TCCACAGAAG AAACGTAAGA GAAAGACTCT CATTCTCGAT CTTGACGAGA CGTTAATCCA
CTCTTTATCA CGAGGTTCTC CCCGTTCTTT TAACACTTCG TCGTCTTCGG CTCCGAAAAT
GATCGAAATC AAACTCAACA ATATTGCATC TCTATACTAC GTTCACAAAC GGCCCTATTG
TGACTACTTC CTCAAAGAAA TCTCGAAGTG GTTTGAGCTC CAGATCTTTA CGGCTAGTGT
CAAGGAATAC GCTGATCCAA TCATTGACTG GTTGGAAAGT GACATAATAG ACAACTCCCG
GAAGAACTCC AAGCATGAGT CAGACTCAGA GGTTCCCAGC AAAATCTTCA CCAGAAGATA
CTACAGAACC GATTGCACAT ATCGACAAGG AGTAGGATAC ATCAAGGATT TGTCTAAGTT
CTTCGCCAAA GACGATGAGC TTAAGAACGT AATTATCCTC GACAATTCTC CCATAAGTTA
TGCTCTTCAT GAAGATAATG CCGTCATGAT TGAAGGGTGG ATCAACGATC AGCGAGACCG
CGATCTTTTG CATTTGTTGC CCATGCTTCA CAGTTTGAGT CTCTGTATAG ATGTAAGGTA
CATCTTGGGC TTGCGACACG GAGAGAAGTC CTTTGAAAGG TAACCAATTA TATCATTACT
ATTAATCTCA ATATTTATTG CTTGCTTTAA TGAACTAGAG TCAAAACATC AACTCCAAAA
TTACACCGTT AATAGAAAAG TTTTGAATTA GATACTACAT ACATTTATGT CAATGTATAC
GATTGCATTT
 
Protein sequence
MNSLKIIVNS FDTLYPKKDY ELTSSAQDLD EEDDIDDAGE INLAKADIAE PNESTTSINS 
NSSEQDSILR SIANLLRFAI KTILFVPNVL IVKPISFMWL LVTFPFIYTF EQLGLVNFGN
KLRRNNNNIS SGKIMTSSSS TPDQRSEIDD KTEILKQENI LSNNIKSPTS YSKYIIPPPL
RLYPLSRNPQ KKRKRKTLIL DLDETLIHSL SRGSPRSFNT SSSSAPKMIE IKLNNIASLY
YVHKRPYCDY FLKEISKWFE LQIFTASVKE YADPIIDWLE SDIIDNSRKN SKHESDSEVP
SKIFTRRYYR TDCTYRQGVG YIKDLSKFFA KDDELKNVII LDNSPISYAL HEDNAVMIEG
WINDQRDRDL LHLLPMLHSL SLCIDVRYIL GLRHGEKSFE R