Gene PICST_38082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_38082 
SymbolERK1 
ID4850864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp267039 
End bp268127 
Gene Length1089 bp 
Protein Length362 aa 
Translation table 
GC content45% 
IMG OID640392572 
ProductExtracellular signal-regulated kinase 1 (ERK1) (MAP kinase 1) (MAPK 1) 
Protein accessionXP_001387701 
Protein GI126273769 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATAG AACAACCAGA ACAGGCGCCA GCACGGCAGA TATCGTTCAA TGTATCGAGC 
CATTACCAAA TTTTGGAGAT AGTTGGCGAG GGAGCATATG GAATCGTTTG TTCAGCCATT
CACAAGCCTC TGAACCAGAA AGTAGCCATC AAGAAAATCG AGCCGTTTGA GAGATCAATG
CTTTGCCTTA GAACACTAAG AGAACTCAAG CTCCTTAAGC ATTTCAACCA CGAGAACATC
ATCAGCATTC TTGCTATCCA GAGACCTGTG AGCTACGAGT TTTTCAACGA AATCTATCTT
ATACAAGAGC TCATGGAAAC AGACTTACAT AGAGTGATCC GCACCCAGAA ACTCACCGAT
GACCATATCC AGTATTTCAT CTACCAGACA CTTCGTGCCC TTAAGGCTAT GCATCTGGCC
AATGTGTTAC ATAGAGACCT CAAACCGTCA AACTTGTTGC TCAACTCCAA TTGCGACTTG
AAAGTATGTG ACTTTGGCCT TGCCCGTTCC ATCGCTAGTA GTGAAGACAA TTTCGGGTAT
ATGACTGAAT ATGTCGCGAC CAGATGGTAT CGAGCACCAG AAATCATGCT CACTTTCCAG
GAGTACACCA CGGCTATCGA TGTCTGGTCT GTAGGCTGTA TTCTCGCCGA AATGCTCAGC
GGTAGGCCTC TTTTCCCGGG CAGGGACTAC CACAATCAGC TTTGGCTCAT AATGGAGGTC
CTTGGGACAC CTAACATGGA AGACTACTAC AACATCAAGA GCAAGCGAGC ACGAGAGTAT
ATCCGATCAT TACCGTTCTG CAAAAAGATC CCGTTCCAGG ACCTCTTTGG AAACATCAAC
CCCAACGTCC AAATCAACCC GTTGGCCATA GACTTGTTGG AGAACTTGCT TATTTTCAAT
CCTGCCAAAC GTATCACAGT AGACGACGCA TTAAAACATC CTTACTTGAA GCTCTATCAT
GATCCAAATG ATGAGCCTGT TAGCGAGAAA ATCCCCGAGG ACTTCTTTGA CTTTGACAAG
AGAAAGGACG AGCTTAGCAT TGATGATTTG AAGAAAATGT TGTACGAAGA AATCATGAAA
CCTTTATAG
 
Protein sequence
MNIEQPEQAP ARQISFNVSS HYQILEIVGE GAYGIVCSAI HKPLNQKVAI KKIEPFERSM 
LCLRTLRELK LLKHFNHENI ISILAIQRPV SYEFFNEIYL IQELMETDLH RVIRTQKLTD
DHIQYFIYQT LRALKAMHLA NVLHRDLKPS NLLLNSNCDL KVCDFGLARS IASSEDNFGY
MTEYVATRWY RAPEIMLTFQ EYTTAIDVWS VGCILAEMLS GRPLFPGRDY HNQLWLIMEV
LGTPNMEDYY NIKSKRAREY IRSLPFCKKI PFQDLFGNIN PNVQINPLAI DLLENLLIFN
PAKRITVDDA LKHPYLKLYH DPNDEPVSEK IPEDFFDFDK RKDELSIDDL KKMLYEEIMK
PL