Gene PICST_32740 details

Gene Information

Locus tag: PICST_32740
Symbol: SET1
ID: 4840007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 537373
End bp: 540540
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 12
GC content: 42%
IMG OID: 640391322
Product: histone methyltransferase involved in gene regulation
Protein accession: XP_001385792
Protein GI: 150866258
COG category: [R] General function prediction only
COG ID: [COG2940] Proteins containing SET domain
TIGRFAM ID:
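
The coordinate and length fields in this record agree with one another, which simple arithmetic confirms. The sketch below assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 537373
end_bp = 540540

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# The gene is listed as unspliced, so the whole span is read as codons.
codon_count = gene_length_bp // 3

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 3168
print(gene_length_bp % 3)  # 0
print(protein_length_aa)   # 1055
```

Both results match the listed Gene Length (3168 bp) and Protein Length (1055 aa).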


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACGCC ACAGAGGATT CAGGCGAGAA CGGAGTCGAT ACCGTTCCTT CAATAATAAC 
GATTACGAAG ATGATGACAG AGAAACCAAC GGATCTACTC GAGGAAGCCG CTATCCCAAA
GATAGTCGAT ATAGCCATAA TGACTCTGGC AACTCGCAGA GCTCTTCAGT CAGACTCTCA
AGAGTACCAT CTGGAGAAAG ATCACCCAGC ATTGCCTCAG GCCACGGCTT CAAACTGGAG
ACTCCAGATA TCACCTATCT GAATATCCCG GGAACTGATC CTGTGACAGG AGAAAACAGT
GCCGACAAGA ACTTCACCAA ACTTTTGCAT CACATCGACT TTTTAGGAAA GTTTCCCCAA
AGCAGAGACA CAGACGTAAG GAAAAATTAC CGTGTAGTGT ATGATCCCGA GCTTGACAAG
AGTCTCACGA AAGAAGAACG TAAGTCTAGA ACTAAAAAAA TCAGGTTCAA CGGTGAGTCG
CTTCCGTTTC TGGAAAGAAC CGATCCTAGA ACAGCAAATC TCCAGTTCTA TTTCCAGAGA
CCAAACAAGA AATCAAAGAA ATTCCCCTTT AAACAATTGC CTCAACCTAA ATTTGCCTTT
GACAAAGATT CATTGGGTCC AGCTCCGCAA ACTGAGCTTG TAGTCTGGGA CTTACCAGCT
ACAACTAATG AGGTATATTT GACTAATTAT TTTAAGAGCT ATGGTGATCC TATTCAAGAC
ATGAAATTCG TTAACGACCC AATAAATGCC GTGCCGTTGG GTATTGCAAC GTTCAAATTC
CAAGGACCTC CAGAGAAAGC TATGAGATTG GCTAAGAAGC TTATCAGCAT TATCAGAGTA
GAAGGAGCTA AAGTAGATGG AGTAGACCTC AGAATTGCAT TAAACGATAA TGATAACAAC
TTGTTGAAGA ACAAGATCGA TTTGGCTCGT GATAAACTCC GTGCTAGCAG GCAAAAGCGA
GAAGAGGAAG AAAGAAGAAG GTTGAGAAGA CAACAGGAAG AAGAACGAGC CAAAAGAGAT
GAAGAACGCA AGGCTGCTGC TGAATCCAAA AGACGAGAAG AAGAAGCTAA GGTGGAGAAG
CTTCAACATC AAAAGACATC TGGCAAGAAC CATTCTAAAT TCAAACCCAA CACTAGCACG
CTTTCAATTA GGCACCACAA TAAAGTTGTA CCGGGCGTTT TCTTGCCAAA GGAATTGGTC
AAATACATCA AAGATCGTCC CTATATCGTT ATACATGACA AATACGTTTC TGCCAAGAAG
GTTTCGTCTC AGGATATCAA ACGTGCTTTG AACAAATACG ACTGGACAAG AGTATTGTCG
GACAGAACTG GTTTCTTTAT TGTGTTCAAC AGTTTGAAGG AATGCGAACG ATGCTTCTTT
AATGAGGATG GAAAGAGGTT TTTTGAATTC AGACTTTACA TGGAACTTGC TATTCCAGAA
GGTTATGACA CTAGTTCTAT AGAGTCCGCT GAGGACCTTC TAGAGCTTCA AAACAAGAAA
CACGACATCT TGGAAGAAGC AACCAATATT TTGATTAAGG AATTCGAAAC CTTCTTGGCT
AAAGACATTA GAGAGAGAAT TATTGCTCCA GCTGTTTTAG ATTTATTGAG CCACGACAAG
TACCCCAAAT TGGTTGAGGA ACTTAAAGCG AAGGAAAATG CTTCTAAACC TACGTCGTTT
GTCAGCACAA ACGATCAGCT CAAACAGTCT GCACTTTCCA TTCTTGCCGG ACGCAAATCG
CAACAGTCTC AATCTCTACC CTCGTTCAAA AAGAAGGTAG AATCTCCTAC CAAGAATAAA
CCACTTAGGA AGAGTCTCAT TCCCATGCAA CATGCTCTTA ACTATGATCA TGACTCAGAT
GAAGAGGATG AAGATGATTC TAGTAGATCT GCTACACCAG CTGCGCTGAT CTTAAAAAGA
GAACGTAGTT CTACTGTAAC AACAGTAAAC AATGAAGATA ATGAAGAGCA ACCGTCCAAG
AAACAGAAGA CTGGGTTGCA AAAATCTTTC TTATACGAGT CTTCTTCAGA TGAAGAGATG
GAGGATGAAG ATGTTTCAGA TGGTGCCGCG TCTACAATGG TAATTAATAC AGAGGAGCTA
GAATTAAAGG ATGAGGGTGA TGTAAAGAAG GAAGCTGAAG ATAAAGCAGA GGAGGACATT
GATTACTCCA AATTTGAAGC TAGATACCAT CCTACGGAAC GGAAGCCTTA CACAGTGTAT
GCTGAAACGA ACTTGTTACC CACTGATAAC TTCAACTTGG ATGTCTTACA GGATGTGCTT
AAAGATGAAG AGGATATCAA GTTGGCCAAG GAAGTGTTGC AGGAAACCAA TCCGCCTTCG
TCTGACATCA GAAACATTGA TTACTGGGCT TGGAAACAAA AGGATATCAG TGGTGCTTCT
CAAGAAATTG TTGAAGATGC TGAATTGATT GAAAAGTTGG ACTCGCGACT TGAATCACAA
TCAGGAGCTT TCAGAAGCGA TGGTTATAGA AGAATTCCTG ATGTTGACAA GATTGAGTAT
TTGCCACACC GTCGTAAAGT TCATAAGCCA CTTCAAACAG TACAGCATGA GGAAGTGGAA
GAAGAGGCTG GTAATAGAAA TAATGAAGGT ACCAATAGTG CTATCCAAAG TTCACGTGTC
AACCGTGCCA ACAATAGAAG ATTTGCTGCG GACATTACAG CTCAGCTTGG TAATGAAACC
GAAGTGTTGA GTTTGAATAC GTTGACCAAG AGAAAGAAGC CGGTGTCTTT TGCCCGTTCT
GCCATTCATA ACTGGGGATT ATACGCCTTG GAGCCTATAG CTGCTAAGGA AATGATTATC
GAGTATGTTG GTGAGAGTAT TAGACAACAG GTAGCAGAGC ATCGTGAGAA GAGCTACTTG
AAGACTGGTA TTGGGTCGTC GTATCTTTTC AGAATCGACG AGAACACGGT CATTGACGCA
ACAAAGAAAG GAGGCATTGC TAGATTCATC AACCATTGCT GTAGTCCTAG TTGTACTGCT
AAGATTATCA AGGTAGACAA TCAGAAGAGA ATTGTCATTT ACGCCTTGAG AGATATCGAT
GCCAACGAAG AATTGACCTA CGACTACAAA TTCGAGAGAG AGACCAATGA TGCTGAGAGG
ATCAGATGTT TGTGTGGAGC ACCTGGTTGT AAGGGCTATT TGAACTAA
 
Protein sequence
MSRHRGFRRE RSRYRSFNNN DYEDDDRETN GSTRGSRYPK DSRYSHNDSG NSQSSSVRLS 
RVPSGERSPS IASGHGFKSE TPDITYSNIP GTDPVTGENS ADKNFTKLLH HIDFLGKFPQ
SRDTDVRKNY RVVYDPELDK SLTKEERKSR TKKIRFNGES LPFSERTDPR TANLQFYFQR
PNKKSKKFPF KQLPQPKFAF DKDSLGPAPQ TELVVWDLPA TTNEVYLTNY FKSYGDPIQD
MKFVNDPINA VPLGIATFKF QGPPEKAMRL AKKLISIIRV EGAKVDGVDL RIALNDNDNN
LLKNKIDLAR DKLRASRQKR EEEERRRLRR QQEEERAKRD EERKAAAESK RREEEAKVEK
LQHQKTSGKN HSKFKPNTST LSIRHHNKVV PGVFLPKELV KYIKDRPYIV IHDKYVSAKK
VSSQDIKRAL NKYDWTRVLS DRTGFFIVFN SLKECERCFF NEDGKRFFEF RLYMELAIPE
GYDTSSIESA EDLLELQNKK HDILEEATNI LIKEFETFLA KDIRERIIAP AVLDLLSHDK
YPKLVEELKA KENASKPTSF VSTNDQLKQS ALSILAGRKS QQSQSLPSFK KKVESPTKNK
PLRKSLIPMQ HALNYDHDSD EEDEDDSSRS ATPAASILKR ERSSTVTTVN NEDNEEQPSK
KQKTGLQKSF LYESSSDEEM EDEDVSDGAA STMVINTEEL ELKDEGDVKK EAEDKAEEDI
DYSKFEARYH PTERKPYTVY AETNLLPTDN FNLDVLQDVL KDEEDIKLAK EVLQETNPPS
SDIRNIDYWA WKQKDISGAS QEIVEDAELI EKLDSRLESQ SGAFRSDGYR RIPDVDKIEY
LPHRRKVHKP LQTVQHEEVE EEAGNRNNEG TNSAIQSSRV NRANNRRFAA DITAQLGNET
EVLSLNTLTK RKKPVSFARS AIHNWGLYAL EPIAAKEMII EYVGESIRQQ VAEHREKSYL
KTGIGSSYLF RIDENTVIDA TKKGGIARFI NHCCSPSCTA KIIKVDNQKR IVIYALRDID
ANEELTYDYK FERETNDAER IRCLCGAPGC KGYLN
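
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This explains, for example, why the gene fragment GGCTTCAAACTGGAG corresponds to GFKSE in the protein sequence above. The following is a minimal standard-library sketch of such a translation, not the database's own pipeline; the names `TABLE_12` and `translate` are hypothetical and chosen for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

# Table 12 is built here as the standard code with CTG reassigned to serine.
TABLE_12 = dict(zip(CODONS, STANDARD_AAS))
TABLE_12["CTG"] = "S"


def translate(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in this record
    can be pasted in directly. Trailing bases that do not fill a codon are
    ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCACGCC ACAGAGGATT CAGGCGAGAA"))  # MSRHRGFRRE
# A fragment containing CTG, which table 12 reads as serine.
print(translate("GGCTTCAAAC TGGAG"))  # GFKSE
```

Under the standard code the second fragment would translate to GFKLE, so the translation table has to be taken into account when comparing the gene and protein sequences.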