Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32740 |
Symbol | SET1 |
ID | 4840007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 537373 |
End bp | 540540 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391322 |
Product | histone methyltransferase involved in gene regulation |
Protein accession | XP_001385792 |
Protein GI | 150866258 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGCC ACAGAGGATT CAGGCGAGAA CGGAGTCGAT ACCGTTCCTT CAATAATAAC GATTACGAAG ATGATGACAG AGAAACCAAC GGATCTACTC GAGGAAGCCG CTATCCCAAA GATAGTCGAT ATAGCCATAA TGACTCTGGC AACTCGCAGA GCTCTTCAGT CAGACTCTCA AGAGTACCAT CTGGAGAAAG ATCACCCAGC ATTGCCTCAG GCCACGGCTT CAAACTGGAG ACTCCAGATA TCACCTATCT GAATATCCCG GGAACTGATC CTGTGACAGG AGAAAACAGT GCCGACAAGA ACTTCACCAA ACTTTTGCAT CACATCGACT TTTTAGGAAA GTTTCCCCAA AGCAGAGACA CAGACGTAAG GAAAAATTAC CGTGTAGTGT ATGATCCCGA GCTTGACAAG AGTCTCACGA AAGAAGAACG TAAGTCTAGA ACTAAAAAAA TCAGGTTCAA CGGTGAGTCG CTTCCGTTTC TGGAAAGAAC CGATCCTAGA ACAGCAAATC TCCAGTTCTA TTTCCAGAGA CCAAACAAGA AATCAAAGAA ATTCCCCTTT AAACAATTGC CTCAACCTAA ATTTGCCTTT GACAAAGATT CATTGGGTCC AGCTCCGCAA ACTGAGCTTG TAGTCTGGGA CTTACCAGCT ACAACTAATG AGGTATATTT GACTAATTAT TTTAAGAGCT ATGGTGATCC TATTCAAGAC ATGAAATTCG TTAACGACCC AATAAATGCC GTGCCGTTGG GTATTGCAAC GTTCAAATTC CAAGGACCTC CAGAGAAAGC TATGAGATTG GCTAAGAAGC TTATCAGCAT TATCAGAGTA GAAGGAGCTA AAGTAGATGG AGTAGACCTC AGAATTGCAT TAAACGATAA TGATAACAAC TTGTTGAAGA ACAAGATCGA TTTGGCTCGT GATAAACTCC GTGCTAGCAG GCAAAAGCGA GAAGAGGAAG AAAGAAGAAG GTTGAGAAGA CAACAGGAAG AAGAACGAGC CAAAAGAGAT GAAGAACGCA AGGCTGCTGC TGAATCCAAA AGACGAGAAG AAGAAGCTAA GGTGGAGAAG CTTCAACATC AAAAGACATC TGGCAAGAAC CATTCTAAAT TCAAACCCAA CACTAGCACG CTTTCAATTA GGCACCACAA TAAAGTTGTA CCGGGCGTTT TCTTGCCAAA GGAATTGGTC AAATACATCA AAGATCGTCC CTATATCGTT ATACATGACA AATACGTTTC TGCCAAGAAG GTTTCGTCTC AGGATATCAA ACGTGCTTTG AACAAATACG ACTGGACAAG AGTATTGTCG GACAGAACTG GTTTCTTTAT TGTGTTCAAC AGTTTGAAGG AATGCGAACG ATGCTTCTTT AATGAGGATG GAAAGAGGTT TTTTGAATTC AGACTTTACA TGGAACTTGC TATTCCAGAA GGTTATGACA CTAGTTCTAT AGAGTCCGCT GAGGACCTTC TAGAGCTTCA AAACAAGAAA CACGACATCT TGGAAGAAGC AACCAATATT TTGATTAAGG AATTCGAAAC CTTCTTGGCT AAAGACATTA GAGAGAGAAT TATTGCTCCA GCTGTTTTAG ATTTATTGAG CCACGACAAG TACCCCAAAT TGGTTGAGGA ACTTAAAGCG AAGGAAAATG CTTCTAAACC TACGTCGTTT GTCAGCACAA ACGATCAGCT CAAACAGTCT GCACTTTCCA TTCTTGCCGG ACGCAAATCG CAACAGTCTC AATCTCTACC CTCGTTCAAA AAGAAGGTAG AATCTCCTAC CAAGAATAAA CCACTTAGGA AGAGTCTCAT TCCCATGCAA CATGCTCTTA ACTATGATCA TGACTCAGAT GAAGAGGATG AAGATGATTC TAGTAGATCT GCTACACCAG CTGCGCTGAT CTTAAAAAGA GAACGTAGTT CTACTGTAAC AACAGTAAAC AATGAAGATA ATGAAGAGCA ACCGTCCAAG AAACAGAAGA CTGGGTTGCA AAAATCTTTC TTATACGAGT CTTCTTCAGA TGAAGAGATG GAGGATGAAG ATGTTTCAGA TGGTGCCGCG TCTACAATGG TAATTAATAC AGAGGAGCTA GAATTAAAGG ATGAGGGTGA TGTAAAGAAG GAAGCTGAAG ATAAAGCAGA GGAGGACATT GATTACTCCA AATTTGAAGC TAGATACCAT CCTACGGAAC GGAAGCCTTA CACAGTGTAT GCTGAAACGA ACTTGTTACC CACTGATAAC TTCAACTTGG ATGTCTTACA GGATGTGCTT AAAGATGAAG AGGATATCAA GTTGGCCAAG GAAGTGTTGC AGGAAACCAA TCCGCCTTCG TCTGACATCA GAAACATTGA TTACTGGGCT TGGAAACAAA AGGATATCAG TGGTGCTTCT CAAGAAATTG TTGAAGATGC TGAATTGATT GAAAAGTTGG ACTCGCGACT TGAATCACAA TCAGGAGCTT TCAGAAGCGA TGGTTATAGA AGAATTCCTG ATGTTGACAA GATTGAGTAT TTGCCACACC GTCGTAAAGT TCATAAGCCA CTTCAAACAG TACAGCATGA GGAAGTGGAA GAAGAGGCTG GTAATAGAAA TAATGAAGGT ACCAATAGTG CTATCCAAAG TTCACGTGTC AACCGTGCCA ACAATAGAAG ATTTGCTGCG GACATTACAG CTCAGCTTGG TAATGAAACC GAAGTGTTGA GTTTGAATAC GTTGACCAAG AGAAAGAAGC CGGTGTCTTT TGCCCGTTCT GCCATTCATA ACTGGGGATT ATACGCCTTG GAGCCTATAG CTGCTAAGGA AATGATTATC GAGTATGTTG GTGAGAGTAT TAGACAACAG GTAGCAGAGC ATCGTGAGAA GAGCTACTTG AAGACTGGTA TTGGGTCGTC GTATCTTTTC AGAATCGACG AGAACACGGT CATTGACGCA ACAAAGAAAG GAGGCATTGC TAGATTCATC AACCATTGCT GTAGTCCTAG TTGTACTGCT AAGATTATCA AGGTAGACAA TCAGAAGAGA ATTGTCATTT ACGCCTTGAG AGATATCGAT GCCAACGAAG AATTGACCTA CGACTACAAA TTCGAGAGAG AGACCAATGA TGCTGAGAGG ATCAGATGTT TGTGTGGAGC ACCTGGTTGT AAGGGCTATT TGAACTAA
|
Protein sequence | MSRHRGFRRE RSRYRSFNNN DYEDDDRETN GSTRGSRYPK DSRYSHNDSG NSQSSSVRLS RVPSGERSPS IASGHGFKSE TPDITYSNIP GTDPVTGENS ADKNFTKLLH HIDFLGKFPQ SRDTDVRKNY RVVYDPELDK SLTKEERKSR TKKIRFNGES LPFSERTDPR TANLQFYFQR PNKKSKKFPF KQLPQPKFAF DKDSLGPAPQ TELVVWDLPA TTNEVYLTNY FKSYGDPIQD MKFVNDPINA VPLGIATFKF QGPPEKAMRL AKKLISIIRV EGAKVDGVDL RIALNDNDNN LLKNKIDLAR DKLRASRQKR EEEERRRLRR QQEEERAKRD EERKAAAESK RREEEAKVEK LQHQKTSGKN HSKFKPNTST LSIRHHNKVV PGVFLPKELV KYIKDRPYIV IHDKYVSAKK VSSQDIKRAL NKYDWTRVLS DRTGFFIVFN SLKECERCFF NEDGKRFFEF RLYMELAIPE GYDTSSIESA EDLLELQNKK HDILEEATNI LIKEFETFLA KDIRERIIAP AVLDLLSHDK YPKLVEELKA KENASKPTSF VSTNDQLKQS ALSILAGRKS QQSQSLPSFK KKVESPTKNK PLRKSLIPMQ HALNYDHDSD EEDEDDSSRS ATPAASILKR ERSSTVTTVN NEDNEEQPSK KQKTGLQKSF LYESSSDEEM EDEDVSDGAA STMVINTEEL ELKDEGDVKK EAEDKAEEDI DYSKFEARYH PTERKPYTVY AETNLLPTDN FNLDVLQDVL KDEEDIKLAK EVLQETNPPS SDIRNIDYWA WKQKDISGAS QEIVEDAELI EKLDSRLESQ SGAFRSDGYR RIPDVDKIEY LPHRRKVHKP LQTVQHEEVE EEAGNRNNEG TNSAIQSSRV NRANNRRFAA DITAQLGNET EVLSLNTLTK RKKPVSFARS AIHNWGLYAL EPIAAKEMII EYVGESIRQQ VAEHREKSYL KTGIGSSYLF RIDENTVIDA TKKGGIARFI NHCCSPSCTA KIIKVDNQKR IVIYALRDID ANEELTYDYK FERETNDAER IRCLCGAPGC KGYLN
|
| |