Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_50903 |
Symbol | |
ID | 4841080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 780319 |
End bp | 781977 |
Gene Length | 1659 bp |
Protein Length | 553 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640392395 |
Product | predicted protein |
Protein accession | XP_001386562 |
Protein GI | 150866834 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5027] Histone acetyltransferase (MYST family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA AGACCCTTAT TCTCAATCTC AAGTCATCTT CAAAGCAAGC GACGTCTCGC AAACTGAACG ATTTGGATAC ACGAGAATTG CCGTACAGAG GTATATTTCC ATATCCTGAC TGCACCATCA ACGATACTGA TCCTACCAAA GAGGATCGAG AACTTTTTGA AAAGTTGGCT GAAGAAGGCA ATGCTCTTCG TTTAAAAGAT ACAAACCAGT TGACCCAGCA AAAGGATGAA ACTCCAACCA TTAACGATAG AAGTGTAGAA TCAACTCCTA CGCCTGCCTC AATGCCCAAT TTGCTCAAGT CTCAAATCGA GAAAATTGTT TTCCGCAACT ACGAAATCAA CACCTGGTAC ACAGCTCCAT ACCCAGAAGA ATACTCTCAA TCCAAAGTAT TATTCATCTG TGAGCATTGC TTAAAGTATA TGAACTCACC CATGTCCTAC AAGAGACACC AGCTCAAGAA CTGTAACTTT TCAAATAACC ATCCTCCAGG AGTAGAAATA TATCGTGATT TGGCTACACG GATCCTGATC TGGGAAGTAG ATGGCCGCAA GAATATTAAC TATTGCCAAA ACTTATGTCT TCTTGCAAAG CTATTTCTCA ATTCAAAAAC TTTGTATTAC GATGTGGAGC CATTCATATT TTACATATTG ACAGAAATTG ACGAATTCAA CCTCTCAAAA TACCATTTTG TCGGATACTT TTCCAAAGAG AAGTTAAATA ATTCCGACTA CAACGTCTCC TGTATTCTCA CGTTGCCTAT TTATCAGAGA AAAGGTTATG GTAACCTATT GATCGATTTC TCGTATTTGT TGAGCAGACA AGAATTCAAG TATGGTACAC CAGAAAAGCC ATTGAGTGAT TTAGGTTTAC TCAGTTATAG AAACTACTGG AGAGTAACCA TCGCCTATAA ACTTAGAGAA CTCTACACAG CATTTGGGTC AGAAGAAGAA TCCACAACTC CTTCCTCCAT CATTTCCCAC ACAACAATAT CAGTAGATAT ACTTTGTAAA CTAACTGGTA TGACTTCTTC AGATGTGGTA GTTGGTTTAG AACAGTTAGA TGCATTGATC AAAAACCCGT CCACCAACAC ATATGCAATT GTTCTTAACT TGAAGAAAAT AAACTATGAA ATCGCCAGAT GGGAGAAGAA AAACTATACT AAACTCAATT ACTCCAAGCT TCTTTGGAAG CCAATGCTTT TCGGGCCCTC TGGGGGGATT AATTCAGCAC CAGCGTTTGT AGCTCCTCTT GCTGCTGGTC ACAATAGTGT CTCTCTGATA GTTGGTTTCT TGAAAGATGA CATCAATAAC CCATATTCAT ATGAAGAAGA AGCATATAAG GAGATAGAAA TGAGAAGAGA AGTTAGTCTT CTGAAATCGG ACGACAATGA CAACGCAGAT GATCAAGAAG ATCCCGACGA AGATTTAGAT AACTATTTGA TATGTTATCC AGGAATACAG TACAGCACCA AGAAGAAACC AATAAAGCTG TCTACTGAGA TCAAGCAAGT TTCTTTTGTA GACCTCAACA ATCTACTGGA CGAGTTCCCT GAAATATTCG AAGATGACGA ACCTGCCAGT AGCTCCAGTG AGTCGGAAGA CTATGTGGAA GCATCAGAAG TGGAAGATGT TGACGAAGAA GAAGAGGAG
|
Protein sequence | MKKKTLILNL KSSSKQATSR KSNDLDTREL PYRGIFPYPD CTINDTDPTK EDRELFEKLA EEGNALRLKD TNQLTQQKDE TPTINDRSVE STPTPASMPN LLKSQIEKIV FRNYEINTWY TAPYPEEYSQ SKVLFICEHC LKYMNSPMSY KRHQLKNCNF SNNHPPGVEI YRDLATRISI WEVDGRKNIN YCQNLCLLAK LFLNSKTLYY DVEPFIFYIL TEIDEFNLSK YHFVGYFSKE KLNNSDYNVS CILTLPIYQR KGYGNLLIDF SYLLSRQEFK YGTPEKPLSD LGLLSYRNYW RVTIAYKLRE LYTAFGSEEE STTPSSIISH TTISVDILCK LTGMTSSDVV VGLEQLDALI KNPSTNTYAI VLNLKKINYE IARWEKKNYT KLNYSKLLWK PMLFGPSGGI NSAPAFVAPL AAGHNSVSSI VGFLKDDINN PYSYEEEAYK EIEMRREVSL SKSDDNDNAD DQEDPDEDLD NYLICYPGIQ YSTKKKPIKS STEIKQVSFV DLNNLSDEFP EIFEDDEPAS SSSESEDYVE ASEVEDVDEE EEE
|
| |