Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_29380 |
Symbol | |
ID | 4836816 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 124146 |
End bp | 125417 |
Gene Length | 1272 bp |
Protein Length | 357 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388131 |
Product | predicted protein |
Protein accession | XP_001382253 |
Protein GI | 150863695 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5027] Histone acetyltransferase (MYST family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000817876 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.405165 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATG GGCGAACGCA GGCAAAGAAG TCCGTGCCCA AGGATGAGGG ATATGATAAC GGCTTCTATG GCGTATTGGA CAAACCCAAC ATAAAGACCG TGCTCTTTGG CAATTACCGA TTCAATACGT GGTACGGCAA TGCAGCGTAT TTCAACGCTT ACGATACAGC CCATATGGCT TTGGGATACG ACTTTTCCAA TCGTATCGCC TCAGATCCCA GTTTGCGGGC TCGGAAACGA TCTAGATCTA CGTCTATGGT TGCTTCGTCT TCTATGACAT CCAATAATGG AAACCACGAT GGAAATCGTG AAAATTCCAT AGATACAAAT CAAAACGAAA ATTCAGCAAA AATTACGAGG TTGTCTAAAG CTGCAGCTCG AAAAGCAAAT GCAAACTCGA CCCATAGCAA CTTTACAGGT ACAGTCACAA ATACTGAAAA CTCCAACATC GATAATAACC ATAATGATGA CGACTACTGG TTGAACGAAC TTTATGTATG CGAGTACTGT TTCAAATATA CCTCCAATTC ACACGAGATG CAACAGCACA GAGTGGTCTG CTCATATAAT GTGGCCAGAC CGAAAGTGGG AAAGCTCTTG TATCGAGATG ACCATACCCC ATATCTCATA CGAGAAGTGC GAGGATTCAC TGATCCTCTT TTCTGTCAGA ACCTCTGTTT GTTTGGTAAG CTCTTTCTCG ACGATAAATC TGTGTACTAC AACATCGACC ATTTCAACTT CTATATTGTT TACGGCTATG ACAACGATGT AAATGCTGAC CCTTACACTG AACAACACTT CAAACCCATG GGTTTCTTTT CAAAGGAAAT GCTAGCTTAC GATAACGACA ATAACCTAGC ATGCATCTGT GTATTTCCCC CGTTTCAAAG ACGGCACTTG GGCTCATTGC TAATAGAGTT TCTGTACGCG TTGGCGCATG TCACTCCTGG CCAATACCAC AGTGGACCTG AATTCCCACT CTCTCCGTAT GGCAAGGTTA GCTACCTTCG GTTCTGGTCC AAAAAGTTGG CTAGTGTGAT AACTTCGCAT TTCAAGCCTG GTCTGTCGTT CAGTTTGAAT GATATTTCCG ACTTCACCGG GTTTAGAAAG GAAGATATCT TGCTCACGTT GGAGTACATG AAACTCTTGA AGAAAGACCT GCGGGGCAAT GTGAAGTTGC TCCTTGGAAA TCTTCAAGAA TGGTGCACTG CCAATAATGT TGACCCGAAC CAGGAGAAGT CTATGATGAA TACTGAGTAC CTTCTACTAT AA
|
Protein sequence | MSDGRTQAKK SVPKDEGYDN GFYGVLDKPN IKTVLFGNYR FNTWYGNAAY FNAYDTAHMA LGYDFSNRIA SDPSTVTNTE NSNIDNNHND DDYWLNELYV CEYCFKYTSN SHEMQQHRVV CSYNVARPKV GKLLYRDDHT PYLIREVRGF TDPLFCQNLC LFGKLFLDDK SVYYNIDHFN FYIVYGYDND VNADPYTEQH FKPMGFFSKE MLAYDNDNNL ACICVFPPFQ RRHLGSLLIE FSYALAHVTP GQYHSGPEFP LSPYGKVSYL RFWSKKLASV ITSHFKPGSS FSLNDISDFT GFRKEDILLT LEYMKLLKKD SRGNVKLLLG NLQEWCTANN VDPNQEKSMM NTEYLLL
|
| |