Gene Information |
Locus tag | PICST_82300 |
Symbol | |
ID | 4837154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 661059 |
End bp | 662708 |
Gene Length | 1650 bp |
Protein Length | 516 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388469 |
Product | predicted protein |
Protein accession | XP_001382368 |
Protein GI | 150863775 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5027] Histone acetyltransferase (MYST family) |
TIGRFAM ID | |
|
|
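The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the record. It assumes that Start bp and End bp are 1-based inclusive positions and that the coding region is the protein plus a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 661059
end_bp = 662708
gene_length_bp = 1650
protein_length_aa = 516

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding region is the protein plus one stop codon.
cds_bp = (protein_length_aa + 1) * 3

# Anything left over is sequence in the reported span that lies outside the CDS.
outside_cds_bp = span_bp - cds_bp

print(span_bp == gene_length_bp)  # True
print(cds_bp)                     # 1551
print(outside_cds_bp)             # 99
```

Under these assumptions the reported span matches the stated Gene Length, and it is 99 bp longer than the coding region alone, which suggests the listed gene sequence includes some flanking bases on either side of the open reading frame.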
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.284716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.810962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCTACAAAG CAATGTCCGG GGGCGAGAAC AACCAAGACT CGCATTCGCC CGATCCCAGC GCGCCTATAG CGATTAAAGC AGGCAAAGAT GAGCCAGCAG CTGATGATCA GTTCATAGAA GATGAAATCA TTATAGGATG TAAGGTGTAT GTTTCCAAGG ATGGAGAGCA CAGACTAGCA GAGATTCTTC AGCAACATTT GAAGAAGGGC AAGAAGGTAT TCTATGTACA CTATCAAGAG TTCAACAAGC GTTTAGATGA ATGGATTCTG AGCGACCGAA TCGACTACCG TAGGCCGATG ATCCTCCCAG AGGTTAAGGC TGATAAGAAA GAAGAGAAAA AAGATCTGAA ACTGAAGAAA AAGACGTCTA AGCCCAAGAG CTCGAAGGCG GCTTCAAAGC TTGCTCAGCT GTCTTCGGGA GCTGGTACTC CCCAGGCTGA AGATAGCGAA AATATGGAAA CCCCGGGCCA GGAAGATGAA ATGGATCTTG ATGACTTGAA CGTTCAAGGA TTAAAAAGAC CAGGAGAAGA AGTTAACCGC GAAGACGAGA TCAAAAAATT AAGAACAAGT GGGTCCATGA CCCAGAACCA TCTGGAGGTG GCCCGAGTTA GAAATTTATC CAGGGTCATC TTGGGAGAAC ATATCATAGA GCCGTGGTAC TTTTCGCCAT ACCCCATAGA ATTGACCGAG GAGGACGAAA TATACATCTG TGACTTCACA TTGTCGTATT TCGGGTCTAG AAAACAGTTT GAACGGTTCA GATCCAAATG CTGCTTGAAA CACCCGCCCG GAAATGAGAT CTACAGGGAC TCTAAGGTTT CTTTCTGGGA GATAGACGGT AGAAGACAGC GTACTTGGTG TCGTAATCTT TGTTTGCTTT GCAAACTCTT CTTAGACCAC AAGACGTTGT ACTATGATGT AGATCCGTTC TTATTCTACG TTATGACCAT CAAGTCAGAA CAAGGACATC ATGTTGTTGG CTTCTTTTCC AAGGAAAAAG AAAGTGGCGA TGGATATAAC GTAGCGTGCA TTCTCACCTT GCCTTGTTAC CAGAAACGGG GCTACGGGAA GTTGCTTATC CAGTTCTCGT ATATGTTGTC GAATGTGGAA CACAAAGTAG GATCTCCCGA AAAGCCTCTA TCCGACTTGG GCTTGTTGTC GTACAGGGCC TATTGGACAG ATACATTGGT CAAATTGCTT GTAGAGAGAA GTAATCCGAT GTTGTACAAA AAGAACAACC CTGCCATTGA TGAAGACGAA GACGGTGGAT CTACTCCTCC TGCCAATACC AACAAACCAG GCAGCGTAAA CGAGATTACA ATCGAAGAAA TCTCGGCTCT TACTTGTATG ACTACGACAG ACATTCTTCA TACCTTGACG ACGTTGCAGA TCTTACGCTA CTATAAAGGT CAACATATCA TCGTGATCAC TGACCAGATC ATGACGTTGT ACGAGAAGCT CGTCAAGAAG GTCAAGGACA AGAAGAAACA CGAGTTGGAG CCTCGCAGAC TCAAATGGAC CCCACCACTT TTCACAGCTA ACCAATTAAG GTTCGGATGG TAGATACATG ATTAACGCCC TGTATTTTAA TTGTACCATA GATATATAAT GTTGATATAC CATGAAATAT GTATTGAAAC GTCTGAAATG
|
Protein sequence | MSGGENNQDS HSPDPSAPIA IKAGKDEPAA DDQFIEDEII IGCKVYVSKD GEHRLAEILQ QHLKKGKKVF YVHYQEFNKR LDEWISSDRI DYRRPMILPE VKADKKEEKK DSKSKKKTSK PKSSKAASKL AQSSSGAGTP QAEDSENMET PGQEDEMDLD DLNVQGLKRP GEEVNREDEI KKLRTSGSMT QNHSEVARVR NLSRVILGEH IIEPWYFSPY PIELTEEDEI YICDFTLSYF GSRKQFERFR SKCCLKHPPG NEIYRDSKVS FWEIDGRRQR TWCRNLCLLC KLFLDHKTLY YDVDPFLFYV MTIKSEQGHH VVGFFSKEKE SGDGYNVACI LTLPCYQKRG YGKLLIQFSY MLSNVEHKVG SPEKPLSDLG LLSYRAYWTD TLVKLLVERS NPMLYKKNNP AIDEDEDGGS TPPANTNKPG SVNEITIEEI SALTCMTTTD ILHTLTTLQI LRYYKGQHII VITDQIMTLY EKLVKKVKDK KKHELEPRRL KWTPPLFTAN QLRFGW
|
| |
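The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below is a minimal illustration, not the database's own pipeline. It builds the standard codon table, overrides CTG, and translates two short fragments copied from the gene sequence above. The helper names `codon_table` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]


def codon_table(ctg="L"):
    """Return a codon -> amino acid map; pass ctg='S' for translation table 12."""
    table = dict(zip(CODONS, STANDARD))
    table["CTG"] = ctg
    return table


def translate(dna, table):
    """Translate DNA in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First ten codons of the open reading frame, starting at the ATG.
start_fragment = "ATGTCCGGGGGCGAGAACAACCAAGACTCG"
# An in-frame fragment from the gene sequence that contains a CTG codon.
ctg_fragment = "GATGAATGGATTCTGAGCGAC"

print(translate(start_fragment, codon_table("S")))  # MSGGENNQDS
print(translate(ctg_fragment, codon_table("S")))    # DEWISSD (table 12)
print(translate(ctg_fragment, codon_table("L")))    # DEWILSD (standard code)
```

The table 12 result matches the listed protein sequence (`MSGGENNQDS` at the start and `DEWISSD` further in), whereas the standard code would place a leucine at the CTG position.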