Gene PICST_82300 details

Gene Information

Locus tag: PICST_82300
Symbol:
ID: 4837154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 661059
End bp: 662708
Gene Length: 1650 bp
Protein Length: 516 aa
Translation table: 12
GC content: 44%
IMG OID: 640388469
Product: predicted protein
Protein accession: XP_001382368
Protein GI: 150863775
COG category: [B] Chromatin structure and dynamics
COG ID: [COG5027] Histone acetyltransferase (MYST family)
TIGRFAM ID:
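
The Gene Length value agrees with Start bp and End bp if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function names span_length and coding_length are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of nucleotides needed to encode a protein of the given length."""
    return 3 * protein_aa + (3 if include_stop else 0)


if __name__ == "__main__":
    print(span_length(661059, 662708))  # 1650
    print(coding_length(516))           # 1551
```

A 516 aa protein needs 1551 bp including a stop codon, so the 1650 bp span listed above is longer than the coding region alone. The record does not say what the remaining bases represent.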


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.284716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.810962
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCTACAAAG CAATGTCCGG GGGCGAGAAC AACCAAGACT CGCATTCGCC CGATCCCAGC 
GCGCCTATAG CGATTAAAGC AGGCAAAGAT GAGCCAGCAG CTGATGATCA GTTCATAGAA
GATGAAATCA TTATAGGATG TAAGGTGTAT GTTTCCAAGG ATGGAGAGCA CAGACTAGCA
GAGATTCTTC AGCAACATTT GAAGAAGGGC AAGAAGGTAT TCTATGTACA CTATCAAGAG
TTCAACAAGC GTTTAGATGA ATGGATTCTG AGCGACCGAA TCGACTACCG TAGGCCGATG
ATCCTCCCAG AGGTTAAGGC TGATAAGAAA GAAGAGAAAA AAGATCTGAA ACTGAAGAAA
AAGACGTCTA AGCCCAAGAG CTCGAAGGCG GCTTCAAAGC TTGCTCAGCT GTCTTCGGGA
GCTGGTACTC CCCAGGCTGA AGATAGCGAA AATATGGAAA CCCCGGGCCA GGAAGATGAA
ATGGATCTTG ATGACTTGAA CGTTCAAGGA TTAAAAAGAC CAGGAGAAGA AGTTAACCGC
GAAGACGAGA TCAAAAAATT AAGAACAAGT GGGTCCATGA CCCAGAACCA TCTGGAGGTG
GCCCGAGTTA GAAATTTATC CAGGGTCATC TTGGGAGAAC ATATCATAGA GCCGTGGTAC
TTTTCGCCAT ACCCCATAGA ATTGACCGAG GAGGACGAAA TATACATCTG TGACTTCACA
TTGTCGTATT TCGGGTCTAG AAAACAGTTT GAACGGTTCA GATCCAAATG CTGCTTGAAA
CACCCGCCCG GAAATGAGAT CTACAGGGAC TCTAAGGTTT CTTTCTGGGA GATAGACGGT
AGAAGACAGC GTACTTGGTG TCGTAATCTT TGTTTGCTTT GCAAACTCTT CTTAGACCAC
AAGACGTTGT ACTATGATGT AGATCCGTTC TTATTCTACG TTATGACCAT CAAGTCAGAA
CAAGGACATC ATGTTGTTGG CTTCTTTTCC AAGGAAAAAG AAAGTGGCGA TGGATATAAC
GTAGCGTGCA TTCTCACCTT GCCTTGTTAC CAGAAACGGG GCTACGGGAA GTTGCTTATC
CAGTTCTCGT ATATGTTGTC GAATGTGGAA CACAAAGTAG GATCTCCCGA AAAGCCTCTA
TCCGACTTGG GCTTGTTGTC GTACAGGGCC TATTGGACAG ATACATTGGT CAAATTGCTT
GTAGAGAGAA GTAATCCGAT GTTGTACAAA AAGAACAACC CTGCCATTGA TGAAGACGAA
GACGGTGGAT CTACTCCTCC TGCCAATACC AACAAACCAG GCAGCGTAAA CGAGATTACA
ATCGAAGAAA TCTCGGCTCT TACTTGTATG ACTACGACAG ACATTCTTCA TACCTTGACG
ACGTTGCAGA TCTTACGCTA CTATAAAGGT CAACATATCA TCGTGATCAC TGACCAGATC
ATGACGTTGT ACGAGAAGCT CGTCAAGAAG GTCAAGGACA AGAAGAAACA CGAGTTGGAG
CCTCGCAGAC TCAAATGGAC CCCACCACTT TTCACAGCTA ACCAATTAAG GTTCGGATGG
TAGATACATG ATTAACGCCC TGTATTTTAA TTGTACCATA GATATATAAT GTTGATATAC
CATGAAATAT GTATTGAAAC GTCTGAAATG
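
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any computation such as the GC content field can be reproduced. The Python sketch below shows one way to do that. The helper names clean_sequence and gc_fraction are hypothetical, and the record does not say how its own GC figure was computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence, used only as a small demo input.
    row = "CGCTACAAAG CAATGTCCGG GGGCGAGAAC AACCAAGACT CGCATTCGCC CGATCCCAGC"
    seq = clean_sequence(row)
    print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence block to clean_sequence and then to gc_fraction gives a value to compare against the GC content field.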
 
Protein sequence
MSGGENNQDS HSPDPSAPIA IKAGKDEPAA DDQFIEDEII IGCKVYVSKD GEHRLAEILQ 
QHLKKGKKVF YVHYQEFNKR LDEWISSDRI DYRRPMILPE VKADKKEEKK DSKSKKKTSK
PKSSKAASKL AQSSSGAGTP QAEDSENMET PGQEDEMDLD DLNVQGLKRP GEEVNREDEI
KKLRTSGSMT QNHSEVARVR NLSRVILGEH IIEPWYFSPY PIELTEEDEI YICDFTLSYF
GSRKQFERFR SKCCLKHPPG NEIYRDSKVS FWEIDGRRQR TWCRNLCLLC KLFLDHKTLY
YDVDPFLFYV MTIKSEQGHH VVGFFSKEKE SGDGYNVACI LTLPCYQKRG YGKLLIQFSY
MLSNVEHKVG SPEKPLSDLG LLSYRAYWTD TLVKLLVERS NPMLYKKNNP AIDEDEDGGS
TPPANTNKPG SVNEITIEEI SALTCMTTTD ILHTLTTLQI LRYYKGQHII VITDQIMTLY
EKLVKKVKDK KKHELEPRRL KWTPPLFTAN QLRFGW
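
The record lists translation table 12, in which the codon CTG is read as serine rather than leucine. This is visible in the data: the ATT CTG AGC GAC stretch of the gene sequence corresponds to ISSD in the protein sequence. The Python sketch below is a minimal, dependency-free translator for that table. The names TABLE_12, CODON_TO_AA and translate are illustrative, and the reading frame is assumed to start at the first base of the input.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 12, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# It differs from the standard table only at CTG (Ser instead of Leu).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TO_AA = {
    a + b + c: TABLE_12[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate seq in frame 0; unknown codons become 'X'."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TO_AA.get(seq[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Bases 13 to 42 of the gene sequence, starting at the first ATG.
    print(translate("ATGTCCGGGG GCGAGAACAA CCAAGACTCG"))  # MSGGENNQDS
    # The CTG codon is decoded as S under table 12.
    print(translate("ATTCTGAGCGAC"))  # ISSD
```

Translating the gene sequence from its first ATG with this table is a simple way to check it against the protein sequence shown above.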