Gene PICST_29380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29380 
Symbol 
ID4836816 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp124146 
End bp125417 
Gene Length1272 bp 
Protein Length357 aa 
Translation table12 
GC content44% 
IMG OID640388131 
Productpredicted protein 
Protein accessionXP_001382253 
Protein GI150863695 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5027] Histone acetyltransferase (MYST family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000817876 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.405165 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATG GGCGAACGCA GGCAAAGAAG TCCGTGCCCA AGGATGAGGG ATATGATAAC 
GGCTTCTATG GCGTATTGGA CAAACCCAAC ATAAAGACCG TGCTCTTTGG CAATTACCGA
TTCAATACGT GGTACGGCAA TGCAGCGTAT TTCAACGCTT ACGATACAGC CCATATGGCT
TTGGGATACG ACTTTTCCAA TCGTATCGCC TCAGATCCCA GTTTGCGGGC TCGGAAACGA
TCTAGATCTA CGTCTATGGT TGCTTCGTCT TCTATGACAT CCAATAATGG AAACCACGAT
GGAAATCGTG AAAATTCCAT AGATACAAAT CAAAACGAAA ATTCAGCAAA AATTACGAGG
TTGTCTAAAG CTGCAGCTCG AAAAGCAAAT GCAAACTCGA CCCATAGCAA CTTTACAGGT
ACAGTCACAA ATACTGAAAA CTCCAACATC GATAATAACC ATAATGATGA CGACTACTGG
TTGAACGAAC TTTATGTATG CGAGTACTGT TTCAAATATA CCTCCAATTC ACACGAGATG
CAACAGCACA GAGTGGTCTG CTCATATAAT GTGGCCAGAC CGAAAGTGGG AAAGCTCTTG
TATCGAGATG ACCATACCCC ATATCTCATA CGAGAAGTGC GAGGATTCAC TGATCCTCTT
TTCTGTCAGA ACCTCTGTTT GTTTGGTAAG CTCTTTCTCG ACGATAAATC TGTGTACTAC
AACATCGACC ATTTCAACTT CTATATTGTT TACGGCTATG ACAACGATGT AAATGCTGAC
CCTTACACTG AACAACACTT CAAACCCATG GGTTTCTTTT CAAAGGAAAT GCTAGCTTAC
GATAACGACA ATAACCTAGC ATGCATCTGT GTATTTCCCC CGTTTCAAAG ACGGCACTTG
GGCTCATTGC TAATAGAGTT TCTGTACGCG TTGGCGCATG TCACTCCTGG CCAATACCAC
AGTGGACCTG AATTCCCACT CTCTCCGTAT GGCAAGGTTA GCTACCTTCG GTTCTGGTCC
AAAAAGTTGG CTAGTGTGAT AACTTCGCAT TTCAAGCCTG GTCTGTCGTT CAGTTTGAAT
GATATTTCCG ACTTCACCGG GTTTAGAAAG GAAGATATCT TGCTCACGTT GGAGTACATG
AAACTCTTGA AGAAAGACCT GCGGGGCAAT GTGAAGTTGC TCCTTGGAAA TCTTCAAGAA
TGGTGCACTG CCAATAATGT TGACCCGAAC CAGGAGAAGT CTATGATGAA TACTGAGTAC
CTTCTACTAT AA
 
Protein sequence
MSDGRTQAKK SVPKDEGYDN GFYGVLDKPN IKTVLFGNYR FNTWYGNAAY FNAYDTAHMA 
LGYDFSNRIA SDPSTVTNTE NSNIDNNHND DDYWLNELYV CEYCFKYTSN SHEMQQHRVV
CSYNVARPKV GKLLYRDDHT PYLIREVRGF TDPLFCQNLC LFGKLFLDDK SVYYNIDHFN
FYIVYGYDND VNADPYTEQH FKPMGFFSKE MLAYDNDNNL ACICVFPPFQ RRHLGSLLIE
FSYALAHVTP GQYHSGPEFP LSPYGKVSYL RFWSKKLASV ITSHFKPGSS FSLNDISDFT
GFRKEDILLT LEYMKLLKKD SRGNVKLLLG NLQEWCTANN VDPNQEKSMM NTEYLLL