Gene PICST_61538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_61538 
SymbolHAT2 
ID4839737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp48690 
End bp49883 
Gene Length1194 bp 
Protein Length397 aa 
Translation table12 
GC content44% 
IMG OID640391052 
Producthistone acetyltransferase subunit 
Protein accessionXP_001385343 
Protein GI126137640 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.204154 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAA ATGTTCAACG CGAGCTCACC ATCAAGGAGG AATACCAATT GTGGAGAAAG 
AACTGTCGGT ATATGTATGA GTTTGTTTCG GAAACAGCTT TGACCTGGCC TTCTTTAACC
ATTCAATGGT TACCTCAGCA TACCGAAGAA GACGGAGTGA TTCAGTCCAA GTTGCTCTTG
GGTACACACA CTTCTGGCGA AGATACCAAC TATTTGAAAG TTGCTTCTAC CGAACTTCCC
TCTTCCCAGC CAACAGAAAG TGCCAAAAAG GCTACTTCCA GGATCAAAAT TAGTAAGAAG
TTAACCAACG ACTACGAAAT CAACCGTGCT CGTTATATGC CGCAAGATCC CGATACGGTA
GCCACCATAA ACGGTGAAGG CAACATTGAT ATCTACGGCT TAAAAAGTGA AGAAAAGAAC
TCCCTTCTTC ACATCACACC TCACGACCGC AATGGGTATG GTCTATCTTG GAACAGCCAC
AGAAAGGGTT ATTTGTTGTC GTCTTCAGAC GATAAGTCAA TTGTTTTGAC TGATATCAAT
CGTGAAGCAC TTACTTCTAA TCAGATATTC AAGAACAATT CTCACTCTGA CATAGTCAAC
GACGTAAAAT GGCACACCCT TGACGAAAAC ATGTTTGCTT CAGTTTCAGA CGACAAACAT
GCCTACATTT TCGATTTGAG AACGCCCAAT AGGCCGGTAT CGTTGTTCTA CAACGAAGTA
TCTGACGGAA TCAACTCTGT AGCCTTCTCC CCCTTCTCCA AGTACTTGTT AGCCGTGGGT
AACACTAACT CCAACATTAA TGTATTGGAC TTGCGAAAGT TTAGTAACAA CGTCAAAAGT
AAAGACGGCC TACTTCATAC CATGATGGGC CATTCAGACT CGATTACTTC GTTGGAATTT
TCTCCACACA GGGACGGAAT AATAGCGTCT GGAGCCCAGG ATCGCCGGTT GATAGTCTGG
GACTTATTCA AGATTGGGGA AGAACAGCAA CAAGAGGACG CCGAAGATGG ATGCCCAGAA
TTATTTATGA TGCATGCTGG ACATACTGGT TCAGTGACAG ACTTGAGTTG GTGTCCATAC
AAAGACTGGA CCATTGGGTC TGTAGCTGAT GACAACATTG TCCATCTTTG GGAAGTGGGC
AAGAGTTTGC TTGAAGACGG CGTTGGCGAG ATCAAGGAAA CTGATCTTGA GTAG
 
Protein sequence
MDENVQRELT IKEEYQLWRK NCRYMYEFVS ETALTWPSLT IQWLPQHTEE DGVIQSKLLL 
GTHTSGEDTN YLKVASTELP SSQPTESAKK ATSRIKISKK LTNDYEINRA RYMPQDPDTV
ATINGEGNID IYGLKSEEKN SLLHITPHDR NGYGLSWNSH RKGYLLSSSD DKSIVLTDIN
REALTSNQIF KNNSHSDIVN DVKWHTLDEN MFASVSDDKH AYIFDLRTPN RPVSLFYNEV
SDGINSVAFS PFSKYLLAVG NTNSNINVLD LRKFSNNVKS KDGLLHTMMG HSDSITSLEF
SPHRDGIIAS GAQDRRLIVW DLFKIGEEQQ QEDAEDGCPE LFMMHAGHTG SVTDLSWCPY
KDWTIGSVAD DNIVHLWEVG KSLLEDGVGE IKETDLE