Gene Information |
Locus tag | PICST_61538 |
Symbol | HAT2 |
ID | 4839737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 48690 |
End bp | 49883 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391052 |
Product | histone acetyltransferase subunit |
Protein accession | XP_001385343 |
Protein GI | 126137640 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
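
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon, both of which are consistent with the listed values. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 48690
END_BP = 49883
GENE_LENGTH_BP = 1194
PROTEIN_LENGTH_AA = 397


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in an unspliced CDS, excluding one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record.
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

Because the gene is listed as not spliced, the whole span is treated as coding sequence; a spliced gene would need its exon lengths summed instead.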
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.204154 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACGAAA ATGTTCAACG CGAGCTCACC ATCAAGGAGG AATACCAATT GTGGAGAAAG AACTGTCGGT ATATGTATGA GTTTGTTTCG GAAACAGCTT TGACCTGGCC TTCTTTAACC ATTCAATGGT TACCTCAGCA TACCGAAGAA GACGGAGTGA TTCAGTCCAA GTTGCTCTTG GGTACACACA CTTCTGGCGA AGATACCAAC TATTTGAAAG TTGCTTCTAC CGAACTTCCC TCTTCCCAGC CAACAGAAAG TGCCAAAAAG GCTACTTCCA GGATCAAAAT TAGTAAGAAG TTAACCAACG ACTACGAAAT CAACCGTGCT CGTTATATGC CGCAAGATCC CGATACGGTA GCCACCATAA ACGGTGAAGG CAACATTGAT ATCTACGGCT TAAAAAGTGA AGAAAAGAAC TCCCTTCTTC ACATCACACC TCACGACCGC AATGGGTATG GTCTATCTTG GAACAGCCAC AGAAAGGGTT ATTTGTTGTC GTCTTCAGAC GATAAGTCAA TTGTTTTGAC TGATATCAAT CGTGAAGCAC TTACTTCTAA TCAGATATTC AAGAACAATT CTCACTCTGA CATAGTCAAC GACGTAAAAT GGCACACCCT TGACGAAAAC ATGTTTGCTT CAGTTTCAGA CGACAAACAT GCCTACATTT TCGATTTGAG AACGCCCAAT AGGCCGGTAT CGTTGTTCTA CAACGAAGTA TCTGACGGAA TCAACTCTGT AGCCTTCTCC CCCTTCTCCA AGTACTTGTT AGCCGTGGGT AACACTAACT CCAACATTAA TGTATTGGAC TTGCGAAAGT TTAGTAACAA CGTCAAAAGT AAAGACGGCC TACTTCATAC CATGATGGGC CATTCAGACT CGATTACTTC GTTGGAATTT TCTCCACACA GGGACGGAAT AATAGCGTCT GGAGCCCAGG ATCGCCGGTT GATAGTCTGG GACTTATTCA AGATTGGGGA AGAACAGCAA CAAGAGGACG CCGAAGATGG ATGCCCAGAA TTATTTATGA TGCATGCTGG ACATACTGGT TCAGTGACAG ACTTGAGTTG GTGTCCATAC AAAGACTGGA CCATTGGGTC TGTAGCTGAT GACAACATTG TCCATCTTTG GGAAGTGGGC AAGAGTTTGC TTGAAGACGG CGTTGGCGAG ATCAAGGAAA CTGATCTTGA GTAG
Protein sequence | MDENVQRELT IKEEYQLWRK NCRYMYEFVS ETALTWPSLT IQWLPQHTEE DGVIQSKLLL GTHTSGEDTN YLKVASTELP SSQPTESAKK ATSRIKISKK LTNDYEINRA RYMPQDPDTV ATINGEGNID IYGLKSEEKN SLLHITPHDR NGYGLSWNSH RKGYLLSSSD DKSIVLTDIN REALTSNQIF KNNSHSDIVN DVKWHTLDEN MFASVSDDKH AYIFDLRTPN RPVSLFYNEV SDGINSVAFS PFSKYLLAVG NTNSNINVLD LRKFSNNVKS KDGLLHTMMG HSDSITSLEF SPHRDGIIAS GAQDRRLIVW DLFKIGEEQQ QEDAEDGCPE LFMMHAGHTG SVTDLSWCPY KDWTIGSVAD DNIVHLWEVG KSLLEDGVGE IKETDLE
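
The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any computation such as GC content. The following is a minimal sketch under that assumption; `ungroup` and `gc_fraction` are hypothetical helper names, and the demo input is only the first two blocks of the gene sequence, so its result is not comparable to the 44% reported for the full gene.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, used only as a short demo input.
prefix = ungroup("ATGGACGAAA ATGTTCAACG")
print(len(prefix), gc_fraction(prefix))
```

Passing the complete Gene sequence field to `ungroup` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.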