Gene PICST_40760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40760 
SymbolHAT3 
ID4837331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1608135 
End bp1609340 
Gene Length1206 bp 
Protein Length402 aa 
Translation table12 
GC content42% 
IMG OID640388646 
Productsubunit of histone acetyltransferase 
Protein accessionXP_001382533 
Protein GI150863899 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.500001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.43958 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATCA CACAGAAGGA TTTGGCTGTG GCCGAACGTG AAATCGTGGA AGAGCATCAG 
CTCAAAGAAA AAGTCGTCAA CGAAGAATTC AAGATCTGGA AAAAAACAGT TCCGCTTCTT
TACGACACCA TACATACCTA TGTATTGGAC TATCCATCCT TGGCCATCAA GTGGCTTCCT
GATTACACTT ATTCAGATAA CAAGAACTCT GTCAATGTCA AGTTTTTGAT AGGCACCAAT
ACTTCACACA ATTCTCTGAA CTACTTGAAA TTGGGATCTG TAAACATTCC CAGTACATTG
GCTCCCGATT TTTCCACTGT GAATCCAGAT GTTGACAGCA TTACCGTTCC CTCGCTGGTT
ATCGAAGACA CTTCCGACTT CAGAATCTTG TCTAAATGGA AACAGACCTC GGAAATTAAC
AAGCTCGACA TCTCTCCAAA TGGAAAGAAA GTATTGAGTT TCAACAGCGA TGGAGTTGTC
CACTCCTACG ACTTGGAAAA CAACGACGTC ATCGACTACA AGTATCACAA GTCTGAGGGT
TATGCACTTA CTTGGTTTGG AAATGATAGC TTCATCAGTG GTTCCAACGA TTCGCAGATT
GCATTGTGGT CACTTGACAA ACCTTCTACT CCCATCCAGC TCTTCAAGAG CCACAATGGA
GCCGTCAACG ACATCTCGTA TAATCCCAAC TTTGTCAGTA TATTTGGCTC TGTTTCGGAC
GATTCATCAA CTCAATTCCA TGACTCTAGA GCTTCTGGTG ACAATCCTGT TATCAAGCAG
GAAAACCAAC ATATTCAGAT GGCTATAAGT GTCCATCCTG AGATCGAAAC CTTGTACGCA
ACTGGAGGAA AGGACAATGT GGTGTCGTTG TACGATATCA GAAACTACAA GATTCCTTTA
CGTAAGTTTT TCGGCCACAA TGACAGTGTT GCTGGTATCA AGTGGGATGT AGAAGACCCC
AGAACATTGA TATCGTGGAG TTTGGATAAG CGCATAATAA CGTGGGATTT GAAGGATTTG
GAGGAGGAAT ATGCATATCC TGATGGAAAT GAAAACTCAA GAAGAAGAGC CGCTGTAAAA
ATAGACCCTT GCTTGAGATT TATCCATGGA GGTCACACTA ATAGAGTCAA CGACTTTGAT
GTACATCCCA AAATAAGGAG CTTATATGCA AGTGTAGGCG ATGACAATTT GTTGGAGGTC
TGGAAA
 
Protein sequence
MTITQKDLAV AEREIVEEHQ LKEKVVNEEF KIWKKTVPLL YDTIHTYVLD YPSLAIKWLP 
DYTYSDNKNS VNVKFLIGTN TSHNSSNYLK LGSVNIPSTL APDFSTVNPD VDSITVPSSV
IEDTSDFRIL SKWKQTSEIN KLDISPNGKK VLSFNSDGVV HSYDLENNDV IDYKYHKSEG
YALTWFGNDS FISGSNDSQI ALWSLDKPST PIQLFKSHNG AVNDISYNPN FVSIFGSVSD
DSSTQFHDSR ASGDNPVIKQ ENQHIQMAIS VHPEIETLYA TGGKDNVVSL YDIRNYKIPL
RKFFGHNDSV AGIKWDVEDP RTLISWSLDK RIITWDLKDL EEEYAYPDGN ENSRRRAAVK
IDPCLRFIHG GHTNRVNDFD VHPKIRSLYA SVGDDNLLEV WK