Gene PICST_68494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68494 
SymbolHAP3.2 
ID4840953 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp540388 
End bp542183 
Gene Length1796 bp 
Protein Length116 aa 
Translation table12 
GC content38% 
IMG OID640392268 
ProductTranscriptional activator HAP3 (UAS2 regulatory protein A) 
Protein accessionXP_001386697 
Protein GI126140350 
COG category[B] Chromatin structure and dynamics 
COG ID[COG2036] Histones H3 and H4 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.342746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0633704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACAAAGTTG ACTTTGCATA TCAAGCTATA TTGAAACCAG AGTGTTTTCC AAGAAGTCTC 
GAATATCGTT ACTTTCACCT GTACAAATAT AAAAGTGAAA AACTGCTTTT GTTTTCAGTG
AAAAATTGAA ATTTTGAGTT ATTTGAATTA TTTAAATGAA CTAAAAGGTA TATTTTTCAT
TATATAACTC ACAGTACTTT TCATTCTATA GCAGCATTGA CTCATAAAGC TCATTCGGTT
AACTACAGAG AATACAATTA TTCTAATATT TGCACAATAT ATTCAACTAC CTTTTCATCC
TTGAAGTACT ATTTCACCTT TTCGAAAGAA ATCTTTGTGA ATCACATAAA TTAAGTAATA
CGATTGTACT CATAGATTTT TCACAAGTTT TGTCTTCATT AAATCAATAT TCAATAGTCC
TAAACCGAAT ATGGACCCTA ACAATTTAAA CCCACAAGAA GTGGAACTAA GAGAGCAGGA
CAGATGGTTG CCCATCGCCA ATGTAGCTCG ACTCATGAAG AACACCTTGC CTACTACCGC
CAAAGTATCC AAGGATGCCA AAGAGTGTAT GCAAGAATGT GTCTCTGAGT TCATTTCCTT
CATAACCAGT GAAGCCAGCG ATAAGTGTTT GAGAGAAAAA CGAAAGACAA TTAATGGAGA
AGATATCTTG TACTCGATGC ACGACTTGGG GTTTGAAAAC TACGCCGAAG TGTTGAAGAT
CTACTTGGCC AAGTATCGTG AACAACAGGC TTTGAGGCAA GAAAGGGGAG AATCCAGAAC
TTCTAAGAGG CAACAGAAAC AGGCTGCTGC TGCGGCCGCT GCTGCTGCTG CTGAGGCTGC
GGCTTCCGAG GCTGCTTCTA CTGAAGACAT TGACGAGCTG GAGCACATGG AGTACCAAGA
AGATGGCACT GGATCCAATT CTCCATCCCA GAATGGTGAT CTTAATGGTG AACACTATAT
TGAAAATGAA GAGACATACC AAAACGGTCA TGAAGACCAA GAAAACGAAG AAGTTGAAGC
TGTGGAAGAC GTGGTAGCTA CTGAAACCCA ACAACAGGAA CATAATGAAG TAGACTCCAA
CTCAGTCACC CATGTAAATA GCGCTCTCGG CTCGAACTCT TCGTCAACTA GCTCGATAGC
TTCACCAGAG CCTTACTTTA ACCATTACGA AAATAACGAA AACGAAAACG AAAACGAACA
GGACAATTCC AAACCAGACC AACCAAACAC CGAAGAAACA GAACAGTCGG AACTAGACGT
TGTTGTTCCC CTCAAAGAAG GAGAAGAGTA TGGATATGAC GACTTTATCA CCGATGAGCC
ACATGATGAA CTCCACATCA GCGAAACCTC CAGTCATAAC CATATCGAGG AGTTGGCAAA
ACAGCTTACC CATGACGAAC CAGATATTGA CACATTGACA GCTGGAATCA CGAATGGAAC
CGCCAACAAT GGTTTTTTCT AGTCAATTCT GCAAATCTAG ACTATTCATA GAGGTGCTTC
AAAAGTTACG GTTTATTCAT AGAAGTGAAC TTCATATTTA TTCCATTGTA CAAATAAGTT
GTTGTATTTT GTTCTTTTTC TTTGGTGGAG GTGGATGAAA ATCTTGTATT TCAATTCTTT
TTTGTGTTAA CTACTATACA TGTATTATGT TGCGGTGTTG TAGATATCAA AGGTATTGTA
TTAAATTAGT TAAATGTGCA AACTTTACAC ATTAGTAGAA ATGAACCCAA TAAATAGTAC
CGGAAAGACA AACTATGCTA TTATATTTTT TCTCTGCAAA TAGGGCAGCT TCTTGA
 
Protein sequence
MDPNNLNPQE VELREQDRWL PIANVARLMK NTLPTTAKVS KDAKECMQEC VSEFISFITS 
EASDKCLREK RKTINGEDIL YSMHDLGFEN YAEVLKIYLA KQTMLLYFFS ANRAAS