Gene PICST_64717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_64717 
SymbolTAF5 
ID4840945 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp507439 
End bp509820 
Gene Length2382 bp 
Protein Length782 aa 
Translation table12 
GC content45% 
IMG OID640392260 
ProductTFIID and SAGA subunit 
Protein accessionXP_001386490 
Protein GI150866779 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0491121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAG ACAATAATAA CAGCAATGCC ACCAATCAAC CTCACGAGTT GAACCCTCCC 
ACGTCTGTCG GGGGATCTGC TGGCACAGAC TCTAGTCAAA CACTGTCTCA ACCATCACAA
CCTTCCCAAG CTGGATCGGT ACGAGCTCCA GCCAATCAGA CGGCATCGCA ACCAGCTCAG
CCCAGGGGCC AACAGCCACT GTTCTCACAA GCAGATTTGA ACCGAATTGT TCTTGAATAT
CTCAACAAGA AAGGATACCA TAGAACAGAA TCAATGTTGA GATTAGAAAG CTCCAATACT
CCCACACCAG CAGTAACGCC TGTGAGTCCA GCGACAAGTC TGTTGGCTAG TCCAGGCGAA
ATAGTATCAC CAGCCAATGC TGCTAGAAGG GAAAAGGAAT TAAAAGATAA ATTGAACAAG
AACGACCGCG AGATGAGAGA ATTGAAGGAA AGACAAGCCA GAGTAGAGCG AGAATTGAGA
GAAGCAAGAG ACAGAGAAAT TCGCTTGGTT AAAGAGAAGG AATTGCGCGA AATCAAGGAT
TTGGAAGAGA AGAAGAAACG CGAAAATGAT CCAGACGTCT ACTTCACTGT GTACTCTATG
TTGAAAAAAT GGGTTGATAC GTCGTTGGAC TTGTACAAGC CGGAGTTATC GCGTGTATTG
TATCCGTTGT TCATCCACTG TTTCTTTGAG TTGATCTCGA AGAACTTCGT GGATCTGGGA
AAGCGGTTCT TTGACAAATT TAAGAGCGAC CATATTATCT TGCACGGTGT AGAAATCAAC
CAGTTGGCAG GCATTCTGTT GCCAGAGCAT TTGAAGGAAA ATGAGTTGGC ACTTGCTTAC
AGAAGAAATA AGTACAAGAT CGTTGTTTCC AAGACGTCGA TGAATTTGTT ATTGTACTTT
TTGCACGAAA ATGAAGCTGT AGGTGGAGCT ATCTTGATTC GTATCATCAA CCAGTATTTG
GATACAGTCA TTTCCAGCGC CAAGCTCGAC AAAGTAGACC AGGAAGGCGA AGCCAATCCC
GAAGAAGGAA TTCCACAGTA TGTGGCCAAG ACAAACGAAA TAGATAAATT CAACGAACAA
CTGGTAAAAT TGGGAAAATT CCCTATCGAC CCTGAAGTCC AGAAGGAAGT AGAAGCTGAG
CTCAAGGTTA AGGATGAGAG ATCCTCTCCA GTAAATGGAA AGACTTTGGT AGAAGAATTC
GCTGAAATGA CAAAACCAGA AACAGATTCT CCAGCCAGAG AAGCTTTGCC TCTTCCTCTT
AAGGACCATG CTGATATCAA AAGAATGATA TTAGATGTAG AGGATTCCAG ATCCAAGATC
AAGTTGGGAG CCTTACAAGC GTCTGCCCCT TCCGTTTGTA TGTATACTTT CCACAACACT
TCCAACGACA TGACATGCAT AGATTTCAAC GAGGATTCCA ATATGATAGC TGCTGGATTC
CAGGACAGTT TCATAAAGTT GTGGAGTTTA GATGGAAGAC CACTTAAATC TGTATTTAAG
AGAGATCGCT ACAACTCTGA CAACACCCGT AAGCTCATTG GCCATAGTGG TCCTGTTTAT
AGCGTGTCAT TCTCACCAGA CAACAGATAC CTTTTATCTG GTTCAGAGGA TAAAACAGTG
CGACTCTGGT CGCTTGACTC TTACACAGCT CTTGTATCAT ACAAGGGACA TAACCAGCCT
ATTTGGGATG TCAAATTCTC ACCATTGGGC CACTACTTCG CTACAGCTTC TCACGATCAA
ACTGCCAGGT TGTGGGCTAC AGACCATATC TATCCCTTGA GGATATTTGC TGGCCATATC
AATGACGTGG ACTGCGTAGA ATTCCACCCT AACTCTAACT ACGTGTTCAC TGGTTCGTCT
GACAAGACAT GTAGAATGTG GGACGTACAG ACTGGTAACT GCGTCAGAGT GTTTATGGGC
CACACTGGGC CTGTGAACTG CATGGCAGTT TCTCCAGACG GTAGATGGCT AGCCAGTGCT
GGCGAAGACA GTGTAGTCAA CATCTGGGAC GCTGGAACTG GCAGACGTTT GAAGACAATG
AAGGGTCATG GCCGTTCTTC TATCTACTCT TTGTCCTTTT CTAGAGATGG TGGTGTCTTG
GTTAGTGGAG GTGCCGATAA TACTGTGAGA GTATGGGATG TCAAGAGAGA CACAAATGAT
GCTGGACCGG AGCCAGAGAT GTTTTCATCT GTAGAAAATG GCTCCAGCAA TGGCTCTGAT
CCAGAAGCTG CCAGAGCCAA AGCTGTAGAT AAGGTCAATA AGAAGGAAAT CATAGCGACT
AGCGACCATA TGACGGCTTA CTTCACTAAG AAAACTCCCG TGTACAAGGT CCATTTCACA
AGAAGAAACT TGTGTCTTGC TGGTGGAGCA TTCATGAGTT AG
 
Protein sequence
MAGDNNNSNA TNQPHELNPP TQTSSQPSQP SQAGSVRAPA NQTASQPAQP RGQQPSFSQA 
DLNRIVLEYL NKKGYHRTES MLRLESSNTP TPAVTPVSPA TSSLASPGEI VSPANAARRE
KELKDKLNKN DREMRELKER QARVERELRE ARDREIRLVK EKELREIKDL EEKKKRENDP
DVYFTVYSML KKWVDTSLDL YKPELSRVLY PLFIHCFFEL ISKNFVDSGK RFFDKFKSDH
IILHGVEINQ LAGISLPEHL KENELALAYR RNKYKIVVSK TSMNLLLYFL HENEAVGGAI
LIRIINQYLD TVISSAKLDK VDQEGEANPE EGIPQYVAKT NEIDKFNEQS VKLGKFPIDP
EVQKEVEAEL KVKDERSSPV NGKTLVEEFA EMTKPETDSP AREALPLPLK DHADIKRMIL
DVEDSRSKIK LGALQASAPS VCMYTFHNTS NDMTCIDFNE DSNMIAAGFQ DSFIKLWSLD
GRPLKSVFKR DRYNSDNTRK LIGHSGPVYS VSFSPDNRYL LSGSEDKTVR LWSLDSYTAL
VSYKGHNQPI WDVKFSPLGH YFATASHDQT ARLWATDHIY PLRIFAGHIN DVDCVEFHPN
SNYVFTGSSD KTCRMWDVQT GNCVRVFMGH TGPVNCMAVS PDGRWLASAG EDSVVNIWDA
GTGRRLKTMK GHGRSSIYSL SFSRDGGVLV SGGADNTVRV WDVKRDTNDA GPEPEMFSSV
ENGSSNGSDP EAARAKAVDK VNKKEIIATS DHMTAYFTKK TPVYKVHFTR RNLCLAGGAF
MS