Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_64717 |
Symbol | TAF5 |
ID | 4840945 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 507439 |
End bp | 509820 |
Gene Length | 2382 bp |
Protein Length | 782 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640392260 |
Product | TFIID and SAGA subunit |
Protein accession | XP_001386490 |
Protein GI | 150866779 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0491121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGAG ACAATAATAA CAGCAATGCC ACCAATCAAC CTCACGAGTT GAACCCTCCC ACGTCTGTCG GGGGATCTGC TGGCACAGAC TCTAGTCAAA CACTGTCTCA ACCATCACAA CCTTCCCAAG CTGGATCGGT ACGAGCTCCA GCCAATCAGA CGGCATCGCA ACCAGCTCAG CCCAGGGGCC AACAGCCACT GTTCTCACAA GCAGATTTGA ACCGAATTGT TCTTGAATAT CTCAACAAGA AAGGATACCA TAGAACAGAA TCAATGTTGA GATTAGAAAG CTCCAATACT CCCACACCAG CAGTAACGCC TGTGAGTCCA GCGACAAGTC TGTTGGCTAG TCCAGGCGAA ATAGTATCAC CAGCCAATGC TGCTAGAAGG GAAAAGGAAT TAAAAGATAA ATTGAACAAG AACGACCGCG AGATGAGAGA ATTGAAGGAA AGACAAGCCA GAGTAGAGCG AGAATTGAGA GAAGCAAGAG ACAGAGAAAT TCGCTTGGTT AAAGAGAAGG AATTGCGCGA AATCAAGGAT TTGGAAGAGA AGAAGAAACG CGAAAATGAT CCAGACGTCT ACTTCACTGT GTACTCTATG TTGAAAAAAT GGGTTGATAC GTCGTTGGAC TTGTACAAGC CGGAGTTATC GCGTGTATTG TATCCGTTGT TCATCCACTG TTTCTTTGAG TTGATCTCGA AGAACTTCGT GGATCTGGGA AAGCGGTTCT TTGACAAATT TAAGAGCGAC CATATTATCT TGCACGGTGT AGAAATCAAC CAGTTGGCAG GCATTCTGTT GCCAGAGCAT TTGAAGGAAA ATGAGTTGGC ACTTGCTTAC AGAAGAAATA AGTACAAGAT CGTTGTTTCC AAGACGTCGA TGAATTTGTT ATTGTACTTT TTGCACGAAA ATGAAGCTGT AGGTGGAGCT ATCTTGATTC GTATCATCAA CCAGTATTTG GATACAGTCA TTTCCAGCGC CAAGCTCGAC AAAGTAGACC AGGAAGGCGA AGCCAATCCC GAAGAAGGAA TTCCACAGTA TGTGGCCAAG ACAAACGAAA TAGATAAATT CAACGAACAA CTGGTAAAAT TGGGAAAATT CCCTATCGAC CCTGAAGTCC AGAAGGAAGT AGAAGCTGAG CTCAAGGTTA AGGATGAGAG ATCCTCTCCA GTAAATGGAA AGACTTTGGT AGAAGAATTC GCTGAAATGA CAAAACCAGA AACAGATTCT CCAGCCAGAG AAGCTTTGCC TCTTCCTCTT AAGGACCATG CTGATATCAA AAGAATGATA TTAGATGTAG AGGATTCCAG ATCCAAGATC AAGTTGGGAG CCTTACAAGC GTCTGCCCCT TCCGTTTGTA TGTATACTTT CCACAACACT TCCAACGACA TGACATGCAT AGATTTCAAC GAGGATTCCA ATATGATAGC TGCTGGATTC CAGGACAGTT TCATAAAGTT GTGGAGTTTA GATGGAAGAC CACTTAAATC TGTATTTAAG AGAGATCGCT ACAACTCTGA CAACACCCGT AAGCTCATTG GCCATAGTGG TCCTGTTTAT AGCGTGTCAT TCTCACCAGA CAACAGATAC CTTTTATCTG GTTCAGAGGA TAAAACAGTG CGACTCTGGT CGCTTGACTC TTACACAGCT CTTGTATCAT ACAAGGGACA TAACCAGCCT ATTTGGGATG TCAAATTCTC ACCATTGGGC CACTACTTCG CTACAGCTTC TCACGATCAA ACTGCCAGGT TGTGGGCTAC AGACCATATC TATCCCTTGA GGATATTTGC TGGCCATATC AATGACGTGG ACTGCGTAGA ATTCCACCCT AACTCTAACT ACGTGTTCAC TGGTTCGTCT GACAAGACAT GTAGAATGTG GGACGTACAG ACTGGTAACT GCGTCAGAGT GTTTATGGGC CACACTGGGC CTGTGAACTG CATGGCAGTT TCTCCAGACG GTAGATGGCT AGCCAGTGCT GGCGAAGACA GTGTAGTCAA CATCTGGGAC GCTGGAACTG GCAGACGTTT GAAGACAATG AAGGGTCATG GCCGTTCTTC TATCTACTCT TTGTCCTTTT CTAGAGATGG TGGTGTCTTG GTTAGTGGAG GTGCCGATAA TACTGTGAGA GTATGGGATG TCAAGAGAGA CACAAATGAT GCTGGACCGG AGCCAGAGAT GTTTTCATCT GTAGAAAATG GCTCCAGCAA TGGCTCTGAT CCAGAAGCTG CCAGAGCCAA AGCTGTAGAT AAGGTCAATA AGAAGGAAAT CATAGCGACT AGCGACCATA TGACGGCTTA CTTCACTAAG AAAACTCCCG TGTACAAGGT CCATTTCACA AGAAGAAACT TGTGTCTTGC TGGTGGAGCA TTCATGAGTT AG
|
Protein sequence | MAGDNNNSNA TNQPHELNPP TQTSSQPSQP SQAGSVRAPA NQTASQPAQP RGQQPSFSQA DLNRIVLEYL NKKGYHRTES MLRLESSNTP TPAVTPVSPA TSSLASPGEI VSPANAARRE KELKDKLNKN DREMRELKER QARVERELRE ARDREIRLVK EKELREIKDL EEKKKRENDP DVYFTVYSML KKWVDTSLDL YKPELSRVLY PLFIHCFFEL ISKNFVDSGK RFFDKFKSDH IILHGVEINQ LAGISLPEHL KENELALAYR RNKYKIVVSK TSMNLLLYFL HENEAVGGAI LIRIINQYLD TVISSAKLDK VDQEGEANPE EGIPQYVAKT NEIDKFNEQS VKLGKFPIDP EVQKEVEAEL KVKDERSSPV NGKTLVEEFA EMTKPETDSP AREALPLPLK DHADIKRMIL DVEDSRSKIK LGALQASAPS VCMYTFHNTS NDMTCIDFNE DSNMIAAGFQ DSFIKLWSLD GRPLKSVFKR DRYNSDNTRK LIGHSGPVYS VSFSPDNRYL LSGSEDKTVR LWSLDSYTAL VSYKGHNQPI WDVKFSPLGH YFATASHDQT ARLWATDHIY PLRIFAGHIN DVDCVEFHPN SNYVFTGSSD KTCRMWDVQT GNCVRVFMGH TGPVNCMAVS PDGRWLASAG EDSVVNIWDA GTGRRLKTMK GHGRSSIYSL SFSRDGGVLV SGGADNTVRV WDVKRDTNDA GPEPEMFSSV ENGSSNGSDP EAARAKAVDK VNKKEIIATS DHMTAYFTKK TPVYKVHFTR RNLCLAGGAF MS
|
| |