Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80611 |
Symbol | SPT7 |
ID | 4851522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2053276 |
End bp | 2057085 |
Gene Length | 3810 bp |
Protein Length | 1106 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393230 |
Product | transcription factor, member of the histone acetyltransferase SAGA complex |
Protein accession | XP_001388019 |
Protein GI | 126274742 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.538787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.471232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GAAACAGCGT GGAAACATCC ATATATCAGC TTGCTTCGTT TTTTCTTGCC ATTTCAGCTC TCTTTTTAGC TGGTTCCGAC GCTCTCCTCT GTAGAGCTAA TATTAATCTC GAGAATCAGA TCTCTCATTG AATCCAGACT GAGTCCACAA CGATTCCTTT AAAAGGATAT TCAGAGCAGA TAGACGTGAG TTCAAGTCTG GTAGAGTTGA CAATCCATAC ACGTGATTCA GTAGTGATTC CAAATTTGAA TAGAAACCCA ACTAGGTTTC ATTCTTTCAT TATTTTATAT TCTGTTCACA TTTTTCAATA ACATTTTCAG TAATATTCAC GAATTTGTTC TTCTTGAAAG TTCACAGAAA TTCCAATCCT TCTAAAGTTG ACTAATTATC ACAGTTCCAT TTCCGAAGTG AAATTTCAGA ATTTCTTAGC ATAGAATGGA GAAGCTATCG ACGTTTGAGC ATAACAATCC CAAGAAGCTC TTCGAGTTGG CCAAACGGTT GCATTCCAGC AACTTCTTCG AGTCGTATCT CAACGAAACT CATCTCAAAG TGTTGGACTA TATAATACTG TTGAACAACG CCGAGATTTG GGACAATTTC CTTGAGGGTA ACTGTACTAT CACATTCAGC AAGGATCCAG AAGCCAGCGA GGTGGTCAAG TCTGAAGAAG GTATTGCGGA CAATAATGAC AATGTAAGCG ACACTCCAGA GCTTCTTTTT GAAGAAGGAT CCAAAATGGT TAAGATGTTA GCATTACATA TCAGATACTT GTTATGGGAG AAAGCAATTG ACTACTACTA CAAAAACACA GGCTCTTCAT CAGAACAAGT AGTAGAAGTA GATGATGACT TTGAAATGAT AGATACGCTA GATAACTTCT CGGATGACGA AGATAAGGAA AAAGAGGCCG AGAAAGAGGA AAAAGCACAG CCAAAAGTCC GTGAAGTTGA GGACGATTAT GACGATGAAG ACGAAGAAGA AGAAGAAAGT GACCAAAAAG ATGATAAGGA TGACAATAAA GACAATGAGC AATCTGAAAA TGGAATATAT CAATGGAAAT ATAACGACGA AAAACAGATT GTTTTGCAAG TGCCTGTTTC GCTTGTTACT GTGGCTCCAG ATACTTCTTC TGAAATACAA ACAGAATCGC AAGATGGCTC TGGTTCAACT GGAAGTGAGC CTCCCAAGTC TTCATCTAAC GGTGATTCTC CTGATTCAAA TGCTGACGAC CAGGAAAAGC TCATAAGAGA ATATAATAAA GTGTATCATA ACTTCGAGTA CGATCGTGAA ACATTAATCA AAAGAAGGAA GTTAGAGAAG TCGGATTTGC AGTTAGAAGA CTCCAAGAAC GGCAACGCTG ACTCTACCAA AGATGGGCTT TCTGGATTAG GTGAAGCTGA TAGCATGGGA ATCAGTTTAG GAACTGGAAG CACTTCCTTA AAGCATCTTT TGTCCACTAT TCAGCTGAAG AGGGACGATG TTCCTCTTAA CGATCATGAA TTGAGAACTC TTTTTATGGA TGTAAGAAAG AACCGAGGTA AGTGGGCCAA TAACGATCGT GTTGGCCAAG AAGAGTTGTA TGAAGCCTGT GAGAAAGTGG TTGTAGAATT GAGAGGATCC ACAGAGCATT CGACTCCTTT CTTGAATAAA GTATCCAAGC GAGAAGCTCC TAACTATGGA CTCATAATCA AGAAACCAAT GGACTTGAAC ACGGTGATGA AAAAGTTGAA ATCGTTCGCC TATAATTCGA AACAGGAGTT TGTGGACGAC CTCATGTTAA TCTGGTCCAA CTGCTTGACA TACAATACAG ATCCTAAACA TTTCTTACGA GCTCATGCTA TAGCCATGCA AAAAAAGACA TCCAAATTAA TTCCTACGAT TCCCAATATA ACCATTAGGA GTAGACAAGA AGTCGAAAGA GAAGAAGAAC TAGAAAACGA AAGAGTAGGA ACTCCTATGG CTTCGGCAGG AAAGTCTATG AAAAAAGGAA GAAAGAGAAG ACAGGACCAA ATCAAAACAG AAGTAGATCC AGCTACACCG ATAGCCACTG CTGTTGCATC TGCTATTGGC TCACCAGTGC CTGTAACAGG AGTTTCTGAA AATATAGTAG AAAATGGAAA TGTATCCAAC GATGAAGAAG AAGAGGAGGA AGAAGATAAT GAGAATATAA ATGGGGAAGC TAATGGAATC TCAGAAGAAG ACGACGAGTT AGATCCCGAG CTTCAGGCCT GGAGAACTAT TACAGCAAAA TCTCGTGCTC ATTACTGTGC TGAGAGGGCT GCCCTTTTTG ATGAGAAATT CCACTTACGG TCAGATGCCA GAGCGATAAT TCGTCAATCT CGTGAAATGA GCAACTTCAA CCAGTATTTG ACGAACAAAG AGGTCATCTC CAAGTCGAGC AACTTGTTGG AAAATGACGA GCCGTATCTA TTAGAATATG ACATCACTGG GGGATTGCCT GGTCTCAAGT ATAAGGGTAT TGATAAGGAA GAAGAAGAAA AGAGAGAACA ATCGTTGGTA GATGTTTTTT TGCAACAGGC TAATGGAGAT GCATCTAATA TCAAATCAGA TTTCGTGTTA CCCATAGACT CAGGCTTGAA CAAGATGTAC ACCGAGAATA TCAAGGAGAT GCAAGAAATC CGCAAGATCT GCTTCAAGAT CTCGCTTATT CGACAGATGC AAACGCAGCA ATTTGTTCAC CATACGCAGA TGAAACAACC TGAGATTGAG GTAATACGAG AAGTAGATGT AGATGCCGTT TCCAAGTTGC CCAACCATGA TCCGTTTACG GACCAAATCC AGTTTTCGGT GCTTCGTAGA AATATAGCGA AGATAGCGAT GCAGACGGGG TTCGAAAGTT CTGAACCGTT TGCTATTAAC ACCTTGACCC AAGTAGCAGA AAAGTACATG GGCAACTTGA TCAAGACGTT GAAGTTGCAC ACAGAGACCA GCTCTAACAA TAGATTGAAC GAAAAAGAGA TAGTGTTGCT TTCGTTGTTG GAAAACGGTA TAGACAAACC GGATGACTTG TACACTTTTG TGCAAGAAAG AATCATCAAA CAGCACGATA AACTTACAGA TTTGCGATCA AAGTTGTCGA ATTTCTTAAA GGAGTTGTTA CGACCTGGAT TGGAGAATTT CAATGAGAGA AGTTTTGAAG ATAATAGTGA GCAGTTCATG ACGGGAGACT TTTCGAATGA CTTGGGAGAT GATTTCTTTG GATTCAAGGA GTTGGGTTTG GACAAAGAGT TCAAGATGTT GAGCTCATCG ATTCCGATCT ACTTGTTACA TTCACGTTTA CACAACTCGT TCACCAACTC TGGCAGTGCC AGCAAGAGAA ACAAGTACGA AGATTTACAG GAGTACACAG CTCAGTTCTT GACAGCTGCA GACGTACACA AGCAGGTAGG CTTATTGAGA CCGTTCTATA GCAAGTTGAA TGAAAAGTCG AAGGCGCATT TTGTCAAGCT TCAGAAGAAG AAGGGAGAGC CTACAGATTT GCCAGAGGAT AACCTGTTGG TCTTGATAGA AGACGAAGAG TTGCCCCAGA AACAGCGGAA TATCAGGCCC AGATTACCTC CCACAGGAAA GATTACTGCC ATCAAGAAGA AGATAGTAGC CAACTCCTTT TTCTTGTCAG ACGACGAAGA TGAGGAGAAT GGAGCCAAGA CGGAGGATAT CAAACTCGAC GATCTTGCCA TAGATAAGAC CGATAGCTTA GGGATGAGCT CGCCGGCGTT GGCACTGGAG GCATAGCATA GAATGTATAT ATATAGCTGT ATCACTAAGA ATAGAGACGA ATGAATTGAT
|
Protein sequence | MEKLSTFEHN NPKKLFELAK RLHSSNFFES YLNETHLKVL DYIILLNNAE IWDNFLEGNC TITFSKDPEA SEVVKSEEGI ADNNDNVSDT PELLFEEGSK MVKMLALHIR YLLWEKAIDY YYKNTGSSSE QVVEVDDDFE MIDTLDNFSD DEDKEKEAEK EEKAQPKVRE VEDDYDDEDE EEEESDQKDD KDDNKDNEQS ENGIYQWKYN DEKQIVLQVP VSLVTVAPDT SSEIQTESQD GSGSTGSEPP KSSSNGDSPD SNADDQEKLI REYNKVYHNF EYDRETLIKR RKLEKSDLQL EDSKNGNADS TKDGLSGLGE ADSMGISLGT GSTSLKHLLS TIQLKRDDVP LNDHELRTLF MDVRKNRGKW ANNDRVGQEE LYEACEKVVV ELRGSTEHST PFLNKVSKRE APNYGLIIKK PMDLNTVMKK LKSFAYNSKQ EFVDDLMLIW SNCLTYNTDP KHFLRAHAIA MQKKTSKLIP TIPNITIRSR QEVEREEELE NERVGTPMAS AGKSMKKGRK RRQDQIKTEV DPATPIATAV ASAIGSPVPV TGVSENIVEN GNVSNDEEEE EEEDNENING EANGISEEDD ELDPELQAWR TITAKSRAHY CAERAALFDE KFHLRSDARA IIRQSREMSN FNQYLTNKEV ISKSSNLLEN DEPYLLEYDI TGGLPGLKYK GIDKEEEEKR EQSLVDVFLQ QANGDASNIK SDFVLPIDSG LNKMYTENIK EMQEIRKICF KISLIRQMQT QQFVHHTQMK QPEIEVIREV DVDAVSKLPN HDPFTDQIQF SVLRRNIAKI AMQTGFESSE PFAINTLTQV AEKYMGNLIK TLKLHTETSS NNRLNEKEIV LLSLLENGID KPDDLYTFVQ ERIIKQHDKL TDLRSKLSNF LKELLRPGLE NFNERSFEDN SEQFMTGDFS NDLGDDFFGF KELGLDKEFK MLSSSIPIYL LHSRLHNSFT NSGSASKRNK YEDLQEYTAQ FLTAADVHKQ VGLLRPFYSK LNEKSKAHFV KLQKKKGEPT DLPEDNLLVL IEDEELPQKQ RNIRPRLPPT GKITAIKKKI VANSFFLSDD EDEENGAKTE DIKLDDLAID KTDSLGMSSP ALALEA
|
| |