Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_83887 |
Symbol | HIR2 |
ID | 4839451 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 861781 |
End bp | 865900 |
Gene Length | 4120 bp |
Protein Length | 993 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390766 |
Product | Histone transcription regulator HIRA, WD repeat superfamily |
Protein accession | XP_001385175 |
Protein GI | 150865807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.520766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATACT TCCGCTTCCC GCCGCTTCTC CACGGAGGCG AAGTCCACAC CGTTGATATC GACCCCACCA ACGAATGGCT TGCTACGGGA GGCTTGGACC ATATCATCAA CATCTGGAAA CTTTCTGATC TCGTCAACTT AGCCAGAATA CTGCCATTAG AAAAGAACGA AGATAATGTA ATTCAAAAAG AAAACGGCTT TGCTATCAAT GAGAACAATG CTGAGGCCAG AATTGGAATT CCTGAAAATC CTCAGATCAC TGAGGCCAGC ATCAGACCTG TCGAAATAAT TGATCTCGAC GAGGATGACA AAAAAGATGA ATCGAAAGAT GGATTCAAAA TTGACGAAGG CACATTGAGT GACAGAGTTG AAATCAAAGA ATTAAAAGTT AATACTGTTG ACACCAAGAG CGATAATCTC AATAACGAGC CTACTAAAAT TGAGAATGAT CATCAGCGGC ATTCTGTTTC TTCTGATAAC ATACAAACTT CCCAAGATTC ATTGACTTCG AAAGTCAATA GCATTTCACC CGTATACACC TTGAAAACCC ACAAAGCTGT AGTTTCCACG ATCAAGTTTT CTCCTAAAAA CAGTAAAGAG CTTGTCTCGG CAGACACGAA GGGAAATATC TACTTGCACA ACCTTGAAAA GAACAGCCAA ACCCTTCTCT ATCCTTTCAA TGAGGAACAG AAAGCCTCTG TAGTTGACTT AAGCTGGTCG ATGGATTCTC GTCTTGTAGC CTGGAGCACC ATTGAAGGTA AAGTGAATGT CATAGACGTT ACAAAGAATA CGTTTCAGGA ATTGACGGAA TTGACCCATT TGGAAAAACT TACCGTTCAA AGAAGCATTG CCTTTGATCC TACTAACAAC TATTTGATCA CTTTGGGTGA CGATACATTG GTTTACTTGT ACCAGTACAC GTATGATACT GCCTTGGACA ACTACCAGTT TCGCTTGATC AACAAGATCT CCCGTCTCAT CAACAAAAAT CCCATAAACG TAAACTACAA GCGGATCTCG TGGTCTCCAG AAGGTGAACT TCTTTCAGTT CCAACAGCCT CAAAAAATCA GACTTCGTTA ATCTCGTTGA TTTCTCGTTC TAAAAACTGG CAAAACAGAA TCAGCTTGGT AGGACATGGG CTTGCATGTG AAGTAGTACG ATTTCATCCC AAATTCTTGC GAGAAGGGAC TGATGATACC GCTTTCTATA ATGTGATTGC TACCGGAGGC TCAGACAAGA CCTTGGCCAT ATGGAACACT TCTAAAGATA CACCCGTAGT CGTCTTACAA GACGTCGTCG ACAAGCCAAT ACTTGATCTT GTATGGGATA AAACAGGTAC TTCTTTGATT GTTGCTACGT TAGATGGACA TTTAGGAATA GCTTCGATCG AAAACAACGA ATTGGGTCAC GAGATATCGC AAGACATGTT GGAAGAGCTA AAGAAATTCG ACCAAGAATA CATAAAGCCT ATAAACCATA AATATGAACA CGATCAACTG ACGACAAGAA GAGGTGAAAA ACACCAGATC GAGTTATTGG ATCAGAAAGA CGCTAAGAGC ACGATACATA GCGAACAGAA TGAAGAAAAA GACCAAAACA AGGATGAACT GAGTGAGAAA GAAGCAAACT CCAGTCCAAC TAATAGTCAG CCTGAGGCTA TAAGTAATGG ACCCATAGAA CCATCAGTCA TACCTCCACC AAACATGACA GAACCAGACA CTTCAGCGAC GGATATCTTA CATTCTGCCA TGAGCAGCAG GCAATCAAAA TCTACCACTA GTAAAACGAC CAAGACAGCA AAGACTACCT CCATAGCATC GGCATCTTCA ATTACCGTGC CACCTTCAGA TTCGAAAAGC GCCCAGAAGC AGGAAGTAAC CACGAAAAAC GGGAAACGTA GAATTCAGCC TATGCTTATT TCGAACAACG GAACCACGAA ACCTGCCATA GCTTCATCTG AATCAAGTTT GGGTAACAAT TCAACGGTCC AGTCGTCTTC AAAATCTCTA ATGGAATTTG ACAAGCCTTC TTATTCAGTT GAAGAAGACT TCTATAAACA GAACAAGAGG TTGAAAGCAC AAGAAGAAGC TGGCTCCAAT AAGAAAATTA AACGTGAACT CGAACCAGTT AAATTTATTG GATCAGTTAT CACAAATCCT AATACTACTT TTTCAAAGGT GAGACTTTCA GTTCCCAAAG TAAGATTAAA CTTTCAGATT CTGAGCAAAT TTGATGGTGA AGTCTTCATA ATGGATATCA AGAATGGAAC TGGAAACGAG ACTAAACCTT CCAGAATCAC ATACTTCAAA AAAGACAAGC AGCTTTGGTG CGATTTTATT CCGAGATACA TCCAATTGGC TGTCGAGGGG TCCAATTTTT GGGCATTGAG TACTTCCGAT GGTCAAATTT TGACTTATTC CCATACATCA GGTAAAAGAT TGTTACCTCC ACTAGTTCTT GGATCGCCTG TGTCTTTCCT TGAAAGTCAC AGTAAGTACC TTATGGCCGT AACATCTTTG GGCGAATTGT TTGTTTGGGA TTTAGAAAAG AAAAAGATCG AGTTGTCTAC TTCGTTGACT CCATTATTGG AACTCAGTAG CAAATATCAT GAAGATGGTT TGTCCAAGTC AGATAACATC ACCCTATGCG CTGTCACATC TGCTGGAATT CCATTAGTGA CGCTATCGAA TTGCTCAGGG TATCTCTTCA ACAAAAATTT ATGCATTTGG CAGACAATTA CAGAGTCGTG GTGGTCGTTT GGTTCTCACT ATTGGGAAAG CAATGACGAG AACAGCAAAA AGCCTCAAAC ATCGAACTTG TTTGGTGAAG AAGCTTCCAT TATTGAACTA TTGGAACATA AAACCAACGA AGAAATTATT AGAAAAACGA GAACTGGACG AGGCAAGTAC TTCAACAAGA TCTCAAAAAA CATGATAATG AAGGAAGGAT TTGAAAATCT TGAAAACACC ATTTCAATTA GCCACTTGGA AAATAGGATA TTATGCTGTG AATTGCTAGG TGAGTTTAAG GATTTCAGAA GATTCTTCTT GACATACGTT CAAAGAATAT GTGAATTAGG CTACAAAACG AAACTATTTG AGGTTTGTGA CGAGCTTTTA GGGCCAGACA GCCAACAAGA AACAGATGTC AATTCCAGGT CGGCTTCAGG ATGGTCTTCT AGCATCTGTG GAGTAGACAA GCATGAGTTA CTTAAGGAAG TCATTCTACT ATGTGCCAAA CATAGAGATG CGCAACGTAT TCTTATTCAT TTTGGTAAGA AGATCGGCGT GGTTAATGAC GTTTTGTAAT ATGTACGATA ACATTTAACT ATTTACAATG TATTATTACT CCAATTATAT ACAATTTGGA CTCTATAATT TATAGGTGTT TGATTCATCC CACTCTTTGT CACCTAGCAA ACCGCCCACT TGACCTTGTT GATTGTTAAT CAAGTCTTTA GGACCAGCAA CATAGCTTAC GTACCCTTCT GGTCCACATA CTAACGATAA ACTTGGCGAT GTTTTTTGTT CCTTACTGGT GGCTATAGCT TGTTGTATAG CATTTACATA TCTTGGAGCT CTTTCTTCGT TCAATTGTAC CTTACTTTGA ATTGTTGGTT CCTCTCCCTC CAAAATCGAC ATTCTCAACT TCAAAGCTTC TTCGGGCGTA AGGTCAGCTT CCTTCTGTTC TAATTTTAGA GGGCTAGTAT ACGCACTTTT AGATGGCTTT GAAACGTCCT TAGCTGTTAG GATAGTCTTA TTCTCGGAAT CGTAATGGCG AATCAATTTA ATTCTGTCTA ACTTTTCCAA AAAGAAAAGG AATCTCTCTA ATGGCTCCAA TTCCCCAGGT TTTTGGGCTG AGTAGTGAAT GTTGACAAAT CCCCTGTACG GATTACGAGA AAGCAAGACT TGAAGAATAG GAGCTATACC AGTTCCCGCA GCAAAAAAGT TTAAATTATC GAAAGCTGGT AAGTCGTTGG CCTTTATAGT CTTTTGAAGC AAGTGTTCAG GTTCAATTTT AGAAGGAAGA TCACGAAATA TTGGTCTAGT ATGGTAATTT TTAAGGGGAT GGTAAGGAAA TCTGTACTCC ACATTAGGAC CTCTCAATTC GATTTCTTCT
|
Protein sequence | MKYFRFPPLL HGGEVHTVDI DPTNEWLATG GLDHIINIWK LSDLVNLARI SPLEKNEDNH QTFNSISPVY TLKTHKAVVS TIKFSPKNSK ELVSADTKGN IYLHNLEKNS QTLLYPFNEE QKASVVDLSW SMDSRLVAWS TIEGKVNVID VTKNTFQELT ELTHLEKLTV QRSIAFDPTN NYLITLGDDT LVYLYQYTYD TALDNYQFRL INKISRLINK NPINVNYKRI SWSPEGELLS VPTASKNQTS LISLISRSKN WQNRISLVGH GLACEVVRFH PKFLREGTDD TAFYNVIATG GSDKTLAIWN TSKDTPVVVL QDVVDKPILD LVWDKTGTSL IVATLDGHLG IASIENNELG HEISQDMLEE LKKFDQEYIK PINHKYEHDQ STTRRGEKHQ IELLDQKDAK STIHSEQNEE KDQNKDESSE KEANSSPTNS QPEAISNGPI EPSVIPPPNM TEPDTSATDI LHSAMSSRQS KSTTSKTTKT AKTTSIASAS SITVPPSDSK SAQKQEVTTK NGKRRIQPML ISNNGTTKPA IASSESSLGN NSTVQSSSKS LMEFDKPSYS VEEDFYKQNK RLKAQEEAGS NKKIKRELEP VKFIGSVITN PNTTFSKVRL SVPKVRLNFQ ISSKFDGEVF IMDIKNGTGN ETKPSRITYF KKDKQLWCDF IPRYIQLAVE GSNFWALSTS DGQILTYSHT SGKRLLPPLV LGSPVSFLES HSKYLMAVTS LGELFVWDLE KKKIELSTSL TPLLELSSKY HEDGLSKSDN ITLCAVTSAG IPLVTLSNCS GYLFNKNLCI WQTITESWWS FGSHYWESND ENSKKPQTSN LFGEEASIIE LLEHKTNEEI IRKTRTGRGK YFNKISKNMI MKEGFENLEN TISISHLENR ILCCELLGEF KDFRRFFLTY VQRICELGYK TKLFEVCDEL LGPDSQQETD VNSRSASGWS SSICGVDKHE LLKEVILLCA KHRDAQRILI HFGKKIGVVN DVL
|
| |