Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_56141 |
Symbol | HIR1 |
ID | 4837333 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 155692 |
End bp | 158839 |
Gene Length | 3148 bp |
Protein Length | 959 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388648 |
Product | protein involved in cell-cycle regulation of histone transcription |
Protein accession | XP_001382260 |
Protein GI | 150863699 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0125544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATCC TCAAGCTACC GTGGTTTGGA CACAAAAGTA TGTTCCAATT CACTTCCTTA GAGACTTTAC ATTATTGCTA GCAGAGCATT TTGTAAGTGA AATATCCATG TTTCAGTACT AACAATATCA GCTGAGAATA AGAACATTGA GTGTTATGCA GTCAGTATCA ATGCTGATGG AACACGTCTC GCCTCTGGAG GGCTTGACGG CAATGTCAAG ATCTGGGATA CTAGCACCAT CAACCCGTTC TTGAAATTGT GCTCAGATTC GGGTGCAACT TCCAGTAAAC CAGAATCTTC AAATAAAAAG AGAAAATTAG AACCAACACC TGTGCCGAGT TCAAGACTCG AAGATAAAGA TTTGCCGGTC GAATCCTTGA GGCGGCCCAT GTGCTCCATG AGTAGACACA ATGGTGTTGT AACCAGTGTG AAGTTTTCAC CTGATGGTCG TTTTCTCGCA TCTGGTTCGG ATGACAAGAT TTGTTTGATC TGGGAGAAAG ATGAAGAGCA ATCGAATCGA CCCAAGCAGT TTGGAGAAGT AGTTGCAGAT TTGGAGCATT GGACCGTACG AAAGAGATTG GTGGCTCATG ATAATGATAT TCAAGATATC TGTTGGTCGC CTGATGGAGG ATTGTTGGTT ACTGTAGGAT TGGATAGACT GATTATCATA TGGAATGGGC TTACTTTTGA GCGTATTAAA CGGTACGATA TCCACCAGTC TATGGTCAAG GGTGTAGTTT TTGATCCCGC CAATAAATTC TTTGCTACAG CTTCTGACGA TAGAACGGTA CGTATTTTCA GATACTACAA GAAGTTAAAC GAGTATAACA ACTACGAATT TCAGATGGAA CACATTGTGA TGGACCCGTT CAAGAAGTCG CCGTTAACCT CATATTTCCG AAGGATGAGT TGGTCACCCG ATGGGCAACA TATTGCTGTA CCTAATGCGA CCAATGGTCC TGTTCCTTCT ATTGCCATTA TCCGTAGAGG TAATTGGGCC ACTGATATCT CTTTGATCGG CCACGAAGCT CCGTGTGAGG TTTGTTCCTT TTCTCCACGG TTGTTTGACA TTTCTGAAAC TACCAAGAAA ACTACAAGTG ACTCTCAGTT TTCAACGATT CTTGCAACTG GTGGACAGGA CCAAACCTTA GCCATATGGT CTACCGCCAC AAGTAGACCG CTTGTTGTTG CAGAAAACAT AGTCAACAGC TCTATAACCG ATATCTGCTG GACTCCTGAT GGCCAGGCCC TCTATCTCAG TTGTTTAGAT GGATCCATTA CGTGTGTATC TTTTGACAAG AACGAGCTCG GTAGGGTCGT AAACGAGGAC ATTATCGATG CTCAGCTCCA TAGATATGGA GCTGACCGAG AATCGGCAAT TTTCCCAGAA AGTGTAGAGC AATTGCAGTT GGAATATCGA TCGGAGTCTA AGTTGAGAAA TGATAACAAG TTGTTGCTTA AACCTTCACT TTTGTTACAA AACAAGCCTG TTTCAAATAA AACCGAAGCT GAACCAGTTG TCAGTTCGTC TCCAAAGATC ATTGACTTAC CTATAAAAAG TGGACCATCT GAAAAGAACG ACAAATCTAT AAAACCAGTA AATAGAATAA CCCAGAATGT GGTTATCATG AAGAATGGCA AGAAAAAAGT GGCACCAACA TTGGTGTCGG CATCGTCTAC TAAATCATCC ACCTTTAGTT CTACTTCTAA GATGAATTCG ACTTCCAAGA AGATACAGCT CAGTTCCAAG ATGTCACAGT CAGCATACTT CTTGCCGAGG TTAGGTATTC AGACTTCTGT GCATGGATTT AAGATAAAGA ATGTAAACAA CCAGAGTATG AACAATGACC AGAATGAAGG TCAGGACAAT GACAATGAGG ATATGGGTAT TGACGAAAAC ACTGGCAATT CAGCTAGCCT GGCCAATATT TCAGAAGCAA CTTTGAAGAG ACAGAGAAAT AAAATCAAGC GTATGGTGAT GGAGGTAAGA TATCCCAGCT GTTTCAAATT TGTCACTCAG CTTCCGGAAG CTCTTTTCAA CAACCAATCG TTGATCAAAC ATGAAGTCCA GACTATCATC AATAACATTT CCAACAATGC AAAGGAGATG CCAGCCGAGT TTTCTAGTAG TACTCTTTTG GATGTTGAGG AAGAGCTTCT ATTTTCAGTT GTGTTACGTT CCGTAAGTCA CAATCACGAA ACAAATAAAG TCCTTGAAGA AAATGGAGAT AAAGACGTCA AAACTGTCTT GACCACTATC GAAATCAGAA ATGGACAGAG ATGGCGAGCC TCTGATGATG ACTTAGAATA TGATGATAGT ATTGATTTTG ATGACCCTAC GAGAGTATTA GTTTCAGATA ACAATTCCAA CAAGTTAAGA AAGTATACAC TTTTCTTTCC TTTTAGAATT CAGCACGTAT TGCCGTTGGT ATTAGATGAC CAGCTCAAGT TTATTGTCTT TGTTTCATTT GAAGGAACAA TTCAGATAGT ACGTGCTGAG TCTGGGACTT ACCATAGTCC CAGCTTTTCA TTGGGAGGGA ACGTTGTAAC GTTAATAAGC AGAGGCGAGA ACTTGATGGT TTTAACCAAT CGCGGTCTTA TATATACTTG GAAGCTCTCT AGTTCACAAG GAGGTATGAA GTGTATCATA CGAGGCGTCT CGATAGCTTC AGTACTAAAC ACAGTAGAAG TTCCTGTTAT TGCTAAAAGT GTACCGGTCA ACGGAGTCAA TGGCAGTGCT AATGGTTTAT CAATCAATCT TGGCGGTTCA AAGAAACCAT TATCGTTGGT GATGCCTAAT GTTCGTGTTA TCGACATTAA TCCAGAAGAT GGGTCACCGT ATATAATCAC TGATTCCACA GGAGATATAT TTTCATACTC CATCTCTCTT GGTTGTTGGA CCAAGATAGT TGATTCGTGG TATTTTCTCG CAGTAGATGA CGATTATCGA TTGGATGATA CCAGTACTGA AAACAAGAAA TTGGTGGATC AGTTGATCTG TAAGTCACTT GCCAAGTTTA CAGACGATGT CAGGAGGCTT AAGATTAATA GTTACGATTT TTCCAAGGAT GGGGATGGTG TTGATGAATT ATATGATACA ATGCTTAGTC GGTTTCAAGA AGTCCTTGAA ACGAAAGGTA TTAGTTAA
|
Protein sequence | MRILKLPWFG HKTENKNIEC YAVSINADGT RLASGGLDGN VKIWDTSTIN PFLKLKLEPT PVPSSRLEDK DLPVESLRRP MCSMSRHNGV VTSVKFSPDG RFLASGSDDK ICLIWEKDEE QSNRPKQFGE VVADLEHWTV RKRLVAHDND IQDICWSPDG GLLVTVGLDR SIIIWNGLTF ERIKRYDIHQ SMVKGVVFDP ANKFFATASD DRTVRIFRYY KKLNEYNNYE FQMEHIVMDP FKKSPLTSYF RRMSWSPDGQ HIAVPNATNG PVPSIAIIRR GNWATDISLI GHEAPCEVCS FSPRLFDISE TTKKTTSDSQ FSTILATGGQ DQTLAIWSTA TSRPLVVAEN IVNSSITDIC WTPDGQALYL SCLDGSITCV SFDKNELGRV VNEDIIDAQL HRYGADRESA IFPESVEQLQ LEYRSESKLR NDNKLLLKPS LLLQNKPVSN KTEAEPVNDK SIKPVNRITQ NVVIMKNGKK KVAPTLVSAS STKSSTFSST SKMNSTSKKI QLSSKMSQSA YFLPRLGIQT SVHGFKIKNV NNQSMNNDQN EGQDNDNEDM GIDENTGNSA SSANISEATL KRQRNKIKRM VMEVRYPSCF KFVTQLPEAL FNNQSLIKHE VQTIINNISN NAKEMPAEFS SSTLLDVEEE LLFSVVLRSV SHNHETNKVL EENGDKDVKT VLTTIEIRNG QRWRASDDDL EYDDSIDFDD PTRVLVSDNN SNKLRKYTLF FPFRIQHVLP LVLDDQLKFI VFVSFEGTIQ IVRAESGTYH SPSFSLGGNV VTLISRGENL MVLTNRGLIY TWKLSSSQGG MKCIIRGVSI ASVLNTVEVP VIAKSKPLSL VMPNVRVIDI NPEDGSPYII TDSTGDIFSY SISLGCWTKI VDSWYFLAVD DDYRLDDTST ENKKLVDQLI CKSLAKFTDD VRRLKINSYD FSKDGDGVDE LYDTMLSRFQ EVLETKGIS
|
| |