Gene PICST_56141 details

Gene Information

Locus tag: PICST_56141
Symbol: HIR1
ID: 4837333
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 155692
End bp: 158839
Gene Length: 3148 bp
Protein Length: 959 aa
Translation table: 12
GC content: 41%
IMG OID: 640388648
Product: protein involved in cell-cycle regulation of histone transcription
Protein accession: XP_001382260
Protein GI: 150863699
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
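
The gene length listed above is consistent with the start and end coordinates if they are read as 1-based and inclusive. The sketch below checks that arithmetic and estimates how much of the span is non-coding, given that the gene is marked as spliced. The function names are hypothetical, and the intron estimate assumes the coordinates run from the start codon through the stop codon with no untranslated regions included.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return protein_aa * 3 + 3


# Values taken from the record above.
start_bp, end_bp, protein_aa = 155692, 158839, 959

span = gene_length(start_bp, end_bp)
cds = coding_length(protein_aa)

# Assumption: everything in the span that is not coding sequence is intronic.
intron_estimate = span - cds

print(span)             # 3148
print(cds)              # 2880
print(intron_estimate)  # 268
```

Under these assumptions roughly 268 bp of the 3148 bp span would be intronic, which fits a spliced gene encoding a 959 aa protein.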


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0125544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATCC TCAAGCTACC GTGGTTTGGA CACAAAAGTA TGTTCCAATT CACTTCCTTA 
GAGACTTTAC ATTATTGCTA GCAGAGCATT TTGTAAGTGA AATATCCATG TTTCAGTACT
AACAATATCA GCTGAGAATA AGAACATTGA GTGTTATGCA GTCAGTATCA ATGCTGATGG
AACACGTCTC GCCTCTGGAG GGCTTGACGG CAATGTCAAG ATCTGGGATA CTAGCACCAT
CAACCCGTTC TTGAAATTGT GCTCAGATTC GGGTGCAACT TCCAGTAAAC CAGAATCTTC
AAATAAAAAG AGAAAATTAG AACCAACACC TGTGCCGAGT TCAAGACTCG AAGATAAAGA
TTTGCCGGTC GAATCCTTGA GGCGGCCCAT GTGCTCCATG AGTAGACACA ATGGTGTTGT
AACCAGTGTG AAGTTTTCAC CTGATGGTCG TTTTCTCGCA TCTGGTTCGG ATGACAAGAT
TTGTTTGATC TGGGAGAAAG ATGAAGAGCA ATCGAATCGA CCCAAGCAGT TTGGAGAAGT
AGTTGCAGAT TTGGAGCATT GGACCGTACG AAAGAGATTG GTGGCTCATG ATAATGATAT
TCAAGATATC TGTTGGTCGC CTGATGGAGG ATTGTTGGTT ACTGTAGGAT TGGATAGACT
GATTATCATA TGGAATGGGC TTACTTTTGA GCGTATTAAA CGGTACGATA TCCACCAGTC
TATGGTCAAG GGTGTAGTTT TTGATCCCGC CAATAAATTC TTTGCTACAG CTTCTGACGA
TAGAACGGTA CGTATTTTCA GATACTACAA GAAGTTAAAC GAGTATAACA ACTACGAATT
TCAGATGGAA CACATTGTGA TGGACCCGTT CAAGAAGTCG CCGTTAACCT CATATTTCCG
AAGGATGAGT TGGTCACCCG ATGGGCAACA TATTGCTGTA CCTAATGCGA CCAATGGTCC
TGTTCCTTCT ATTGCCATTA TCCGTAGAGG TAATTGGGCC ACTGATATCT CTTTGATCGG
CCACGAAGCT CCGTGTGAGG TTTGTTCCTT TTCTCCACGG TTGTTTGACA TTTCTGAAAC
TACCAAGAAA ACTACAAGTG ACTCTCAGTT TTCAACGATT CTTGCAACTG GTGGACAGGA
CCAAACCTTA GCCATATGGT CTACCGCCAC AAGTAGACCG CTTGTTGTTG CAGAAAACAT
AGTCAACAGC TCTATAACCG ATATCTGCTG GACTCCTGAT GGCCAGGCCC TCTATCTCAG
TTGTTTAGAT GGATCCATTA CGTGTGTATC TTTTGACAAG AACGAGCTCG GTAGGGTCGT
AAACGAGGAC ATTATCGATG CTCAGCTCCA TAGATATGGA GCTGACCGAG AATCGGCAAT
TTTCCCAGAA AGTGTAGAGC AATTGCAGTT GGAATATCGA TCGGAGTCTA AGTTGAGAAA
TGATAACAAG TTGTTGCTTA AACCTTCACT TTTGTTACAA AACAAGCCTG TTTCAAATAA
AACCGAAGCT GAACCAGTTG TCAGTTCGTC TCCAAAGATC ATTGACTTAC CTATAAAAAG
TGGACCATCT GAAAAGAACG ACAAATCTAT AAAACCAGTA AATAGAATAA CCCAGAATGT
GGTTATCATG AAGAATGGCA AGAAAAAAGT GGCACCAACA TTGGTGTCGG CATCGTCTAC
TAAATCATCC ACCTTTAGTT CTACTTCTAA GATGAATTCG ACTTCCAAGA AGATACAGCT
CAGTTCCAAG ATGTCACAGT CAGCATACTT CTTGCCGAGG TTAGGTATTC AGACTTCTGT
GCATGGATTT AAGATAAAGA ATGTAAACAA CCAGAGTATG AACAATGACC AGAATGAAGG
TCAGGACAAT GACAATGAGG ATATGGGTAT TGACGAAAAC ACTGGCAATT CAGCTAGCCT
GGCCAATATT TCAGAAGCAA CTTTGAAGAG ACAGAGAAAT AAAATCAAGC GTATGGTGAT
GGAGGTAAGA TATCCCAGCT GTTTCAAATT TGTCACTCAG CTTCCGGAAG CTCTTTTCAA
CAACCAATCG TTGATCAAAC ATGAAGTCCA GACTATCATC AATAACATTT CCAACAATGC
AAAGGAGATG CCAGCCGAGT TTTCTAGTAG TACTCTTTTG GATGTTGAGG AAGAGCTTCT
ATTTTCAGTT GTGTTACGTT CCGTAAGTCA CAATCACGAA ACAAATAAAG TCCTTGAAGA
AAATGGAGAT AAAGACGTCA AAACTGTCTT GACCACTATC GAAATCAGAA ATGGACAGAG
ATGGCGAGCC TCTGATGATG ACTTAGAATA TGATGATAGT ATTGATTTTG ATGACCCTAC
GAGAGTATTA GTTTCAGATA ACAATTCCAA CAAGTTAAGA AAGTATACAC TTTTCTTTCC
TTTTAGAATT CAGCACGTAT TGCCGTTGGT ATTAGATGAC CAGCTCAAGT TTATTGTCTT
TGTTTCATTT GAAGGAACAA TTCAGATAGT ACGTGCTGAG TCTGGGACTT ACCATAGTCC
CAGCTTTTCA TTGGGAGGGA ACGTTGTAAC GTTAATAAGC AGAGGCGAGA ACTTGATGGT
TTTAACCAAT CGCGGTCTTA TATATACTTG GAAGCTCTCT AGTTCACAAG GAGGTATGAA
GTGTATCATA CGAGGCGTCT CGATAGCTTC AGTACTAAAC ACAGTAGAAG TTCCTGTTAT
TGCTAAAAGT GTACCGGTCA ACGGAGTCAA TGGCAGTGCT AATGGTTTAT CAATCAATCT
TGGCGGTTCA AAGAAACCAT TATCGTTGGT GATGCCTAAT GTTCGTGTTA TCGACATTAA
TCCAGAAGAT GGGTCACCGT ATATAATCAC TGATTCCACA GGAGATATAT TTTCATACTC
CATCTCTCTT GGTTGTTGGA CCAAGATAGT TGATTCGTGG TATTTTCTCG CAGTAGATGA
CGATTATCGA TTGGATGATA CCAGTACTGA AAACAAGAAA TTGGTGGATC AGTTGATCTG
TAAGTCACTT GCCAAGTTTA CAGACGATGT CAGGAGGCTT AAGATTAATA GTTACGATTT
TTCCAAGGAT GGGGATGGTG TTGATGAATT ATATGATACA ATGCTTAGTC GGTTTCAAGA
AGTCCTTGAA ACGAAAGGTA TTAGTTAA
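
A GC content figure such as the one in the gene information can be recomputed from a sequence listed in this blocked layout. The following is a minimal sketch: `gc_percent` is a hypothetical helper, it ignores the spaces and line breaks used for display, and it counts only unambiguous A, C, G and T bases. How the database itself rounds or computes the reported value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are skipped, so the sequence can be
    pasted in directly with its 10-base blocks and line breaks.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustrative input, not the full gene sequence.
print(gc_percent("ATGC ATGC"))  # 50.0
```

Pasting the full gene sequence into `gc_percent` should give a value that can be compared with the reported GC content.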
 
Protein sequence
MRILKLPWFG HKTENKNIEC YAVSINADGT RLASGGLDGN VKIWDTSTIN PFLKLKLEPT 
PVPSSRLEDK DLPVESLRRP MCSMSRHNGV VTSVKFSPDG RFLASGSDDK ICLIWEKDEE
QSNRPKQFGE VVADLEHWTV RKRLVAHDND IQDICWSPDG GLLVTVGLDR SIIIWNGLTF
ERIKRYDIHQ SMVKGVVFDP ANKFFATASD DRTVRIFRYY KKLNEYNNYE FQMEHIVMDP
FKKSPLTSYF RRMSWSPDGQ HIAVPNATNG PVPSIAIIRR GNWATDISLI GHEAPCEVCS
FSPRLFDISE TTKKTTSDSQ FSTILATGGQ DQTLAIWSTA TSRPLVVAEN IVNSSITDIC
WTPDGQALYL SCLDGSITCV SFDKNELGRV VNEDIIDAQL HRYGADRESA IFPESVEQLQ
LEYRSESKLR NDNKLLLKPS LLLQNKPVSN KTEAEPVNDK SIKPVNRITQ NVVIMKNGKK
KVAPTLVSAS STKSSTFSST SKMNSTSKKI QLSSKMSQSA YFLPRLGIQT SVHGFKIKNV
NNQSMNNDQN EGQDNDNEDM GIDENTGNSA SSANISEATL KRQRNKIKRM VMEVRYPSCF
KFVTQLPEAL FNNQSLIKHE VQTIINNISN NAKEMPAEFS SSTLLDVEEE LLFSVVLRSV
SHNHETNKVL EENGDKDVKT VLTTIEIRNG QRWRASDDDL EYDDSIDFDD PTRVLVSDNN
SNKLRKYTLF FPFRIQHVLP LVLDDQLKFI VFVSFEGTIQ IVRAESGTYH SPSFSLGGNV
VTLISRGENL MVLTNRGLIY TWKLSSSQGG MKCIIRGVSI ASVLNTVEVP VIAKSKPLSL
VMPNVRVIDI NPEDGSPYII TDSTGDIFSY SISLGCWTKI VDSWYFLAVD DDYRLDDTST
ENKKLVDQLI CKSLAKFTDD VRRLKINSYD FSKDGDGVDE LYDTMLSRFQ EVLETKGIS
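
The record specifies translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The sketch below builds that codon table and translates the first 30 nucleotides of the gene sequence. The names `CODON_TABLE_12` and `translate_table12` are hypothetical. Because the gene is spliced, translating the whole genomic sequence straight through would not reproduce the protein; the example assumes the first 30 nucleotides lie within the first exon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order under translation table 12.
# It matches the standard code except that CTG encodes S (Ser) instead of L (Leu).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(nt: str) -> str:
    """Translate an in-frame nucleotide string; whitespace is ignored."""
    seq = "".join(nt.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE_12[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks of the gene sequence above.
first_30_nt = "ATGCGTATCC TCAAGCTACC GTGGTTTGGA"
print(translate_table12(first_30_nt))  # MRILKLPWFG
```

The output matches the first ten residues of the listed protein sequence. Translating further requires removing the introns first, whose positions are not given in this record.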