Gene PICST_82367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82367 
SymbolHRK1 
ID4837085 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp670280 
End bp673180 
Gene Length2901 bp 
Protein Length707 aa 
Translation table12 
GC content42% 
IMG OID640388400 
ProductHygromycin Resistance Kinase 
Protein accessionXP_001382892 
Protein GI150864174 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.263404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AAGGTTGCAC CTGACGTTCG AAAAACGACC TAGCTGCTGC CCTAGTGCTT TTCAGCCGTT 
TCTGGTAGCC AATAGGAGCT GAGAGTCAAA TTGACAACTA ACTCATTGAA AGTTCGAAGT
CACAAGAGAT ATCCACCAGA GAAACATCTT TTATACTGGT ATTGGTTATT CGAAAGAGAG
ACTTCATAAT AACTTACTGA CACTGGCTCT ATAGTTAGCC ATCAACTCAT ACAGTAGCTT
TACTTTAGAA TACCACACTA TAGTTTACTA TAGCGTTCCC ATTATAGCTT GTCATCCTTA
CTAATTTCCC TGTTGTTTGT CGTTTATCTG ACCATACCAT TATCATTATT GTTAGCTATT
GTCACATCAA TAGTGATTCA TTTCCTACAA TTGTCACCTC AACACATAGT AATCATTCAA
TTGTTGGTTG TTTGATTCGA ACTTCATAAT CTACCCAGTA TCATTCAAGT GTGGTTCTAG
TATCGCTTAG AACACTCTTA TCAATTCGTA TCATAGTATC ATTTGTCAGT ATCAACTACA
ATACTGCTAT CCATTAGTTT CTTCCACTAT TTATTAGTAT CATCTACCTC ATTTACGTCA
TCTAATATCT ATAGTCTACT TCCATCATGC CTGACAAACA CAAACTCAAG CTCTTCCGCA
AGGCTGAAGA CGACTTGACA CCTTCAACCT CCAACCACTC CTCTGTTGGA AGTAGAAAGT
TTCTTGGTTT CCACATTGGA AGACACGAGT CTGGTGAGTC CCTCACTTCA CCAACTCTCA
CGAACTCTTC AGACCACCAA CAGACTCACT ACAACAACCC TGGTGTTTCT GCAAGTGTAG
GCACTGGTAC AACAGCTACG ACTATTGTAA GTGCCAGTGG AATCCATTCT CCTTTTGCAG
CGGGCTCTGG CTCTGGCTCT GGCTCGCTCG GATCTCAACC AATCGTTTCT CCACAGCCGC
AAGTTCATGC CTCATATCTC AACAAACAGG ACTCGAATCA TACGTTCATG AAATCAAACT
CAATGGTTGA ATTGAAACGT TTCTTCAAGC CAAGTAAGAA AACAACTTCT AATACTCGTA
AGCATGATTA CTCTAATCAA CTCCACTCAC CACCTCCAAT GGTTAGTGGT AAGGAAATTA
TGGGAAATGC TCTCAACAGC GGCCACCCAA GTCGTGAACC CTCTTCTACA AGTTTAGCTA
CGTTGATTAA TCAGACGTCG TCGCAGTTGC TTCATAACGC TTCTCACCAC CACTCTTCTG
GAAAGGATCC CTTCACAGAT GACAACTCGC CGCTTGTGAA AAAGTACGGT AAAATGGGCA
AAGAGTTGGG AAGCGGTGCT GGTGGTTCTG TAAGGTTGAT CACGAGACCA TCTGATTCCA
AAACTTTTGC TGTTAAAGAG TTCAGACCCC GTAGAAGTAC TGAAACACTT AAGGATTACA
CCAGAAAGTG TACTGCCGAG TACTGTATCG GTTCTACATT GAAGCATCCC AACATTATCA
AGACAATTGA CATCATTCAC GAGAACAGCA GGTACTTTGA AATCATGGAA TATGCTCCAA
TTGACTTCTT TGCTGTTGTG ATGAGTGGAG AAATGTCACG ACAAGAGATC AATTGTTGTC
TCAAACAGAT ATTGGAAGGA GTCAGCTATT TGCATAACTT GGGATTGGCC CATCGTGATT
TGAAGTTGGA CAATTGTGTG ATAACCGATG AAGGTATTCT CAAGATTATT GACTTTGGTT
CTGCTGTTAT ATTCAAGTAC CCATACGATC AGTATGGCAA CAACAAGGAT ACGATTCATC
CTTGTCATGG AATAGTTGGA TCTGATCCGT ATTTGGCTCC TGAAGTGTTG TTAAGCCCCA
ACTCGTATAA CCCTCAGCCT GTAGATCTCT GGTCAATTGC TATCATTTAC TGTTGTATGA
CTCTTAAGCG TTTCCCTTGG AAGATTCCCA ACGCTGAAAA GGACAACAGT TTCAAGCTCT
ACAATTTACC AGATGACAAC TGGCACGACT ACTACTTGTC AAATGAATGC CATAAGTTGT
TACTTCAACA GAGGAAGTTC AAGAACATGA TAGTGAGACT GAATAAGAAG AAGAAGATGT
TAGCAGACCA GGAAAGCGGA GAAAATATGG AACAATCTGA ATCCGTTTCA GGAGATGCTG
GGTCTCGCGT CTTGGAGGCT GAAGAAGCCC AGCCTTCGGA CGGAACTCCT GAGGCTGTTG
CAGAAGCAAT TGTTGTTGAT AACGCCATAG GAGACGCAGA TTCAGATAAG TTCTCAAACA
AAGAATCTCG GAAAGAAAAG AATCTTGATA ATGCAGAGGT CTTGACGGAA GAGCAGGTTC
AAGAAGTGCT TGCGCAGTTG AAAGAGATAG ATGACAAGCT TGAGGAGTAT GAAAAGAAAA
AGTCAGACAT GAAATCAGCA TTCGTAGAAC AACAGAACAA ATTGAAATCT TCTTCAGAAG
CTGGCGAAGG TTCCGATGTA GCTTCGGGTC CAGAGAAAGA ACTCTCGCCG GGACCAGGCT
CGTCGTCACT GGCTACCAAG AAGAAACCAC AACACAAGCA GATTCATGGA CCATATAGAT
TGATGAGATT ACTTCCCCAT GCCTCTCGTC CTATCGTACA CAAGATGCTC CAGGTAGACC
CAAGAAAGAG AGCAACCTTG GAGGAGATCA TGAATGATGA GTGGATCCGC GAGATAAAGT
GTTGCACGTT GAAGAAAGTA GCCCGTAGTG CTGATAATGT AGGAGTCAAC TTTGACGAAG
ACGAAGACGA CGTTTTAGTC AAAGGAGTTC CTCCACATGA CCATACTATT GTGTTGGAAA
GCCAGTGATA GTCACAAAAG GTCTGTCATA GTCATGTAGT TTAGTTCTTG GGATTTATTT
AGAATATTTA TTGTAAGGGT G
 
Protein sequence
MPDKHKLKLF RKAEDDLTPS TSNHSSVGSR KFLGFHIGRH ESGESLTSPT LTNSSDHQQT 
HYNNPATTIV SASGIHSPFA AGSGSGSGSL GSQPIVSPQP QVHASYLNKQ DSNHTFMKSN
SMVELKRFFK PSKKTTSNTR KHDYSNQLHS PPPMVSGKEI MGNALNSGHP SREPSSTSLA
TLINQTSSQL LHNASHHHSS GKDPFTDDNS PLVKKYGKMG KELGSGAGGS VRLITRPSDS
KTFAVKEFRP RRSTETLKDY TRKCTAEYCI GSTLKHPNII KTIDIIHENS RYFEIMEYAP
IDFFAVVMSG EMSRQEINCC LKQILEGVSY LHNLGLAHRD LKLDNCVITD EGILKIIDFG
SAVIFKYPYD QYGNNKDTIH PCHGIVGSDP YLAPEVLLSP NSYNPQPVDL WSIAIIYCCM
TLKRFPWKIP NAEKDNSFKL YNLPDDNWHD YYLSNECHKL LLQQRKFKNM IVRSNKKKKM
LADQESGENM EQSESVSGDA GSRAVAEAIV VDNAIGDADS DKFSNKESRK EKNLDNAEVL
TEEQVQEVLA QLKEIDDKLE EYEKKKSDMK SAFVEQQNKL KSSSEAGEGS DVASGPEKEL
SPGPGSSSSA TKKKPQHKQI HGPYRLMRLL PHASRPIVHK MLQVDPRKRA TLEEIMNDEW
IREIKCCTLK KVARSADNVG VNFDEDEDDV LVKGVPPHDH TIVLESQ