Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_82367 |
Symbol | HRK1 |
ID | 4837085 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 670280 |
End bp | 673180 |
Gene Length | 2901 bp |
Protein Length | 707 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388400 |
Product | Hygromycin Resistance Kinase |
Protein accession | XP_001382892 |
Protein GI | 150864174 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.263404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAGGTTGCAC CTGACGTTCG AAAAACGACC TAGCTGCTGC CCTAGTGCTT TTCAGCCGTT TCTGGTAGCC AATAGGAGCT GAGAGTCAAA TTGACAACTA ACTCATTGAA AGTTCGAAGT CACAAGAGAT ATCCACCAGA GAAACATCTT TTATACTGGT ATTGGTTATT CGAAAGAGAG ACTTCATAAT AACTTACTGA CACTGGCTCT ATAGTTAGCC ATCAACTCAT ACAGTAGCTT TACTTTAGAA TACCACACTA TAGTTTACTA TAGCGTTCCC ATTATAGCTT GTCATCCTTA CTAATTTCCC TGTTGTTTGT CGTTTATCTG ACCATACCAT TATCATTATT GTTAGCTATT GTCACATCAA TAGTGATTCA TTTCCTACAA TTGTCACCTC AACACATAGT AATCATTCAA TTGTTGGTTG TTTGATTCGA ACTTCATAAT CTACCCAGTA TCATTCAAGT GTGGTTCTAG TATCGCTTAG AACACTCTTA TCAATTCGTA TCATAGTATC ATTTGTCAGT ATCAACTACA ATACTGCTAT CCATTAGTTT CTTCCACTAT TTATTAGTAT CATCTACCTC ATTTACGTCA TCTAATATCT ATAGTCTACT TCCATCATGC CTGACAAACA CAAACTCAAG CTCTTCCGCA AGGCTGAAGA CGACTTGACA CCTTCAACCT CCAACCACTC CTCTGTTGGA AGTAGAAAGT TTCTTGGTTT CCACATTGGA AGACACGAGT CTGGTGAGTC CCTCACTTCA CCAACTCTCA CGAACTCTTC AGACCACCAA CAGACTCACT ACAACAACCC TGGTGTTTCT GCAAGTGTAG GCACTGGTAC AACAGCTACG ACTATTGTAA GTGCCAGTGG AATCCATTCT CCTTTTGCAG CGGGCTCTGG CTCTGGCTCT GGCTCGCTCG GATCTCAACC AATCGTTTCT CCACAGCCGC AAGTTCATGC CTCATATCTC AACAAACAGG ACTCGAATCA TACGTTCATG AAATCAAACT CAATGGTTGA ATTGAAACGT TTCTTCAAGC CAAGTAAGAA AACAACTTCT AATACTCGTA AGCATGATTA CTCTAATCAA CTCCACTCAC CACCTCCAAT GGTTAGTGGT AAGGAAATTA TGGGAAATGC TCTCAACAGC GGCCACCCAA GTCGTGAACC CTCTTCTACA AGTTTAGCTA CGTTGATTAA TCAGACGTCG TCGCAGTTGC TTCATAACGC TTCTCACCAC CACTCTTCTG GAAAGGATCC CTTCACAGAT GACAACTCGC CGCTTGTGAA AAAGTACGGT AAAATGGGCA AAGAGTTGGG AAGCGGTGCT GGTGGTTCTG TAAGGTTGAT CACGAGACCA TCTGATTCCA AAACTTTTGC TGTTAAAGAG TTCAGACCCC GTAGAAGTAC TGAAACACTT AAGGATTACA CCAGAAAGTG TACTGCCGAG TACTGTATCG GTTCTACATT GAAGCATCCC AACATTATCA AGACAATTGA CATCATTCAC GAGAACAGCA GGTACTTTGA AATCATGGAA TATGCTCCAA TTGACTTCTT TGCTGTTGTG ATGAGTGGAG AAATGTCACG ACAAGAGATC AATTGTTGTC TCAAACAGAT ATTGGAAGGA GTCAGCTATT TGCATAACTT GGGATTGGCC CATCGTGATT TGAAGTTGGA CAATTGTGTG ATAACCGATG AAGGTATTCT CAAGATTATT GACTTTGGTT CTGCTGTTAT ATTCAAGTAC CCATACGATC AGTATGGCAA CAACAAGGAT ACGATTCATC CTTGTCATGG AATAGTTGGA TCTGATCCGT ATTTGGCTCC TGAAGTGTTG TTAAGCCCCA ACTCGTATAA CCCTCAGCCT GTAGATCTCT GGTCAATTGC TATCATTTAC TGTTGTATGA CTCTTAAGCG TTTCCCTTGG AAGATTCCCA ACGCTGAAAA GGACAACAGT TTCAAGCTCT ACAATTTACC AGATGACAAC TGGCACGACT ACTACTTGTC AAATGAATGC CATAAGTTGT TACTTCAACA GAGGAAGTTC AAGAACATGA TAGTGAGACT GAATAAGAAG AAGAAGATGT TAGCAGACCA GGAAAGCGGA GAAAATATGG AACAATCTGA ATCCGTTTCA GGAGATGCTG GGTCTCGCGT CTTGGAGGCT GAAGAAGCCC AGCCTTCGGA CGGAACTCCT GAGGCTGTTG CAGAAGCAAT TGTTGTTGAT AACGCCATAG GAGACGCAGA TTCAGATAAG TTCTCAAACA AAGAATCTCG GAAAGAAAAG AATCTTGATA ATGCAGAGGT CTTGACGGAA GAGCAGGTTC AAGAAGTGCT TGCGCAGTTG AAAGAGATAG ATGACAAGCT TGAGGAGTAT GAAAAGAAAA AGTCAGACAT GAAATCAGCA TTCGTAGAAC AACAGAACAA ATTGAAATCT TCTTCAGAAG CTGGCGAAGG TTCCGATGTA GCTTCGGGTC CAGAGAAAGA ACTCTCGCCG GGACCAGGCT CGTCGTCACT GGCTACCAAG AAGAAACCAC AACACAAGCA GATTCATGGA CCATATAGAT TGATGAGATT ACTTCCCCAT GCCTCTCGTC CTATCGTACA CAAGATGCTC CAGGTAGACC CAAGAAAGAG AGCAACCTTG GAGGAGATCA TGAATGATGA GTGGATCCGC GAGATAAAGT GTTGCACGTT GAAGAAAGTA GCCCGTAGTG CTGATAATGT AGGAGTCAAC TTTGACGAAG ACGAAGACGA CGTTTTAGTC AAAGGAGTTC CTCCACATGA CCATACTATT GTGTTGGAAA GCCAGTGATA GTCACAAAAG GTCTGTCATA GTCATGTAGT TTAGTTCTTG GGATTTATTT AGAATATTTA TTGTAAGGGT G
|
Protein sequence | MPDKHKLKLF RKAEDDLTPS TSNHSSVGSR KFLGFHIGRH ESGESLTSPT LTNSSDHQQT HYNNPATTIV SASGIHSPFA AGSGSGSGSL GSQPIVSPQP QVHASYLNKQ DSNHTFMKSN SMVELKRFFK PSKKTTSNTR KHDYSNQLHS PPPMVSGKEI MGNALNSGHP SREPSSTSLA TLINQTSSQL LHNASHHHSS GKDPFTDDNS PLVKKYGKMG KELGSGAGGS VRLITRPSDS KTFAVKEFRP RRSTETLKDY TRKCTAEYCI GSTLKHPNII KTIDIIHENS RYFEIMEYAP IDFFAVVMSG EMSRQEINCC LKQILEGVSY LHNLGLAHRD LKLDNCVITD EGILKIIDFG SAVIFKYPYD QYGNNKDTIH PCHGIVGSDP YLAPEVLLSP NSYNPQPVDL WSIAIIYCCM TLKRFPWKIP NAEKDNSFKL YNLPDDNWHD YYLSNECHKL LLQQRKFKNM IVRSNKKKKM LADQESGENM EQSESVSGDA GSRAVAEAIV VDNAIGDADS DKFSNKESRK EKNLDNAEVL TEEQVQEVLA QLKEIDDKLE EYEKKKSDMK SAFVEQQNKL KSSSEAGEGS DVASGPEKEL SPGPGSSSSA TKKKPQHKQI HGPYRLMRLL PHASRPIVHK MLQVDPRKRA TLEEIMNDEW IREIKCCTLK KVARSADNVG VNFDEDEDDV LVKGVPPHDH TIVLESQ
|
| |