Gene Hmuk_3250 details

Gene Information

Locus tag: Hmuk_3250
Symbol:
ID: 8409328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013201
Strand:
Start bp: 41675
End bp: 44500
Gene Length: 2826 bp
Protein Length: 941 aa
Translation table: 11
GC content: 61%
IMG OID: 645018187
Product: serine/threonine protein kinase
Protein accession: YP_003175708
Protein GI: 257372934
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
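
The start, end, gene length, and protein length fields above can be cross-checked with a short sketch. It assumes inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(41675, 44500)
print(length, protein_length(length))
```

For this record the two values come out to 2826 bp and 941 aa, matching the listed Gene Length and Protein Length.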


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.308767
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.185977
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGACACCG ACGAGGATTC GACTCGTGAG ATCGACAGTC TGCGGCGTGA CGCCGATGGC 
GCTCCGGCCC GGGTAGATCT CGACGCGGCC GGTCGGTTTC TCGACAGCAA CCAGCCCAAG
AACCGCCAGA CAGCTACCTA CGTGTTCAAG CAGGTCGCCA AATCGGACGC ACGGCGAGTG
AGGGGCTATC TCGACACTCT CGAACCGCGT CTCGAGGATT CGAACAGCAA GACGCGAAAC
TTCACGCTGT TCGCGTTCAG CGAGGTCGTC GAAGTCGACC CCCGGCGAGT GAGTGACTAT
CTGGACGCCA TCGAACCACG CCTCAACGAC GAGAGCGAAA ATACGCGAAA TCTCGCGACC
TACGTCTTCA AAGAGGTCGC CCAGGAGGAC CCCCGGCAGG TAATCGACTC TCTCGACGCG
GTCGAACCGC GACTCGACGA CGCCGAGGAA TCGACACGAA ACTTCGCGGT CTCCGTGTTC
AGCGAAGTGG TCGAAGTCGA TCCCCGTCGA GTGAGCGACC ATCTGGACGC GATCGAACCG
CGGCTCGACG ACAAGAGCGA AAATACGCGA AGCCTCGCGA CCTACGTCTT CGGAGAGGTC
TCTAGGGAAG AACCCCGACA GGTGATCGAC TATCTGGACG CGATCGAGCC ACGGCTCGAC
GATGCCAAAC AATCCACTCG GAACTTCGCG ACTACCGCGT TCAGCGAAGT CGTCGAAGTC
GCCCCCCGGC CAGTGAGTCG TTCTCTGGAC GCGCTCGAAC CACGACTCGA CGACGAGAGT
GAGAACACGA GACACCTCGC GGCCTACGTT TTCAAGGAAG TCGCAAAGGA GAAGCCACGC
CACACGAGCG GCTATCTCGA CGCGCTCGAA GACTGCCTCG ACGACCGGGG AGAGAAGACG
AGGAACTTCG CAGCGTACGT CTTCAAGGAA GTCGCCAAAA AAGAGCCACG CCTCGCGTAT
GACTATCTCG ACGCGCTCGA AGACCGCCTC GACGATCAGA GCCAGGAAAC GAGAAACTTC
GCAACCTACG TCTTCAAGGA AGTCTCCAAG GAAGATCCAC GGCTGACGAT TGACTATCTC
GACGCGCTCG AAGATCGCCT CGATGACGAA AGCCAAGGAA CGAGAAACTT CGCGACCTAC
GTCTTCAAGC AAGTCGCTAA CGAGAAACCG AACCAGGTTC TCGACTCTCT GGACGCGCTC
GAAGACCGCC TCGACGATGC GAGTGGGAAC ACGCGGAACT TCGCGACCAC CGCGTTCGGC
GAAGCCGTCG AGGTTGCCCC CCAGCGAGTG AGTGGCCATA TAGACGCCCT CGAACCGCGA
CTCGACGACG AGAGCGAGAA CACGCGAAAC CTCGCGGCCT ACGTCTTCGG AGAGGTCTCC
AAGGAAGAGC CCCGGCAGAC ACTCGACTAC TTAGACGCGC TCGTACCACG GCTCGACGAC
CCCAAACGAC CGACGCGAAA CTTCGCGACC ACTCCGTTCA GCGAGGGCGT CGAGGTCTCC
CCTCGGCGAG TGAGTGGCTA TCTGGACGCA CTCGAACCGC GACTCGACGA CGAGAGCGAG
AACACGCGAA ACCTTGCGAC CTACGTGTTC ACGGAAGTGG CCAAAGAAGA GCCCCGACAG
GTGGTCGACC ATCTGGACGC GATCGAATCG CGACTCGACG ATCCGAAGCG ATCGACACGA
AAGTTCGCCG CCTCCACGTT CGCAACCGTA TCCGAAGAAC ACCCGTCACG GGTCGCCGAC
TACCTTCCGT CGATCGTCCC ACTACTGGCA GACGGGAACG ACACCATCGA GGACTACGCA
GGGTCCTGTT TCGCCCAGGT CGCCAGACAC GATCCGGAGC CGTTGCGAGA CCTGCTCCGC
TCTGGCGAGT CGAAGCGGTT CGGAGCGGCG ACGGTCGAGC GAGTAGAGGC AATTCTAGCT
GCAAACACGT CAGAGGACGA CGATGACGAC AGCTCAGCGT CGGTCCACGT CGACAAACGG
CGACCGGCAG ACATTCCGGG TGCGCCAGAA GCGGAGATCG CCCTCGACGA GATCGAAGAG
GAACGGACGA TCGGACAGGG AGGCAACGCC GACGTGGTGG AGGCGTCGGT TTCGAGCGGG
ACGGAAGACA CGTCGATCGC GATCAAGAGA CCACGGATGT CGGGGACGAT TCACGGCGAG
ACGATCGAGC GGTTGCTCGA CGAAGCCGAG CGGTGGGACA GGCTGGACGG ACACGACCAC
ATCGTCGGCG TCGTCGACTG GGGGGCGAGC CCGGTCCCGT GGATCGGAAT GGAGTTCATG
GACGGCGGAC ACCTCGGCGA ACGGGCTGGA GAGATGAGTC TGGCCCAGAA GCTCTGGACC
GCGCTCGCAG TCACGAAGGC CGTTCGCCAC GCACACAAGC ACGGCGTCGC TCACCTCGAT
CTCAAGCCCG AGAACGTCCT CTTTCGGACG GTCGAAGACG CCTGGGACGT GCCGAAAGTA
GCGGACTGGG GACTCTCGAA GCAGTTGCTC GAACACTCCC AGAGCGTACA GGGGCTCACG
CCACAGTACG CCGCACCGGA GCAGTTCGAC GACGACTACG GGACCAGCGA TCACAGCACG
GACATCTATC AACTCGGCGC GGTCTTCTAC GAGCTGTTCA CCGGACAGCC ACCGTTCGAC
AGTAATCCAT CGAAGGCGAT GTACCAGATC CTGGAGAGTG AGCCGACGCC GCCAAGCGAG
ATCGCAGCGG TCCCACCGCG CGTAGACGAG ATCCTGTTGA CGGCACTGTC CAAATCGAAA
GCCCAGCGGT ACGACGACAT CCTGTACCTC CGGGACGCTC TACAGGAGGC GTGTGAAGAC
CTGTAG
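
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as shown in this record.
first_row = "ATGGACACCG ACGAGGATTC GACTCGTGAG ATCGACAGTC TGCGGCGTGA CGCCGATGGC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence through the same two functions gives a whole-gene value that can be compared with the reported 61%.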
 
Protein sequence
MDTDEDSTRE IDSLRRDADG APARVDLDAA GRFLDSNQPK NRQTATYVFK QVAKSDARRV 
RGYLDTLEPR LEDSNSKTRN FTLFAFSEVV EVDPRRVSDY LDAIEPRLND ESENTRNLAT
YVFKEVAQED PRQVIDSLDA VEPRLDDAEE STRNFAVSVF SEVVEVDPRR VSDHLDAIEP
RLDDKSENTR SLATYVFGEV SREEPRQVID YLDAIEPRLD DAKQSTRNFA TTAFSEVVEV
APRPVSRSLD ALEPRLDDES ENTRHLAAYV FKEVAKEKPR HTSGYLDALE DCLDDRGEKT
RNFAAYVFKE VAKKEPRLAY DYLDALEDRL DDQSQETRNF ATYVFKEVSK EDPRLTIDYL
DALEDRLDDE SQGTRNFATY VFKQVANEKP NQVLDSLDAL EDRLDDASGN TRNFATTAFG
EAVEVAPQRV SGHIDALEPR LDDESENTRN LAAYVFGEVS KEEPRQTLDY LDALVPRLDD
PKRPTRNFAT TPFSEGVEVS PRRVSGYLDA LEPRLDDESE NTRNLATYVF TEVAKEEPRQ
VVDHLDAIES RLDDPKRSTR KFAASTFATV SEEHPSRVAD YLPSIVPLLA DGNDTIEDYA
GSCFAQVARH DPEPLRDLLR SGESKRFGAA TVERVEAILA ANTSEDDDDD SSASVHVDKR
RPADIPGAPE AEIALDEIEE ERTIGQGGNA DVVEASVSSG TEDTSIAIKR PRMSGTIHGE
TIERLLDEAE RWDRLDGHDH IVGVVDWGAS PVPWIGMEFM DGGHLGERAG EMSLAQKLWT
ALAVTKAVRH AHKHGVAHLD LKPENVLFRT VEDAWDVPKV ADWGLSKQLL EHSQSVQGLT
PQYAAPEQFD DDYGTSDHST DIYQLGAVFY ELFTGQPPFD SNPSKAMYQI LESEPTPPSE
IAAVPPRVDE ILLTALSKSK AQRYDDILYL RDALQEACED L