Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1322 |
Symbol | |
ID | 8543704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 1749741 |
End bp | 1752752 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646386038 |
Product | serine/threonine protein kinase |
Protein accession | YP_003265773 |
Protein GI | 262194564 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAGC GCGACGAGGA TCAAGAGAAC ACCCGCCATC CGGCCGATGC CGCGGATGGC GAGCGGATGC CGGCCGACTC GCCGAGGCCG GCGGGCCAGA GTTCGGACCA GGGCGACAGC GACAGCGACA GCGGCGACAG CGGCGACAGC GACGAGTTCG GCATTGGCGC GCTCAAGCTC GACGACGCAC TGTCCGAATC CGATAAGGAT CGCGTTCGCG CCAGGACTCT CGAACTCCCC GTGCCCGATC GCTGTATCGG CGGTCGCTAC GAACTCCGCG AACGCCTGGG CGGCGGGGGG ATGGGGACGG TGTACGCCGG GTACGACAGA CAACTCGAAC GCGCTGTGGC CATCAAGCGC CTGCACAAAC GGTTCGCGCA AGATTCGAGT GAGGCGAGCG AGCGCCTGAA ACGCGAAGCC ATCGCCATGG CGCAGTTCGC CAGCGAGCCC AACGTCGTGC AGGTATACGA CATCGTCGCC GACCACGACC AAGCATTCCT GATCATGGAG TTGATCTCCG GGACGACCCT GGAGAAGGTC CAGCGCAAAC ACTCACCCTC GCAGGCCGAG ATCATCGATA TCTACCTGCA AGCCGGGCGA GGACTCGCGG CGATTCACCG CGCTCACCTG GTGCACCGGG ACTTCAAGCC CGCCAACGTC ATGATCGCGG ACAACCCGGC CAGCGGTCAC CAGCGCGTTA TCGTATGCGA TCTGGGGCTG GCCATCACGC GAGCGCTGGC GACGTCTAGC TCGGACAGTG AGCCGCCGAG TGAGCCCCGA CCCCGAGCTC TGGGCGAGCA GTTCACCGCC ACCCGGGCGC TGGCCGGCAC GCCCGTGTAC ATGGCTCCCG AGCAGCTCCG CGGCGAACGA GACCTGGACG GGCGCTGTGA CCAGTTCGCC TTCTGCGTGG CTCTGTACGA GGCCCTGAGC GGTACGCGAC CATTCGCCGA GGCCGAGGCC GGCGCCAAGT CCGAGCCGCT GCTGGCCGCC ATTCCGGCAG GTCCGCCGCC CCTACCCAAG CGCGACGGCG GCGCGGTGCC CGTGCGGGTC GAGCAGGCCC TCCGCCGCGG CCTGGCATTC GAGCCCGGCG AGCGCTTTCC CGACATGGAC GCGCTCCTCG TCGCTCTGGC GCCGCCCGAG CGCTTTCCGT GGTCGGTGTG GTTGAGTCTG GGGCTGGCGC TGGCCCTCGG AGCCACGCTG CTGCTGCTCG GTCGCGACGC TGCGCCTAGC TGCGAGAGCC AGGCGCAGAG CAAAGCGGCC GGACTGGTCG ATGCCTCCGC CCTGCAGCGA AGCGAACAGC GCATCCCAGC GCAGCACAGG CACGCCTGGT TGGCCCTGCG AGATGTGACG ACCCAACGGG TGAACACCTG GCGTGAGGAG ATGAGCGCGT CATGCGACGC CGAGAGACGA AATGACGATG ACAATGCGGC GCCAATAACC GCACGCAAAC AAGCCTGCCT TCTCGAAAAC GCGGCCGTGT TGAGAACCGC CGGCAGCTAC CTGACGCAAG AGGAAAACGA GTCTGCGCCC ATGTTTGAGC TGGCGGACGC CCTCCGCCGG ATGCCGAGCT GCATCCATCT GCGCGAGGAA CCGAAGCTGG TTCCGCCAGC GGACGCCGCC AGCCAGTCGA TAATGGATGA ATTCCACGAG GTACTCGCAG AATCCGAGCT GCGCGAGTAC GAAGGACGCT ACGACAGCGC CGTGGAACTG GCCGGCGAAG CGTTGCGGCA GAGCGAGGCG ATGTCGTTTC CCTATAAAGA TGTACTCGCT GCCAAAGCGC GTTTTCGCCT CAGCCGAGCC CACGCGTACG CCGGAGACCA CAATGACGCG GCCGAGAACT TCGACCAGGC CGCGCGGGCA GCGGCCGGAT TCCAACTGGG CGCCGAATCG CTCGAAGGGG CATTGTTTCA CGCCAAGTAT CTGCTCGCCG ATCTCGAGAA GAGCACCCTG GCCTGGGAGC AGCTAGTACG GGCCGAACTC CTCCTGGGCT GGCTCGGCGT GGATTGTGAG GATGCAGATG ACGCCGAGAT GCCGATGGAC GCAAACTGGC GGATCTGGGC GTGCGCCGAG TATGACGAGG CCAATGGCCT CCTGGCGTCT CGTCTGGGCG AGCTCGAGCA AGCGATTGCC TATCACGAGC AGGCGCTGCA ATGGCGCGAG AGGCTCTCGC CGGCACCGCC AGCGCTCGAC GCATTTCTGC ACTCGAAGTC GCTCAACAAT CTCGCCAACG CCGAAGCCAA CCTGGCCAGG CAATGGATGG ACGAGGGCGA GGAAGACACC GCAACGCGCT ACTGGGACAG CGCAGCGGCG CACTATCAAG CGGCGCTGAG GCTGCGCGGT GAGGCGCTGG GAAACGACCA TCCCTTGGTC GACCGCATCC AACTCAACAT ACGTTTGTTG CAGGTCGCTC GAGGGCAGGA AATCGGCGAC GACCTGCTGC GCTTGGGCCT GCGGGTGCTG GCTCAAAACC TCGCGCGAAG CGAAGTGCAA CTGGAGAGCC TCCCCGGCTG GCTGGTGCTG GTCATCGATG CCGCACTGGG CCGCTATTAC GGGCCGTCAG ACCCGAAAGA TCGGGATACG GCGCATCTGC AATCCGCAGC GGAAGCCGCA GCGTTCATCC GATCCATCCA CGCTCAGATG CCGCCTTCGT TCATGCAGCA CAGGCGGCGG GTCAGCGAGT ATCTGGCGCT CGCTGGCGTC AGCGAGGCCA AAGGGCAGTG GTCCGAGGCG CTGGCTGAGC TCGAGCACGC GTTCGAGATC CTCGACGAGC ACGACGCCGC GACCACGTGT GATCAGTATT TGCAAAATGT TTATCGTTCG ACGCTGTTTT CCGCGGCCTC CATCGTGTGC GCCAAGCCCG AATCCGAGCC CGAGCAGGCT CGCAGCTACC TCAAGCGGGC GTTCGCACCG CTCGCAGACT GCGCCGCAGC GGACGCCGTC ATCGATGCAA TCTACGAGCA AACCCTGATG GAACCGCAGG ACGGCGGAAT TCCTGCGAAC TGCCATCCCT GA
|
Protein sequence | MDERDEDQEN TRHPADAADG ERMPADSPRP AGQSSDQGDS DSDSGDSGDS DEFGIGALKL DDALSESDKD RVRARTLELP VPDRCIGGRY ELRERLGGGG MGTVYAGYDR QLERAVAIKR LHKRFAQDSS EASERLKREA IAMAQFASEP NVVQVYDIVA DHDQAFLIME LISGTTLEKV QRKHSPSQAE IIDIYLQAGR GLAAIHRAHL VHRDFKPANV MIADNPASGH QRVIVCDLGL AITRALATSS SDSEPPSEPR PRALGEQFTA TRALAGTPVY MAPEQLRGER DLDGRCDQFA FCVALYEALS GTRPFAEAEA GAKSEPLLAA IPAGPPPLPK RDGGAVPVRV EQALRRGLAF EPGERFPDMD ALLVALAPPE RFPWSVWLSL GLALALGATL LLLGRDAAPS CESQAQSKAA GLVDASALQR SEQRIPAQHR HAWLALRDVT TQRVNTWREE MSASCDAERR NDDDNAAPIT ARKQACLLEN AAVLRTAGSY LTQEENESAP MFELADALRR MPSCIHLREE PKLVPPADAA SQSIMDEFHE VLAESELREY EGRYDSAVEL AGEALRQSEA MSFPYKDVLA AKARFRLSRA HAYAGDHNDA AENFDQAARA AAGFQLGAES LEGALFHAKY LLADLEKSTL AWEQLVRAEL LLGWLGVDCE DADDAEMPMD ANWRIWACAE YDEANGLLAS RLGELEQAIA YHEQALQWRE RLSPAPPALD AFLHSKSLNN LANAEANLAR QWMDEGEEDT ATRYWDSAAA HYQAALRLRG EALGNDHPLV DRIQLNIRLL QVARGQEIGD DLLRLGLRVL AQNLARSEVQ LESLPGWLVL VIDAALGRYY GPSDPKDRDT AHLQSAAEAA AFIRSIHAQM PPSFMQHRRR VSEYLALAGV SEAKGQWSEA LAELEHAFEI LDEHDAATTC DQYLQNVYRS TLFSAASIVC AKPESEPEQA RSYLKRAFAP LADCAAADAV IDAIYEQTLM EPQDGGIPAN CHP
|
| |