Gene Hoch_1322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1322 
Symbol 
ID8543704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1749741 
End bp1752752 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content66% 
IMG OID646386038 
Productserine/threonine protein kinase 
Protein accessionYP_003265773 
Protein GI262194564 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGC GCGACGAGGA TCAAGAGAAC ACCCGCCATC CGGCCGATGC CGCGGATGGC 
GAGCGGATGC CGGCCGACTC GCCGAGGCCG GCGGGCCAGA GTTCGGACCA GGGCGACAGC
GACAGCGACA GCGGCGACAG CGGCGACAGC GACGAGTTCG GCATTGGCGC GCTCAAGCTC
GACGACGCAC TGTCCGAATC CGATAAGGAT CGCGTTCGCG CCAGGACTCT CGAACTCCCC
GTGCCCGATC GCTGTATCGG CGGTCGCTAC GAACTCCGCG AACGCCTGGG CGGCGGGGGG
ATGGGGACGG TGTACGCCGG GTACGACAGA CAACTCGAAC GCGCTGTGGC CATCAAGCGC
CTGCACAAAC GGTTCGCGCA AGATTCGAGT GAGGCGAGCG AGCGCCTGAA ACGCGAAGCC
ATCGCCATGG CGCAGTTCGC CAGCGAGCCC AACGTCGTGC AGGTATACGA CATCGTCGCC
GACCACGACC AAGCATTCCT GATCATGGAG TTGATCTCCG GGACGACCCT GGAGAAGGTC
CAGCGCAAAC ACTCACCCTC GCAGGCCGAG ATCATCGATA TCTACCTGCA AGCCGGGCGA
GGACTCGCGG CGATTCACCG CGCTCACCTG GTGCACCGGG ACTTCAAGCC CGCCAACGTC
ATGATCGCGG ACAACCCGGC CAGCGGTCAC CAGCGCGTTA TCGTATGCGA TCTGGGGCTG
GCCATCACGC GAGCGCTGGC GACGTCTAGC TCGGACAGTG AGCCGCCGAG TGAGCCCCGA
CCCCGAGCTC TGGGCGAGCA GTTCACCGCC ACCCGGGCGC TGGCCGGCAC GCCCGTGTAC
ATGGCTCCCG AGCAGCTCCG CGGCGAACGA GACCTGGACG GGCGCTGTGA CCAGTTCGCC
TTCTGCGTGG CTCTGTACGA GGCCCTGAGC GGTACGCGAC CATTCGCCGA GGCCGAGGCC
GGCGCCAAGT CCGAGCCGCT GCTGGCCGCC ATTCCGGCAG GTCCGCCGCC CCTACCCAAG
CGCGACGGCG GCGCGGTGCC CGTGCGGGTC GAGCAGGCCC TCCGCCGCGG CCTGGCATTC
GAGCCCGGCG AGCGCTTTCC CGACATGGAC GCGCTCCTCG TCGCTCTGGC GCCGCCCGAG
CGCTTTCCGT GGTCGGTGTG GTTGAGTCTG GGGCTGGCGC TGGCCCTCGG AGCCACGCTG
CTGCTGCTCG GTCGCGACGC TGCGCCTAGC TGCGAGAGCC AGGCGCAGAG CAAAGCGGCC
GGACTGGTCG ATGCCTCCGC CCTGCAGCGA AGCGAACAGC GCATCCCAGC GCAGCACAGG
CACGCCTGGT TGGCCCTGCG AGATGTGACG ACCCAACGGG TGAACACCTG GCGTGAGGAG
ATGAGCGCGT CATGCGACGC CGAGAGACGA AATGACGATG ACAATGCGGC GCCAATAACC
GCACGCAAAC AAGCCTGCCT TCTCGAAAAC GCGGCCGTGT TGAGAACCGC CGGCAGCTAC
CTGACGCAAG AGGAAAACGA GTCTGCGCCC ATGTTTGAGC TGGCGGACGC CCTCCGCCGG
ATGCCGAGCT GCATCCATCT GCGCGAGGAA CCGAAGCTGG TTCCGCCAGC GGACGCCGCC
AGCCAGTCGA TAATGGATGA ATTCCACGAG GTACTCGCAG AATCCGAGCT GCGCGAGTAC
GAAGGACGCT ACGACAGCGC CGTGGAACTG GCCGGCGAAG CGTTGCGGCA GAGCGAGGCG
ATGTCGTTTC CCTATAAAGA TGTACTCGCT GCCAAAGCGC GTTTTCGCCT CAGCCGAGCC
CACGCGTACG CCGGAGACCA CAATGACGCG GCCGAGAACT TCGACCAGGC CGCGCGGGCA
GCGGCCGGAT TCCAACTGGG CGCCGAATCG CTCGAAGGGG CATTGTTTCA CGCCAAGTAT
CTGCTCGCCG ATCTCGAGAA GAGCACCCTG GCCTGGGAGC AGCTAGTACG GGCCGAACTC
CTCCTGGGCT GGCTCGGCGT GGATTGTGAG GATGCAGATG ACGCCGAGAT GCCGATGGAC
GCAAACTGGC GGATCTGGGC GTGCGCCGAG TATGACGAGG CCAATGGCCT CCTGGCGTCT
CGTCTGGGCG AGCTCGAGCA AGCGATTGCC TATCACGAGC AGGCGCTGCA ATGGCGCGAG
AGGCTCTCGC CGGCACCGCC AGCGCTCGAC GCATTTCTGC ACTCGAAGTC GCTCAACAAT
CTCGCCAACG CCGAAGCCAA CCTGGCCAGG CAATGGATGG ACGAGGGCGA GGAAGACACC
GCAACGCGCT ACTGGGACAG CGCAGCGGCG CACTATCAAG CGGCGCTGAG GCTGCGCGGT
GAGGCGCTGG GAAACGACCA TCCCTTGGTC GACCGCATCC AACTCAACAT ACGTTTGTTG
CAGGTCGCTC GAGGGCAGGA AATCGGCGAC GACCTGCTGC GCTTGGGCCT GCGGGTGCTG
GCTCAAAACC TCGCGCGAAG CGAAGTGCAA CTGGAGAGCC TCCCCGGCTG GCTGGTGCTG
GTCATCGATG CCGCACTGGG CCGCTATTAC GGGCCGTCAG ACCCGAAAGA TCGGGATACG
GCGCATCTGC AATCCGCAGC GGAAGCCGCA GCGTTCATCC GATCCATCCA CGCTCAGATG
CCGCCTTCGT TCATGCAGCA CAGGCGGCGG GTCAGCGAGT ATCTGGCGCT CGCTGGCGTC
AGCGAGGCCA AAGGGCAGTG GTCCGAGGCG CTGGCTGAGC TCGAGCACGC GTTCGAGATC
CTCGACGAGC ACGACGCCGC GACCACGTGT GATCAGTATT TGCAAAATGT TTATCGTTCG
ACGCTGTTTT CCGCGGCCTC CATCGTGTGC GCCAAGCCCG AATCCGAGCC CGAGCAGGCT
CGCAGCTACC TCAAGCGGGC GTTCGCACCG CTCGCAGACT GCGCCGCAGC GGACGCCGTC
ATCGATGCAA TCTACGAGCA AACCCTGATG GAACCGCAGG ACGGCGGAAT TCCTGCGAAC
TGCCATCCCT GA
 
Protein sequence
MDERDEDQEN TRHPADAADG ERMPADSPRP AGQSSDQGDS DSDSGDSGDS DEFGIGALKL 
DDALSESDKD RVRARTLELP VPDRCIGGRY ELRERLGGGG MGTVYAGYDR QLERAVAIKR
LHKRFAQDSS EASERLKREA IAMAQFASEP NVVQVYDIVA DHDQAFLIME LISGTTLEKV
QRKHSPSQAE IIDIYLQAGR GLAAIHRAHL VHRDFKPANV MIADNPASGH QRVIVCDLGL
AITRALATSS SDSEPPSEPR PRALGEQFTA TRALAGTPVY MAPEQLRGER DLDGRCDQFA
FCVALYEALS GTRPFAEAEA GAKSEPLLAA IPAGPPPLPK RDGGAVPVRV EQALRRGLAF
EPGERFPDMD ALLVALAPPE RFPWSVWLSL GLALALGATL LLLGRDAAPS CESQAQSKAA
GLVDASALQR SEQRIPAQHR HAWLALRDVT TQRVNTWREE MSASCDAERR NDDDNAAPIT
ARKQACLLEN AAVLRTAGSY LTQEENESAP MFELADALRR MPSCIHLREE PKLVPPADAA
SQSIMDEFHE VLAESELREY EGRYDSAVEL AGEALRQSEA MSFPYKDVLA AKARFRLSRA
HAYAGDHNDA AENFDQAARA AAGFQLGAES LEGALFHAKY LLADLEKSTL AWEQLVRAEL
LLGWLGVDCE DADDAEMPMD ANWRIWACAE YDEANGLLAS RLGELEQAIA YHEQALQWRE
RLSPAPPALD AFLHSKSLNN LANAEANLAR QWMDEGEEDT ATRYWDSAAA HYQAALRLRG
EALGNDHPLV DRIQLNIRLL QVARGQEIGD DLLRLGLRVL AQNLARSEVQ LESLPGWLVL
VIDAALGRYY GPSDPKDRDT AHLQSAAEAA AFIRSIHAQM PPSFMQHRRR VSEYLALAGV
SEAKGQWSEA LAELEHAFEI LDEHDAATTC DQYLQNVYRS TLFSAASIVC AKPESEPEQA
RSYLKRAFAP LADCAAADAV IDAIYEQTLM EPQDGGIPAN CHP