Gene Hoch_2500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2500 
Symbol 
ID8544887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3440228 
End bp3443584 
Gene Length3357 bp 
Protein Length1118 aa 
Translation table11 
GC content75% 
IMG OID646387200 
Productserine/threonine protein kinase 
Protein accessionYP_003266929 
Protein GI262195720 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATG CCCATAGTGA TTTGGCGGGA TTGGCCCAAC TTCATCGGGT CGAGAGTGAG 
GACGAGGTGG TCGCTCTGTG GCGCCGCGGT ATGGCCACGC TGTCGTCGCT GGCCTCGGAA
CAGCACCCGG TGCCGCTCGA GGGCTTCAAC CCCGCGGCGC TGCTGTCGAC CGCGCGCATC
GCGCTCACTC GCGGCTTTAT CGACGATCTC GGCTGGCTGT CGCCGCCTGC GGCCGCGGTC
GGGGTCTTTG CCTTGGCCTC GGCGCTGCCG CGCAGCGACG AGAAGCGCGA GCTCGGCCGG
CGCGTGTTGC AGTCCCTGCA CCAGAGCGAC GCCGAGACCT TCATCGCCCT GGCCACCGCC
CTGGCCCTGG GCTCGAGCCG GGCCCTGCGC GGCGCCTACG TGCGCGCCCG GGTCGCGCTG
TCGCTGGATC TGCCGCTCGG CGTCGGCATC CGCGCCGACG CGCTGGCGCT GGCGCTGCTG
TCGCGCCCCG ACAGCGAGCA GGCCTGGCTC GGACAGCCGT CCATGGGCGC GCTGACCTCG
CGCCGGCTGG CCTCGCGGCT GCTCGAGCGG GCCGCGCGCG AGGCCGCGCG CCGCTACCAG
GCCGGCGACG ACACCGGCGT GCGGGTGTTC GAGATGCCCT CGGTGCGCAC CGCCATGGAC
CGGCTGCTGG GCGATCGCGA GTCGCTGGTG TGGCGCCACG CGGCCGCGGC GCGCGGCCTG
CTGTCGAACG CCATCCCGGC CTGGGGCGAG GCCATCGAGG CCGGCCTCGG CCCGGAGCTG
TCGCCGACCG AGTGGCGGCG CGCGGCGACC TCGCTGGCGG CCCGCATCGC GCTCGACCCC
GAGGCCACCC TGCGCCGCTG CCGGGCGGTG TGCGAGGGCG AGCTGGCGCG CGCCGACCGC
GGCGTGGTCG CGGCCATGGT GCACGGCATC TCGTGCGCGG CCGACAGCGA TCCGGCGTCC
GCCGAGGCGC TGCTCGAGGT TCTCGTCCGC GAGGTCGATT ACCCCACCGC CGAGGCCCTC
GCCGACCTGC GCCGCGAGCG CGTGGGCAGT GATTTCGGCG CCGACGCGGT CGAGCGCGTG
CGCACCTGGC TGGCGACCTC GGCCGACGCC GACGCCCTGG GCGTGGCCCT GATGCGCGAG
CTGGCGCCCA GTGAGGAGCG CGAGCGCCGC ATCCTGCACG ACCATCTGGC CGCGGCGCTG
GCCGAGTTCG CGGCCGGTCG CGACGCCCGT CCGGACACCG AGCGCGCGCT CGAGGCCGCC
CACGGCGCCC TGGTTCGGCT CGAGCGCATC ACCACCGGCG ACGCCGCCGA GCGCGACGCG
TCCCAGCGCC GGGCCGCCTT CCTGGCGCTG CGCGAGCTCG ACCGCGGCCT GCTCGAGACC
TCGACCCTGG CCGACCTGCT GCTGCTGTAC GCGGCCGAGA GCCGGCCCAT GGCCGAGCGC
CTCATCGAGC GCCTGGCGCG CTGGCTGCTG GCGCACGAGA CCCGGCCGCT GAGCACCGAG
GCGCTTGAGG ACGTGCAGCG TCACTTCACC TGGCGCATGC ACCGGCTGCG CGCGCTGCTG
CACCTGGTCG ACATCGACGT GCAGTACGGC GACGGCCAGC GCGAGGACCT GTACGAGTGG
CGCCTGCGCG CGGTCAACGT GCTGCTGGCG CGCGCCCTCG GCGACGTGCC CTCGCCGCTG
CGCCGGGCGC TGTGCGCCAC CCTGGCGCGC GCCTGCGACG CCGTGGTGCG CGAGGAGCTG
TGCGAGCTGT CCGACGTGTT CGTGGCCGTG ACCCTGAGCC TGAACAGCGA GACCGACCTC
ACCGTGCTGG CCGAGGCCAG CATGATGCCG AGCTTCAAAC GCGCGATCCA CGCTTATGTC
GACGCGCTGG ACGAAATCGA GCGCGCGCGC GAGGACGTCG CCGAGCATGC CGGCGCGGCG
GGCGGACCCG GTGGCTGTCT CGCGGCCCTG CACGCGGTGG TGGCCGCGCT GCCGGCGGCG
GCGTCGCCGC GGGTCGAGGC CCTGCGCGGC GCGCTGGCGC AGGTGGCGCG CGCGCTCAGC
GGCATCGCGG CGGCGCGCGG CCTCAGCGAG CTGTCGAGCG AGGCCCCGGG CTCGCCGCTG
GGCGCCTTGC TCATCGGCGT CACCCGCATG GCGCGGCTGG TGGTGGGCGC GTGCCGCCAG
CTCGGCTATC CGGCCAACGC CTCGGCCTCG GCCCTGGGCT CGGCCCTGCG CGGCCTCGAG
GCCGAGATCG AGCACGCCCT GCGCGGCGAC CGCGCCAACC TGCGCAGCGC CCTGATCGAC
GCCGCGCGCA CCATGCGCGC CGAGCTCGCG CCGCTCATCG CCGACGTCGC CGCGCTGGTG
TTCGAGCACC TGGACCGGCT GCCGGCCACG GCCCCGGCCG GCGCCGCGAG CGGCGAGGGC
CGCACCGAGA CCCGGGGCGA TAGCCCGAGC GTACCGCTGG CGCCGTGGCT GCCGCCCGAT
CGCGTGCTCG GCGGCTTCTA CGTGCTGCGC GCCATCGACC GCGGCGCCGT GGGCTCGGTC
TTCGTCGCCT GCCGCGTCGC CGACCGGCAG TCCGAAACTC CTGATCTGTT CGCGCTCAAG
GTGCCCGAGT ACGGGGGCGA CGCCGCCCAC ACCCTGAGCG AGGCCGAGTT CATGGCCCTG
TTCCGCGAGG AGGCGCAGGC GCTGCTGTCG ATGCCGGCGC ATCCCAACCT GTCGCGCTTC
ATCACCTTCG ACGCCGGCGC CCGGCCCAAG CCCATCCTGG TCATGGAGCT GGTCGAGGGG
CCGACGCTCG AGCGCATGCT CGACCGCGAG GACATGAGCG TGGCCGAGCT CTTGGCCACC
CTCGACGGCG TGGCCGCCGG GCTCTCGGCC ATGCACCGGG TCGGCATCGC GCACCTCGAC
GTCAAGCCCT CGAACATCAT CCTGCGCCCG CGCGGCGGCG ACGTTCCGCT GGGCGAGACC
GCCGAATCCA TACCCGTGCT GGTCGATTTC GGGCTCGCCG GCCGCCGCAT CCGGCCCGGC
TGCGCGACCG TGTACTACGG CGCCCCCGAG GTCTGGGCGC AGACCCCGCA GCCCGATCTC
GCGCCCATGC CGACCGATGT CTACGCCTTC GCCTGCATGG CCTTCGAGAT GCTCACCGGC
GAGCTGCTCT TCGACGGCGA CACCGCGGTC GCGATCGTGT CCGAGCATCT GCGTCACGAC
GGCACGCCGG CGCGGCTCAC GCGCTACCGG CAGGCGCCGC ACCTCGAGGA GCTGATGTCG
CTGCTGGGCC AGGCGCTGCA CCCCAAGCCC GCGGCGCGCG CCGATATCGA CACCATCCGC
GCCGGGCTGG CCGCGCTCGC GCCAGCGCTC TCGGGCCACG GCTGGCCGCT GCTCTGA
 
Protein sequence
MLDAHSDLAG LAQLHRVESE DEVVALWRRG MATLSSLASE QHPVPLEGFN PAALLSTARI 
ALTRGFIDDL GWLSPPAAAV GVFALASALP RSDEKRELGR RVLQSLHQSD AETFIALATA
LALGSSRALR GAYVRARVAL SLDLPLGVGI RADALALALL SRPDSEQAWL GQPSMGALTS
RRLASRLLER AAREAARRYQ AGDDTGVRVF EMPSVRTAMD RLLGDRESLV WRHAAAARGL
LSNAIPAWGE AIEAGLGPEL SPTEWRRAAT SLAARIALDP EATLRRCRAV CEGELARADR
GVVAAMVHGI SCAADSDPAS AEALLEVLVR EVDYPTAEAL ADLRRERVGS DFGADAVERV
RTWLATSADA DALGVALMRE LAPSEERERR ILHDHLAAAL AEFAAGRDAR PDTERALEAA
HGALVRLERI TTGDAAERDA SQRRAAFLAL RELDRGLLET STLADLLLLY AAESRPMAER
LIERLARWLL AHETRPLSTE ALEDVQRHFT WRMHRLRALL HLVDIDVQYG DGQREDLYEW
RLRAVNVLLA RALGDVPSPL RRALCATLAR ACDAVVREEL CELSDVFVAV TLSLNSETDL
TVLAEASMMP SFKRAIHAYV DALDEIERAR EDVAEHAGAA GGPGGCLAAL HAVVAALPAA
ASPRVEALRG ALAQVARALS GIAAARGLSE LSSEAPGSPL GALLIGVTRM ARLVVGACRQ
LGYPANASAS ALGSALRGLE AEIEHALRGD RANLRSALID AARTMRAELA PLIADVAALV
FEHLDRLPAT APAGAASGEG RTETRGDSPS VPLAPWLPPD RVLGGFYVLR AIDRGAVGSV
FVACRVADRQ SETPDLFALK VPEYGGDAAH TLSEAEFMAL FREEAQALLS MPAHPNLSRF
ITFDAGARPK PILVMELVEG PTLERMLDRE DMSVAELLAT LDGVAAGLSA MHRVGIAHLD
VKPSNIILRP RGGDVPLGET AESIPVLVDF GLAGRRIRPG CATVYYGAPE VWAQTPQPDL
APMPTDVYAF ACMAFEMLTG ELLFDGDTAV AIVSEHLRHD GTPARLTRYR QAPHLEELMS
LLGQALHPKP AARADIDTIR AGLAALAPAL SGHGWPLL