Gene Hoch_5252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5252 
Symbol 
ID8547664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7214695 
End bp7219791 
Gene Length5097 bp 
Protein Length1698 aa 
Translation table11 
GC content69% 
IMG OID646389926 
Productsignal transduction histidine kinase with CheB and CheR activity 
Protein accessionYP_003269630 
Protein GI262198421 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG1352] Methylase of chemotaxis methyl-accepting proteins 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAG AGCAAAAGCA ACCCGCCTTG GTCGTCGGAG TTGGCGCGTC GGCCGGCGGC 
CTGGACGCGT TCAAACAGCT CCTGCCGGTG TTGCCGGGGG ACGCCGATAT GGCGTTCCTG
TTGGTACAGC ACCTCGATCC CACCCACGAG AGCCTACTCA TCGAGCTGTT GACCCCGTGT
ACGAAAATGC GCGTGTGCGA AGCCGCTCAA GGGGCCAAAC TCTGCGGCAA CACGATCTAC
GTGATCCGAC CGGACACCGC GCTCGCCGTC CGCGACGGGC GCATCGCGAT CAGCGCTCCG
ACCCTGCATC GCGGCGTGCG GCTGCCGGTC GACCATCTCT TCAGCTCGCT GGCGCACGAG
TACGGACCGC GCTCGGTCGG CGTCGTGCTC TCGGGCGCGG GCAGCGACGG ACGCGCTGGA
TTGCGCGAGA TCAAGAACGT CGGCGGGCTG AGCATCGTAC AGGAGCCGGC TACCTGCGCC
CAGCCGGGCA TGCCGCAGAG CGCGATCGAC ACCGGCATCG TCGATGTGGT CACCGAGATT
TCGGGGATGC CGGCGGTGCT CGAGCGTTTC TCCAAGTTGC CGCCCAAGGT GTTCCTCGAG
CCCGGTTACA GTGAGTTCGA CAGCGAGCGC GGCGCGGGCG AGTCCGGCGA GGGCGAGCCC
GAGGAGGATT CGCTGCGGCA CCTGAGCGAG CAGGGGATTG GACGGCTGTC CGCCTTGCTC
GAGGCGCAGC TCGATTTCGA CCTGCGGGTC TATAAAACCG GGACCATCGA GCGCCGGGTG
CTGCGGCGCA TGACCCTGTC GGGCTTCGAG GATATCGAGG GCTATTTCGA ATTTTTGCGC
CAGGACGCCT CCGAGCAGCA GACCCTGGTG CGCGACCTGC TGATCAGCGT CACCGACTTT
TTCCGCGACC CCGAGGCCTT CCGCGCGCTG CGGGAGAGCG TGGTCGAGCC CTCGGTCAAA
CAGGCGGCGC CCGGCGAGAC CCTGCGCGTG TGGATTCCCG GCTGCGCCAC CGGCGAAGAG
GCCTACTCCA TCGCCATCGA GTTCCTCGAC GCCATCGACG CCATCGAGAA GCGCCTGTCG
CTCCAGGTCT TTGCCACCGA CCTCGATCAG GACGCGCTGG CGGTCGGCCG AGCGGCCATC
TATCCCCAGA CCATCGCCGA GCGCATGTCG CCGGTGCGGC TGCAGACCTA CTTCAAGCCG
CTCGACGGGC AGGGCTACCA GGTGCGCACT CCGCTGCGCG ATACGGTGTC GTTTGCGGTG
CACGATCTGA CCAAGGATCC GCCGTTTTCG CGCATGAACC TGGTGAGCTG CCGCAACGTG
CTCATCTACC TGCGTCCGCA GGCGCAGAGG CTCGTGCTCA ACGTGTTGCA CTTTGCGCTG
CGCAGCGACG GCTACTTGTT CATGGGCACC TCGGAGTCCA CGGGTAAGCA GCGCGAGCTG
TTCTCGACGC TGTCCAAGCC GTGGCGCATC TACAAGAAGG TCGGCACCTC GCGTCCGATC
TCGGTGCTGC GCTCGATCCA GCGCCAGCCG AGCGAGCGCG ACGGCGGCAA CGCCTCGGGC
GGGCAGGCGG GCGAGTCCCC GCACGGACAC GGACACGGAC ACGGGCACGG ACAGGCGCGG
CGCGAGGGCG TGAACGATCT CGCCCGGCGG GCTGTGTTGC GGACGCGGGT GGCGCCGACC
ATCGTGGTGG GCGGCGATGG TTCGGTGCTG TTCATGCACG GCGAGCTGCG CCCCTATCTG
CGCTTTCCCG AGGGCGATAG TCCGCGCTTC GAGCTGGCCT CGCTGGTGGC TCCGGAGCTG
GCCACGCGCA CCCGCGGCGC GCTCTACAAA TGCCGCCGCG ACGGCGAGAC CGTGATCGCG
CTGTCGAGCC CCGACGAGGG CCGCGAGAGC CGCGTGCGCA TCGTGGCCAC GCCGGCGCAT
GAGTTCGGCG ACGGCGCCGT GGTGCTCAGC TTCGACGAGC TCGAGTCCGA GGTCGCGGTG
CGGCCCATGG AGGCCGAGGA GTCGGGCACC GGCGCGGTCA TCGAGCAGCT CGAGAAGGAG
CTGCAGGCCA CCCAGGAGGA CCTGCGCAAC ACGGTCGAAG AGCTCGAGAC CTCCAACGAG
GAGCTGCGTT CGTCGAACGA AGAGTCGATG TCGATGAACG AGGAGCTGCA GTCGGCCAAC
GAGGAGCTCG AGGCCACGAC CGAAGAGCTG CGCTCGCTCA ACGAAGAGCT GACGACGGTG
AATTCTCAGC TTCGCGAGAA GGTCGAGCAG CTCGAGCAGA CCCACGACGA CCTGACCAAC
TTCTTCAGCA GCACCAAGAT CGCGACCATC TTCCTCGACG ACCGCCTGTG CATCAAACGC
TTCACGCCGC CGGCCAGCGA CCTGCTGGGT ATCGATCACG GCGACGTCGG CCGCTACGTC
GGCGACATCG CCCGCGACCT GTTGCAGAAT CAGCTCGCGC GCGAGGCCCG CGGCGTGCTC
GACCACCTGA GCACGCACTC GCGCGAGCTG GGCACCGAGA ACGGGCGCTG GTTCACCCGC
CAGGTGTTGC CCTACCGGAC GGAGAATCGG CGTATCGAGG GCGTGGTGGT CACCTTCGTC
GATGTCACCG AACTGCGCGC GACCACCGAG CGCCTGGCGG TGCGCGAGCG CCAGCAGGCC
GTGATCTCGC GGCTGGGTCT CGACGCGCTC AAAGAGCCCG ACTTGCAGGG CTTCATGGAG
CAGGTGACGC GCGAGGTACA GCAGACCTTG GACACCGATT TCTGCAAGAT CCTCGAGCTG
CAACCGGGGC GCAAACGCTT CCTGCTGCGC GCCGGCGTCG GCTGGGACGA GGGCGAGGTG
GGCACGGCCT CCGTGCACGC CGGCCTCGAC TCGCAGGCCG GGTTCACCCT ACAGACCTCG
GAGCCCGTGA TCGTCGAGGA TCTGGCCAGC GAGCGCCGCT TCTCCGGGCC GCCGCTGCTG
GTCGAGCACG GCGTGGTGAG CGGTCTGAGC TGCCGCGAGG ATTTTGGCGT CATCGCCGTG
CACACGCGCA CGCGGCGCCT GTTCAGCCGC GAGGACGCCC ATTTTCTGCA GTCGGCGGCC
AGCGTCATCG GCGCGGCCGT GGGCCGCCAT CTCACCCGTC TGCGCCTGGG GATCGAGCGC
GCGGTGGCGC GCGAGCTGAG CGAGCCCACG GCGCCCGAGG ACGCGCTGCG CCGGCTGCTG
TCGTGCTTCA CCCGCGAGAG CGGCGCCTCG GTCGGCGAGC TGTGGTGGCC GGAGCCAGGC
GGCAAAGAGC TGAGCTGCCG GATGCTCTAC ACCGACCACG GCGTGCACGA GGACGAGGTG
CGCGAGCAGC TCGGGAGCCG CACCTTTCCG CCCGGCGATG GTCTCGTCGG ACACGTGTAC
CGCGACGGAC GCGCGGTGTG GTGCACCGAC ATCGGCGATC CCGATCTGTT CCCGCGTCGC
GACGCGGCGC GCGAGTTCGG CCTGGTCACC GGCCTGGGCA TTCCCCTGGT CGCCGGCTCG
GAGTCGCTGG GCGTCATCGT CGTGCTGTCG CGCGAGCGCA TCATCGCCGA CGACAGCTTC
CTGCGCAGCC TCGAGGGCGT CGGCCGATCG ATTGGCGACG CCTTGGCGCG CGCGGAGGTG
GAGGACAAGG CGCGGCGCCT GGCCGCCATC GCCGAGTCCT CGCACGACGC CATCCTCACC
TTTGGCTTCG ACGGTCGCAT CCGCGAGTGG TTGGGCGGGG CCCAGCATCT CTACGGCTAC
GCGGCCGAGG AGGTGGTCGG CGCCTCGATC GACATGCTGG TGCCTGGCGA GCGCCGCGCC
GAGCTCGAGG ACATGATGGG CCGCGTCCAG CGCGGCGAGC TGCTCGAGCC ACAGGAGAGC
GTGCGCCGGC GCAAGGACGG CAGCCTGGTC GAGGTGTCGG TGCGCAGCTC GCCCATCCGC
GACCCGCACG GCCGCGTGGT CGCCGTGTCG TCGACCGATC GCGACGTCAC CCGGCAGAAG
GAGACCGAGC GCCGGCTCAA GGCCGCCGAT CGCCAGAAAG ACGAGTTCTT GGCCATGCTC
GGCCACGAGC TGCGCAACCC GCTGGCCGCG ATCCGCAGCG CGGCCGAGTT CTTGCATCTG
CACGGAGACG ACAGCCCGCA GCTTGAGCGC ACGCGCGTGA TCGTCGAGCG CCAGAGCGCG
CACATGGCCA AGCTGCTCGA CGGCCTGCTC GACATCTCGC GCATCATCAG CGGCAAGATC
CGGCTCGAGA CCGAGGTCGT CGATTTCAGC GCTATCTGCC GCGAGGTGGC CGCAGACGCC
AAGCCGCGCG CGAGCGCGCT CGACATCCAC TTTCAGACCG ATCTGATGGT GGCGCCGATC
TGGCTCGAGT GCGACCGCGT GCGTATCACC CAGGTGGTCG ACAACCTGCT GTCGAACGCG
CTCAAGTTTA CCGAAGCCGG CGGTTCGGTC ACGATCTCGC TGTGGCGCGA GGGCGGTGAG
GGCGTGCTGG TGGTGCGCGA TACCGGTATC GGCATGGAGC CCGATCTGCT GCCGGTGGTG
TTCGATGTAT TTCGCCAGTC AGAGCAGAGC CTGGACCGCT CACACGGTGG CCTGGGGCTG
GGGCTGGCGC TGGTTCGCTC GCTCACCGAG CTGCACGGCG GCTCGGTCGC GGCCCACAGC
GAAGGGCGCG GGCATGGCTC CGAGTTTGTG GTCCGGCTGC CCATCACCGA GCGCTCGGCG
CCGGTCTCGG TGTCCGAAGT GGACGCGGGG GAGACCCACA TGCACCTGCT GCTCATCGAG
GACAATCTCG ATTCGGCCGA GATGCTCTCC GAGCTGCTGC GCATCAACGG ACACCGGGTC
GATGTCGCCG CCGATGGCGT CGAGGGCATC GAGATCGCGC GGGCGGAGAA GCCCGATGTG
GTGCTGTGCG ACCTCGGGCT CCCGGAGGGC GTGACCGGCT ACGACGTGGC CCGCGAGCTG
CGCGCCGACG AGCGCACGCG CGCCATCCGC CTGGTCGCGC TCTCGGGCTA CGGACGCCCC
GAGGACAAGA CCCGCTGCGT GGAGGCCGGC TTTGACGCGC ACTTCACCAA GCCGGTGTCG
CTCGAGCTGC TCGAGCGTCT GCTGGTCGAG TACAAGGTCT CGGTCGGCGG CGCCTGA
 
Protein sequence
MSQEQKQPAL VVGVGASAGG LDAFKQLLPV LPGDADMAFL LVQHLDPTHE SLLIELLTPC 
TKMRVCEAAQ GAKLCGNTIY VIRPDTALAV RDGRIAISAP TLHRGVRLPV DHLFSSLAHE
YGPRSVGVVL SGAGSDGRAG LREIKNVGGL SIVQEPATCA QPGMPQSAID TGIVDVVTEI
SGMPAVLERF SKLPPKVFLE PGYSEFDSER GAGESGEGEP EEDSLRHLSE QGIGRLSALL
EAQLDFDLRV YKTGTIERRV LRRMTLSGFE DIEGYFEFLR QDASEQQTLV RDLLISVTDF
FRDPEAFRAL RESVVEPSVK QAAPGETLRV WIPGCATGEE AYSIAIEFLD AIDAIEKRLS
LQVFATDLDQ DALAVGRAAI YPQTIAERMS PVRLQTYFKP LDGQGYQVRT PLRDTVSFAV
HDLTKDPPFS RMNLVSCRNV LIYLRPQAQR LVLNVLHFAL RSDGYLFMGT SESTGKQREL
FSTLSKPWRI YKKVGTSRPI SVLRSIQRQP SERDGGNASG GQAGESPHGH GHGHGHGQAR
REGVNDLARR AVLRTRVAPT IVVGGDGSVL FMHGELRPYL RFPEGDSPRF ELASLVAPEL
ATRTRGALYK CRRDGETVIA LSSPDEGRES RVRIVATPAH EFGDGAVVLS FDELESEVAV
RPMEAEESGT GAVIEQLEKE LQATQEDLRN TVEELETSNE ELRSSNEESM SMNEELQSAN
EELEATTEEL RSLNEELTTV NSQLREKVEQ LEQTHDDLTN FFSSTKIATI FLDDRLCIKR
FTPPASDLLG IDHGDVGRYV GDIARDLLQN QLAREARGVL DHLSTHSREL GTENGRWFTR
QVLPYRTENR RIEGVVVTFV DVTELRATTE RLAVRERQQA VISRLGLDAL KEPDLQGFME
QVTREVQQTL DTDFCKILEL QPGRKRFLLR AGVGWDEGEV GTASVHAGLD SQAGFTLQTS
EPVIVEDLAS ERRFSGPPLL VEHGVVSGLS CREDFGVIAV HTRTRRLFSR EDAHFLQSAA
SVIGAAVGRH LTRLRLGIER AVARELSEPT APEDALRRLL SCFTRESGAS VGELWWPEPG
GKELSCRMLY TDHGVHEDEV REQLGSRTFP PGDGLVGHVY RDGRAVWCTD IGDPDLFPRR
DAAREFGLVT GLGIPLVAGS ESLGVIVVLS RERIIADDSF LRSLEGVGRS IGDALARAEV
EDKARRLAAI AESSHDAILT FGFDGRIREW LGGAQHLYGY AAEEVVGASI DMLVPGERRA
ELEDMMGRVQ RGELLEPQES VRRRKDGSLV EVSVRSSPIR DPHGRVVAVS STDRDVTRQK
ETERRLKAAD RQKDEFLAML GHELRNPLAA IRSAAEFLHL HGDDSPQLER TRVIVERQSA
HMAKLLDGLL DISRIISGKI RLETEVVDFS AICREVAADA KPRASALDIH FQTDLMVAPI
WLECDRVRIT QVVDNLLSNA LKFTEAGGSV TISLWREGGE GVLVVRDTGI GMEPDLLPVV
FDVFRQSEQS LDRSHGGLGL GLALVRSLTE LHGGSVAAHS EGRGHGSEFV VRLPITERSA
PVSVSEVDAG ETHMHLLLIE DNLDSAEMLS ELLRINGHRV DVAADGVEGI EIARAEKPDV
VLCDLGLPEG VTGYDVAREL RADERTRAIR LVALSGYGRP EDKTRCVEAG FDAHFTKPVS
LELLERLLVE YKVSVGGA