Gene Hoch_3203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3203 
Symbol 
ID8545591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4415314 
End bp4417611 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content68% 
IMG OID646387870 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_003267598 
Protein GI262196389 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0763722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTTCTC TGCGCTACAA GCTGTTCGTT CCGCTGTTTC TGGTCGGTGC CCTGGCGTCG 
ATCGGTTTTC CCTATCTGAC CTACGTGATG GTGCTCGACA AGCTCGAGCA CCAGGCCGTG
CAGCGCATGC GCACCATGGT CCACTCGGTC AACTACGCGG TCGAGAGCAT GGACGAGCTG
TCTCAACTCG AGCGCTTCGT CATGGCGGTG GGCGGCGAGG ACGACATCGA GGTGCTGGCC
ATCATCGGCG GCGAGCCCGC GCGCGTGGTC ACCAGCAGCC GCTTTGGCTG GAACGGACGG
CAGATTTCCG ACATGGCGTC CGAGATCCCG CTCGACGCCA TCGCCGAGGC GCAGCGGAGC
GGGGGCGAGC GGCTGCTCGA GGAGGGCGAG GATCACCTGC GCCTGCTCTC GCCCCTGTAC
CTCAACAAAA CCGGGCGCGC GCGCGCGATG TGGCACAGCG GCCTGATCAT GGCCGTGATC
GACTTCGGCA CGGTGCGCCA CGAGGCCGAG CGCGATGCGG TGCAGGCCTC GCTGTTGGCG
CTCGGTCTCA TTCTGGTGCT GGCGACCGGC GCGGTCGTGA CGCTGCGGCG CGTGGTCCTG
CGCCCGTTGC GCGCGGTCTC GGCCGCCATG GGGCGACGCA CGGCCGGCCA GCGACAGGCC
TACGCCAAGG TCTACGCCGA CGACGAACTA GGTGAGCTGG CGCGCACGCT CAACCACATG
GTCGATGTGC TGAGCGATAA GGAAGCTCAG TTGCGCACCT TGATCGACAA CGTGTCCGGC
GCCGTGTATC AGCTCAAGTG GCAAGGCAAC TGGCGGCTGG CCTTCATGAG CGAACACCTC
GAGGAGCTCG CCGGCTATCC GTCGAGCTAC CTCGACGGCA ACTTTCGCTT CGTCAACCTC
ATCCACCCCG ACGATGTCGG CGTGCTGCGC GCCCTGATGC AGGAGACCAC GAAGGCGCGC
GGCAGCGGCT ACCAGCTCGA GTACCGGATC ATCCACGCCG ACGGCGAGGA GCGGTGGGTG
CTCGACCGGG TGCGCCTAGT TTACGACGAC AGCGGCCGGC CGCTGCACGC CGACGGCATC
CTGGTCGACG TCACCGAGCG CCATCAGCAG AGCGAGCTTC TGCGCGAGGC CAAGGAGGCG
GCCGAGGTGG CCTCCCGGGT CAAGAGCGAC TTCTTGGCGA CCATGAGCCA CGAGATCCGC
ACGCCGCTCA ACGGCGTCAT CGGCATGGCC TCGCTGCTGC TCGACACCGA GCTGAGCGAC
GAGCAGCGCG AGTACGCCGA GACCATCGAT GTCTCGGGCA ATCTGCTGCT GGCGCTGATC
AATGACATCC TCGATTTCTC GAAGATCGAG GCCGGGCGTA TGGAGCTCGA AGAGGTCGCC
TTCGAGCTGC GCTACCTGAT CGAGGAGACG CTCGCGATCG TGGCGCCGCG GGCGCGCGAG
AAGAACCTCA GCCTCGACTG GCAGGCCGAG GCCGCGGTTC CCCGGCGCGT GATCGGCGAC
GCGCAGCGGA TGCGCCAGGT GCTGCTCAAT CTGGTCGGCA ACGCCATCAA GTTCACGCAC
GACGGCTCGG TCGAAATCCG CGTGCGGGTA CGCGAGCGCG AGGGCGACGC GCTGCTGCTC
GAGGTCGCGG TGGCCGACAC CGGCATCGGC ATCTCGCCGG GCGAGATGAT GCGCATCTTC
GAGCCCTTCT CGCAGGCCGA CGCCTCGACC ACGCGGCAAT ACGGCGGCAC CGGCCTGGGG
CTGGCCATCT GCAAGCGTTT GGTCGATCGC ATGGGCGGGA CCGTGGTGGT CAAGAGCCAG
CCCGGACGGG GCTCGACCTT CTCGTTCACG GTGCGGCTGC GCGTGGCCCC GCGGCGCGGC
GTGACCACGC CGGCGGCGGC CCTGGCTGTG CCCGCGCTGA GCGAGGCGCT GCGCGCGCGG
CGCGTGCTGG TGGCCGAGGA CAACGAGGTC AATCGGCGGG TGGTGGTGCA TATCTTGAAA
CAGCTCGGCT TCGAACCCGA GGCGGTGGAA AATGGCCGCG CCGCGGTCGA GGCCTTTGCC
GCCGGGACCT TCGACCTGGT GCTGATGGAC TGCCGCATGC CGGAGATGGA CGGCTTCGCG
GCCACCCGCG TGCTGCGCGA CCAGTTCGAC GCGCTCGTGC CGATCATCGC GGTCACGGCC
AGCGCCTCGG CCGACGATTC GCGCATGTGC CTCGAGGCCG GCATGGACGA CCATATGAGC
AAGCCGGTGA CCAAAGACGG GCTCTGCGCC ATGCTCCGGC GCTGGCTGCC GACCTCCAAC
CCGCCCGCGT CGTCGTAG
 
Protein sequence
MASLRYKLFV PLFLVGALAS IGFPYLTYVM VLDKLEHQAV QRMRTMVHSV NYAVESMDEL 
SQLERFVMAV GGEDDIEVLA IIGGEPARVV TSSRFGWNGR QISDMASEIP LDAIAEAQRS
GGERLLEEGE DHLRLLSPLY LNKTGRARAM WHSGLIMAVI DFGTVRHEAE RDAVQASLLA
LGLILVLATG AVVTLRRVVL RPLRAVSAAM GRRTAGQRQA YAKVYADDEL GELARTLNHM
VDVLSDKEAQ LRTLIDNVSG AVYQLKWQGN WRLAFMSEHL EELAGYPSSY LDGNFRFVNL
IHPDDVGVLR ALMQETTKAR GSGYQLEYRI IHADGEERWV LDRVRLVYDD SGRPLHADGI
LVDVTERHQQ SELLREAKEA AEVASRVKSD FLATMSHEIR TPLNGVIGMA SLLLDTELSD
EQREYAETID VSGNLLLALI NDILDFSKIE AGRMELEEVA FELRYLIEET LAIVAPRARE
KNLSLDWQAE AAVPRRVIGD AQRMRQVLLN LVGNAIKFTH DGSVEIRVRV REREGDALLL
EVAVADTGIG ISPGEMMRIF EPFSQADAST TRQYGGTGLG LAICKRLVDR MGGTVVVKSQ
PGRGSTFSFT VRLRVAPRRG VTTPAAALAV PALSEALRAR RVLVAEDNEV NRRVVVHILK
QLGFEPEAVE NGRAAVEAFA AGTFDLVLMD CRMPEMDGFA ATRVLRDQFD ALVPIIAVTA
SASADDSRMC LEAGMDDHMS KPVTKDGLCA MLRRWLPTSN PPASS