Gene Hoch_1794 details


Gene Information

Locus tag: Hoch_1794
Symbol:
ID: 8544176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2480223
End bp: 2483120
Gene Length: 2898 bp
Protein Length: 965 aa
Translation table: 11
GC content: 68%
IMG OID: 646386500
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_003266235
Protein GI: 262195026
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
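
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive at both ends, and that the coding sequence ends with a single untranslated stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2480223
end_bp = 2483120
gene_length_bp = 2898
protein_length_aa = 965

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions the three fields agree: 2898 bp is 966 codons, which gives 965 residues plus one stop codon.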


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0332193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.80092
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTTA CCAAAGCGGC CATGTCAGTT ACCAAAGCGG CCATGTCAGT TACCAAAGCG 
GCTATGTCAG TCACGAAAGC GGCCACGTCA GTCACCAAAG CGCCCACATC GGTCACCAAA
GCGGAACTCG TCGCGATCGC GGGACGGCAT CTCAGCGCGC TCGCCGCCGG GACATGTGAA
CTCACACGGG CCGACGCCGA GCGCGAGGCC GATCCCGAAC TGCGCACGCT GCTTTTGGGA
ATCGTCGAGC TGGCCGAAAA ACAGCGCGCC GAGGCCGCCG AGCGCGCTCG TGCCGAGCTG
CTCGCGACCA TATTTGACGC GTTGCCGCTC AACATTTTTT TAAAAGACGA AGATGGCCGC
TTCCTCGAGG CCAACCGCAG CACCTGCGAG GTGGTCGGCA AGCGGCGCGA GGAGGTGCTC
GGCAGCACCG ACTACGACAT CTTTCCGGCC GCTCTGGCCA AGCGTCTGCG CGAGGCCGAC
CGCCTGGCTC TCGATGAGAA TCGCATCGTC GTCGAGGACG AGCGCCTGCG CCGCCGCACG
GGCGACGCGC AGGAGAAATT ATATTACGCG GCCAAGCTGC CCATGACGCG GCCGAGCGAC
GGCGTCCGCT GCTTGCTCGG CGTCTCGTTT TCGTTCGAGT GGCGCGAGCG CGAAATCGAC
GAGCTGGTCA AGCGCCACGA GTTCGTCCAG CGGGTTCTCG ACCAGTTGTC CCAACCCATC
TGCGTCAAGG ACCGTAGCGG CCGCGTGCTG TTCTCCAACA AGGCGGCCGA CGCGGTGACG
CAGTTCACGG GCGAGGCCAG CGAGGACGAG CGCGAGCGGC GGGTGTTCGA CGATCTCGAC
GAGCTCAGCA GCGAGCTGCC GCTGCCGCTG CCGGGCGGCG GCACCCGCTG GCTGCGCGTG
CGCCGGATGC CGCTGAGCAT CGGCGACAAC GAGGTATATT CGCTGAGCAT CGGCGACGAC
ATCGACGATC GCAAACAGCT CGAGCAGCAG CTCGAGGACG CCGCGTTCAT GATGCAGACG
GTGATCGACG CGAGCCCCGA CTGCGTGATC GTCAAAGACC TCGAGGGCCG CGTCCTGTAC
GTCAACGAGG CCTTTGCGCG CGAGGTCAGC GTGCCCCGCG CCAAACTGGT CGGGCTGTCG
GCCTTCGACG TCCTGCCGCG GGCGCGCGCC GCGGCTCAGG TGGAGCGCGA GCGCATGGTG
GTCGAGGGCG GCGTGCAGGT GCGCACCGAG GTCGCCCGGC CTCGCGACGC GCGCGGCAAG
GTCATCTACG AGCTGTTCGA GCTGCCCATG CGCGACCGCT CGGGCGCGCT CAGCGGTCTC
ATCAGCATCG GCCGCGACAT CACCGACTGG AAGCAGTCCG AGCGCGCGCT GAGCCTGGCG
CATCAGCAGG CCGAGGCCGC GCGCCTGGCC AAGACCGAGT TCTTGGCCAA CATCAGCCAC
GAGGTGCGCA CCCCGCTGAG CGGCATCATC GGTATGACCA GCTTGCTGCG CACCACCGAG
CTCGATGCCG AGCAGGCCGA CTACGTGGCC ACCGTGGAGG CCAGCGGCTC GACCCTGCTC
AACCTCATCA ACGATCTGCT CGACATCTCG CGTCTGGAGG CTGGGCGCAT CGCGCTCGAC
GAGCAGATGT TCGAGCTCAT GCCGTGCGTG ACCCACACGG TGAACCTGGC CAGACCGATG
GCGCGCGAGA AGGGGCTGGC GCTCGATCTC GACATCGCCG AGGGCGTACC GGCCTTCGTG
CTGGGAGACG AGCTGCGTCT GTCGCAGGTG CTGAGCAACC TGGTCACCAA CGCGCTCAAG
TTCACCGAGC AGGGGCGGGT GCTGGTCAAG GTGAGCCCCT GCGGGTACGA GGACGACAGG
ATCCGGCTGC GTTTCGCGGT CCACGACACC GGCATCGGTA TCCCCGAGGA CAAGCGCGAG
GTGCTGTTCG AGCGCTTCAC CCAGGTCGAT ACCTCGACCA CGCGGCGCTA CGGCGGCGCC
GGTCTCGGCC TGGCCATCAG CCAGCATCTG GTCAACGCCA TGGGTGGGCG GATCGAGCTG
CAGAGCCGCG AGGGCGAGGG CTCGTGCTTC TCCTTCGCGC TCAGCCTGCG CCGGCCGGAG
GAAGCGGCCG TGGCCGCGGC GACGGCCGAG GACCTGAGCC GCGATATCGG CCAGCCGATG
CCGCAGTTGC CGGCGCTGCG CATCCTGGTG GCCGAGGACA ACGCCGTCAA CCGCGAGGTC
GCGCTCGGCC TGCTGGCCAA GCTCGGACAC CGCGCCGACG TCGTCGGCAA CGGCGCCGAC
GCGGTCGCCG CGCTCGAACA GCACAACTAC GACATGATAT TCATGGACGT GCACATGCCG
GTGCTCGACG GCCTCATGGC GACGCGCGAG ATCCTCTCGC GCTGGCCGCA GGGCAGGCGT
CCCGTGATCA TCGCGATGAC CGCAGACGCG ATGAACGGCG ATCGTGAGCG ATGTTTATCT
GCGGGCATGC AAGACTACGT GAGCAAACCG GTGTCTATGA AGAACCTCTC CGCCATGCTG
TCGCGCTGGG CCTCGCCGGC GCAGGAAGCC GAAGTCGCAG CCACCATCGA CCGCGAGCTC
TTCGAAGCCT ACGGCGCCGA GCTGATGCGC GAGCTGCTGC AGGCGTTCAC CGATACCGTG
CCCGCGCGCA TCGAGGCGGT GAAGCAGCAC TTCGCCGCTG GCGAGGCGCA CGCGCTGGCC
GACGAGGCGC ACGCGCTCAA GGGCGCTGGC CTCAACCTCG GCGCCCACCG CTTCGCCGAG
GTCTGCCGCG AGCTCGAGAG CCGCGGCCGC GAGGGCGCGA TCGCCGGCCT CGAAGACCGA
CTCGAGCAGC TCAGCCAGGC CTACGTCAAC ACCCGCGTGG CGCTCGCTAC CCTGCTCAAG
CACGCCGAGG GCGGCTGA
 
Protein sequence
MSVTKAAMSV TKAAMSVTKA AMSVTKAATS VTKAPTSVTK AELVAIAGRH LSALAAGTCE 
LTRADAEREA DPELRTLLLG IVELAEKQRA EAAERARAEL LATIFDALPL NIFLKDEDGR
FLEANRSTCE VVGKRREEVL GSTDYDIFPA ALAKRLREAD RLALDENRIV VEDERLRRRT
GDAQEKLYYA AKLPMTRPSD GVRCLLGVSF SFEWREREID ELVKRHEFVQ RVLDQLSQPI
CVKDRSGRVL FSNKAADAVT QFTGEASEDE RERRVFDDLD ELSSELPLPL PGGGTRWLRV
RRMPLSIGDN EVYSLSIGDD IDDRKQLEQQ LEDAAFMMQT VIDASPDCVI VKDLEGRVLY
VNEAFAREVS VPRAKLVGLS AFDVLPRARA AAQVERERMV VEGGVQVRTE VARPRDARGK
VIYELFELPM RDRSGALSGL ISIGRDITDW KQSERALSLA HQQAEAARLA KTEFLANISH
EVRTPLSGII GMTSLLRTTE LDAEQADYVA TVEASGSTLL NLINDLLDIS RLEAGRIALD
EQMFELMPCV THTVNLARPM AREKGLALDL DIAEGVPAFV LGDELRLSQV LSNLVTNALK
FTEQGRVLVK VSPCGYEDDR IRLRFAVHDT GIGIPEDKRE VLFERFTQVD TSTTRRYGGA
GLGLAISQHL VNAMGGRIEL QSREGEGSCF SFALSLRRPE EAAVAAATAE DLSRDIGQPM
PQLPALRILV AEDNAVNREV ALGLLAKLGH RADVVGNGAD AVAALEQHNY DMIFMDVHMP
VLDGLMATRE ILSRWPQGRR PVIIAMTADA MNGDRERCLS AGMQDYVSKP VSMKNLSAML
SRWASPAQEA EVAATIDREL FEAYGAELMR ELLQAFTDTV PARIEAVKQH FAAGEAHALA
DEAHALKGAG LNLGAHRFAE VCRELESRGR EGAIAGLEDR LEQLSQAYVN TRVALATLLK
HAEGG
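
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, not part of the original record, of how one might strip that layout and compute a GC fraction to compare against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from this record.
sample = "ATGTCAGTTA CCAAAGCGGC CATGTCAGTT ACCAAAGCGG CCATGTCAGT TACCAAAGCG"

seq = clean_sequence(sample)
print(len(seq), "bases")
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

To check the reported GC content, paste the full gene sequence into `sample`. A single 60-base line will not necessarily match the whole-gene figure.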