Gene Hoch_1933 details

Gene Information

Locus tag: Hoch_1933
Symbol:
ID: 8544315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2655832
End bp: 2657784
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 72%
IMG OID: 646386638
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003266373
Protein GI: 262195164
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
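
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2655832, 2657784)
print(length)                  # 1953
print(protein_length(length))  # 650
```

Both values agree with the Gene Length and Protein Length fields of the record.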


Plasmid Coverage information

Number of covering plasmid clones: 11
Plasmid unclonability p-value: 0.77937
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 18
Fosmid unclonability p-value: 0.846715
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTCCT CCATCTCGCA GGAGCTGACG GCGGCGTACG AGTTGCTTGC CCACCCGCTG 
TGGGTATTCG ACGTGGAGAC GCTGAGCATG GTGTGGGCCA ACCAGCCGGG GCTGCGCATG
TGGCGCGCCG ACGACCTCGA CGAGCTGCTG GCGCGCGACT ACGCCGCCGA CATCAGCGAG
GCCGCGCAGC GGCGTCTCAT GGGCACCTTG CACCGCTGCC TGCAGGGCGA GCGCATCCGC
GAGCGCTGGA CCCTCTACCC CCTGGAGGAC ACGCCCGTGA CCGTGGACTG CGGGCTGGCG
GCGGTGCCGC TCGAGGACGG CCGCACCGCT CTGCTCATCA GCGGCATGCC GGTGCAGGCC
AGCGAGATCG CGGCCGAGAA TCAGCGCGGT CTCGAGACCT TGCGCCACCT GCCGCTGGCG
GTGTCGCGCT TCGACGCGGG CGGCGCGCTG GTGTTTCAGA ACCCGGCCGC GGCCAGCGCC
TTCGGCCCCG AGGACGGCCC GCACGCGCGT CTGCTCGAGC GCTTTGCCGA TCGCGAGCAC
GGCGCCGCGG CGCTGCGCGA GGCGCGCGCG GGCGCCATCG TGCGCACCCA GACCCGTCTG
CGCACCGGCG CGGGCGAGCG CTGGCACGAT GTCGAGCTGC GTCAGGGTCG CGACCCCGTG
ACCGGCAAGC CGCTGCTGCT GTTCATGGCC GGCGACATCT CCGAGCTCAA GCAGGCCGAG
GCCGAGCTGC GCGCGGCCAA GGAGCAGGCC GAGGCCGCGG CGCGGGTCAA GAGCGAGTTT
TTAGCGACCA TCTCGCACGA GATGCGCACG CCGCTGCACG GCATCATCGG CTTCGGCGAT
CTGCTGGCAC ACAGCCACCT CGACCCGCAG CAGCGCGGCT TCCTCGACTC GCTGCGCGAG
TCGGCGCGGC TGCTGTTTCG CCTGATCAGC GACGTCCTCG ACCTGGCCCA GATCGAGGCC
GGACGCATGC ACTTCGAGGA CCGCGCCATG GATCTCGAGG CCCTGATCGC CCGGGTGCGC
GACGTCATCG CCTTTCAGGC CGCCGAGAAG CGTCTGGTCC TCGAGGCCCA GCTCGACCCC
GAGCTGCCGC GCCACCTGCG CGGCGACGCG CTGCGCATCG AACAGGTGCT GGTCAACCTG
CTCGGCAACG CGCTCAAGTT CACGCCCGCC GGCCGCGTGA GCTTGCGCGT GCGCTGCGTC
CCCGAGTGCG CGGGCGAGGA CGCGCTCGAG TGCGCGAGCG CAGCCGGCCG CTGCCTCGGC
GACGTGCACC GCCTGCGCTT CGAGGTCCGC GACACCGGCG TCGGGATTTC GTCCGATCGT
CTGAATATCC TCTTCGAGCC CTTCACCCAG CTCGACACAT CGGCGTCGCG CGCCATCGGC
GGCAGCGGTC TGGGCCTGGC CATCTGCAAA CGCCTGGTCG AGGGCATGGG CGGTGTCATC
GGCGTCGCCA GCACACCGGG GAAGGGGAGC TGCTTCTGGT TCGAGCTGTC GCTGGCCGCG
GCCGATACCG CGCGCGCGCC TGCGGCCGAG CGCGAAGGCG GCGCCGGCGC CAGCGGCACG
AGCGAGGCCG AGGCCGAAGC CGAGAGCCGT CTGCACGTGC TGGTCGTGGA CGACAACGCG
CTCAATCAGA CCTTGGCGCG CACCCTGCTG CAGCGCCGCG GCCACCGCGT CACCGTGGCC
GAAAACGGCC AGCAGGCGGT CGCCCTGGTC GCGCGTGGCT CCTTTGACGC CGTGCTCATG
GACGTGCACA TGCCGACCAT GGACGGCCTC GCGGCGACGC GCGCCATCCG CGCCATGGGC
CTGTCGGCCA GCCAGCTCCC GATCGTCGGC TTGACCGCCA GCGTGGTCAA CGACAACCGC
GAGGAGTACC TCGCCTCGGG CATGAACGAC TGCATCGGCA AGCCATTCCG CGTGGCCGAG
CTGTTCGCCG TGCTGTCGCG CCAGCGCCGC TGA
 
Protein sequence
MRSSISQELT AAYELLAHPL WVFDVETLSM VWANQPGLRM WRADDLDELL ARDYAADISE 
AAQRRLMGTL HRCLQGERIR ERWTLYPLED TPVTVDCGLA AVPLEDGRTA LLISGMPVQA
SEIAAENQRG LETLRHLPLA VSRFDAGGAL VFQNPAAASA FGPEDGPHAR LLERFADREH
GAAALREARA GAIVRTQTRL RTGAGERWHD VELRQGRDPV TGKPLLLFMA GDISELKQAE
AELRAAKEQA EAAARVKSEF LATISHEMRT PLHGIIGFGD LLAHSHLDPQ QRGFLDSLRE
SARLLFRLIS DVLDLAQIEA GRMHFEDRAM DLEALIARVR DVIAFQAAEK RLVLEAQLDP
ELPRHLRGDA LRIEQVLVNL LGNALKFTPA GRVSLRVRCV PECAGEDALE CASAAGRCLG
DVHRLRFEVR DTGVGISSDR LNILFEPFTQ LDTSASRAIG GSGLGLAICK RLVEGMGGVI
GVASTPGKGS CFWFELSLAA ADTARAPAAE REGGAGASGT SEAEAEAESR LHVLVVDDNA
LNQTLARTLL QRRGHRVTVA ENGQQAVALV ARGSFDAVLM DVHMPTMDGL AATRAIRAMG
LSASQLPIVG LTASVVNDNR EEYLASGMND CIGKPFRVAE LFAVLSRQRR
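
The relationship between the two sequence blocks can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it builds the codon table by hand, relying on the fact that translation table 11 assigns sense codons the same amino acids as the standard code, and it strips the spaces used to group the sequence into blocks of ten. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard sense-codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCGGTCCT CCATCTCGCA GGAGCTGACG"
print(translate(first_blocks))  # MRSSISQELT
```

The translated residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.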