Gene Hoch_5807 details


Gene Information

Locus tag: Hoch_5807
Symbol:
ID: 8548221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7975839
End bp: 7978811
Gene Length: 2973 bp
Protein Length: 990 aa
Translation table: 11
GC content: 68%
IMG OID: 646390474
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003270176
Protein GI: 262198967
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
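
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 7975839, 7978811
gene_length_bp, protein_length_aa = 2973, 990

print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed: the span covers 2973 bp, which is 991 codons, one of which is the stop codon.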


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGTTC GTCATGAGCT GTTTGCCGCG GCCCTGTCCG CGGTAGACGC CGCCGTGGTG 
ATGACCGATG CTGACGACCG GATCGCGTGG GCCAACGCCG CGGCGCTCGC GCTCTTTGAC
TGGGACGCAG ACGGCGCCTG CGGGCAGTCG CTGAGCGCGG TGTTGCGATT TCAAGGGCCG
CGTCCGGTGC TCGCGCCCGG CGAGAGCGCG GCGGTGTGCG TGCTCCACGG CAGCGCGGAG
GCGTCGCGGG TCGTGCACGG CACCATGCGC AGCGTGGCCG AGAACAGCGG CGAGCGCGTC
GGTCACGTGT TCGTGTTTAC CGGGCAGCAG ACGTACGAGG CGCGCGCGAA GCATTGGCTG
TCGCAGTCCT CGCTGCTCTA CGCGCTGCTC GGCAGCAACG GCTATCTGCA CGAGGTCGGC
ACGGTGTGGA GCGAGCGCTT CGCCTACCCG CGGGCGATGC TGCTGGAGCG GCCGCTGCTC
GAGCTCGTGC ACGAGGACGA TCGCGAGGCC ATGGCGCGCA GCCTGGCGCA GATCGCGAAC
TCGGACGCGG CCTGCGGGGC CGTCGAAACC CGCTTGCGCC GCGCTCAGGG CGGGTATCGC
TGGCTGTCGT GGTACATGGT CTACGACGCC GAAAACCAAT GCGTGCACCT GAGCGCGCAG
GATGTCAGCG AGGTCAAGCG CCAGGAGCGC CTGCTGGCCG AGACCCAGGG CGCGGCCAGC
ATCGGCGGCT GGGAGCTCGA CCTGCACGAC ACCACCCTGT ACTGGACCGA CGAGATCTAT
CGCATTCACG ATCTGAGTCC GGACAGCTAC CTTCCGTCGC CCGAGACCGT GCTCGCGTTC
TACGAGCCCG GCTCGGCCGA GCGTTTCGGA CGCGCGGTCA AACGAGCCGC GCGCGGCGAG
GGCGGCTTCG ACATGGAGGT CGAGCTGCGC ACGCCCGCGG GCCGCTCGGT GTGGTGCCGC
AACATCGGGC ATATGGGCTT CGAGAACGGC GAGGTGGTGC GCGTCTTTGG CTCGTGCCAG
GACGTCACCG AGCAGCGCGC CATCCGCGAG GCCCAGCGCG AGAGCGAGCA GCAACTGCGC
TCGCTGGTGC GCGATGTCGG CATCGGCGTG ATGGTGCAGG GGCCCGAGGG CGAGATCCTG
CACTGCAACC GCGCGGCCCT CGACGCCCTG GGGTTGAGCG AGCACGAGGT CATCGGCATG
CCGGCGCCGC GGCTGCTGGC GTACGCCATC CACGAGGACG GCACGCCGCT GGCGCTGGGC
AGCGACCCGC TCAGCCGCGC ACTGGAAACC GGACAATCGG TCAAAGATGT CATCCTGGGC
GTGCCGCACA CCGGCCGCGA CGAGCCGGTG TGGCTGCTGG TCAACGTGGT CTCGCGCATC
GACGCCGCCG GCTCTCTGCG CTGGGCCGTG TGCTCGTTTG CCGATATTTC GGCGCGCAAG
CGGGCCGAGG ACACCGCCCG CGAGAGCGCG GCCATGTTCC GCGCGGTGTA CGAAAACGCC
GGCCTGGGCG TGCTCATGCG CGACATCGAC GGCGCCATCC TCAGCAGCAA CCCGACCTTT
TCGCGCATGC TCGGCTACAG CGCCAAGACC CTGCGCAGCA TGCCGCTCGA CGCCATGCTG
CACCCCGGCG ACCGAGAGCT GGGCGGGGAC GAGCACGAGG CGCTGCTCGC GGGCGAGCGC
GAGACCTACG AGGTCGACCG CCGGTACGTG CGCCGCGACG GCGAGATCGT GTGGGGACAC
GTGACGGTCT CGGTGGTGCG CGGGGCCAGC GACGAGCCGC AGTACGTGGT CGAGATGATC
GTCGATATCA CCGATCGCAA GCGCATGGAG GCGCAGCTCA TGTTGACCGA CCGCCTGGCC
TCGCTGGGCA CCATGGCGGC GGGCGTTGCC CACGAGATCA ACAATCCGCT CACCTGGCTG
ATGGGCAACG TGTCATACGT GCGCGAGAGC CTCGAGGAGC TGCGCGACGA GATCGCGCTC
GACGACGACA GCGCGGACGA CCTCGATAAG GCGCTGGCGG ATAGCCTGGT GGGCGCCGAG
CGAATTCGGA CTATTGTCCA GGATCTGAAG CTGTTCGCGC GCGATCGCGA GGACGAAGAC
GGTATCGCCG ATCTCGGCGA GGTCTTGCAC TCGACCCTGC GCATGCTGCG CAACGAGCTT
CATCACCGCG CGGTGCTGGA ACAGAAGGTC GGCGACGTGC CGCCCGTGGT CGGCGACCCC
GCGCGCCTGG GGCAGGTGTT CACCAATCTG CTGGTCAACG CCATTCACGC CTTGCCCGAT
CGCGATCGCG AGGAAAACCG CATCGAGATT CGCGGCGTGC GTTCGGGCCG CGGCGTGGTC
ATCGAGATTT CGGACAACGG GGTCGGCATG TCGCCCGAGA CCCAGGCGCG CATCTTCGAC
CCCTTCTACA CCACCAAGGA GGTCGGGCAG GGGACCGGGC TGGGGCTGTC GATCTGCCAC
AGCATCATCG CCCAGATCGG CGGACGCATC GAAGTCGACA GCGAGCTGGG GCAGGGGACG
ACCTTTCGCG TGCACCTGGC TCGCGCCCGG CGCGGCTCGA CCTCGGGCAT CGCGCTGACG
CTGATCGACG AGATGCCGAC CGAGCGCAAG AGCCTGCTGT GCATCGACGA CGAGCCGGAC
ATGGGGCTCA CCTTGAAGCG CATGCTGGGC AAGTATCACG ACATCACCTT CGAGACCGAT
GGCGAGCGAG CGCTGGAGCG CCTGCGCGAG GGCGAGCGCT TCGATGCCAT CATCTGCGAT
TTGATGATGC CGGGGATGAG CGGGCCGGAG TTCTATCACT CGCTGGGTGA GGTGGCGCCG
GAGCTGGTAT CGCATTGCGG CTTTGTCACC GGCGGGACCT TTACCCCGGC GACGCGCGCG
TTCGCGGAAG AGCAGCGCGG CTACCAGCTA CTGCAAAAAC CCTTTTCGCG CGAGGCGATG
TATATGTTCA TCGCCCATCT GACCGCGCGC TGA
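
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases among all A, C, G and T bases; `gc_fraction` is a hypothetical helper, and only the first line of the sequence is pasted in for brevity.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters of seq.

    Spaces and line breaks (as used in the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare the result with the reported GC content of the gene.
first_line = (
    "ATGCCGGTTC GTCATGAGCT GTTTGCCGCG GCCCTGTCCG CGGTAGACGC CGCCGTGGTG"
)
print(f"{gc_fraction(first_line):.1%}")
```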
 
Protein sequence
MPVRHELFAA ALSAVDAAVV MTDADDRIAW ANAAALALFD WDADGACGQS LSAVLRFQGP 
RPVLAPGESA AVCVLHGSAE ASRVVHGTMR SVAENSGERV GHVFVFTGQQ TYEARAKHWL
SQSSLLYALL GSNGYLHEVG TVWSERFAYP RAMLLERPLL ELVHEDDREA MARSLAQIAN
SDAACGAVET RLRRAQGGYR WLSWYMVYDA ENQCVHLSAQ DVSEVKRQER LLAETQGAAS
IGGWELDLHD TTLYWTDEIY RIHDLSPDSY LPSPETVLAF YEPGSAERFG RAVKRAARGE
GGFDMEVELR TPAGRSVWCR NIGHMGFENG EVVRVFGSCQ DVTEQRAIRE AQRESEQQLR
SLVRDVGIGV MVQGPEGEIL HCNRAALDAL GLSEHEVIGM PAPRLLAYAI HEDGTPLALG
SDPLSRALET GQSVKDVILG VPHTGRDEPV WLLVNVVSRI DAAGSLRWAV CSFADISARK
RAEDTARESA AMFRAVYENA GLGVLMRDID GAILSSNPTF SRMLGYSAKT LRSMPLDAML
HPGDRELGGD EHEALLAGER ETYEVDRRYV RRDGEIVWGH VTVSVVRGAS DEPQYVVEMI
VDITDRKRME AQLMLTDRLA SLGTMAAGVA HEINNPLTWL MGNVSYVRES LEELRDEIAL
DDDSADDLDK ALADSLVGAE RIRTIVQDLK LFARDREDED GIADLGEVLH STLRMLRNEL
HHRAVLEQKV GDVPPVVGDP ARLGQVFTNL LVNAIHALPD RDREENRIEI RGVRSGRGVV
IEISDNGVGM SPETQARIFD PFYTTKEVGQ GTGLGLSICH SIIAQIGGRI EVDSELGQGT
TFRVHLARAR RGSTSGIALT LIDEMPTERK SLLCIDDEPD MGLTLKRMLG KYHDITFETD
GERALERLRE GERFDAIICD LMMPGMSGPE FYHSLGEVAP ELVSHCGFVT GGTFTPATRA
FAEEQRGYQL LQKPFSREAM YMFIAHLTAR
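
The protein sequence corresponds to the translation of the gene sequence under translation table 11. The sketch below is an illustration rather than the database's own procedure: it builds the codon table by hand (table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons) and translates only the first and last blocks of the gene sequence. The `translate` function is a hypothetical helper.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last four blocks of the gene sequence shown above.
first_blocks = "ATGCCGGTTC GTCATGAGCT GTTTGCCGCG"
last_blocks = "TATATGTTCA TCGCCCATCT GACCGCGCGC TGA"
print(translate(first_blocks))
print(translate(last_blocks))
```

The two printed fragments can be compared with the first and last ten residues of the protein sequence.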