Gene Hoch_3749 details

Gene Information

Locus tag: Hoch_3749
Symbol:
ID: 8546142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5153071
End bp: 5154861
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 65%
IMG OID: 646388419
Product: Sigma 54 interacting domain protein
Protein accession: YP_003268142
Protein GI: 262196933
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
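The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence listed on this page ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(5153071, 5154861)
print(length)                     # 1791
print(protein_length_aa(length))  # 596
```

Both values match the Gene Length and Protein Length fields of the record.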


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.552068
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.248061
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCAC TACGCGTACA AATTCCAGGG CATAGCCCCA CAGTCTTCCA CCTCTACAAG 
AAGATCACAT CGCTGGGCTC GGCGCCCGAG AACGACATCG TGCTGCCGGA TGCGCTGATC
CTCGACGCCT TCGCCCACAT CCTGTTCGAC GGCCAGACCT ACACCATCGC CACGCTGTCG
CGTCGCTCCG AGCTGGTGGT CAACGGCAAG AAGCGCAAGA AGCACAAGCT CAGCCACGAG
GACAAGCTGG TCATCGGCCC CATCGAGATG CGCTTCTCGC TGATCGACGC GCAGCCGCCG
ATCGAGGAGG AGGCGGCCGA GACCGTCGCC GACATCGACG CCTACCGCAA GCTCTACGAG
TTCTCGGCCC GGCTCATGGA GAAGCACGAT CTCGCCGAGC TGCTCGACAC CTTGATGGAC
ACGGTCATCG AGATCACCAA CGCCGACAAG GGCTTCCTGA TCCTCATGGA AGGCGAGCAG
ATGCAGGTCA AAGTGGCCCG CAATCTCAAG CGCGAGAACA TCGCCGACGC GGTCTCGCAG
CTCTCCGACT CCATCGTCGC CAAGGTGATC AAGACGCTCA AGGCGGTGAT TATTTCGGAC
GCCATGAACG ACGCCGAGTT CTCGGGCTCG AAGTCGATCA TGAAGCTCAA GCTCACCAGC
GTCATCTGCG TGCCCCTGCT CGACGGCGGC AAGCTCACCG GGCTCATCTA CGTCGGCAAC
GACTCGATCG TCGACCTCTT CCAGCCCGAC GCCATGCAGG CGCTCACCGT GTTCGCGGCC
CAGGCCTCGC TGATCATCGC CAACGCCCTG CTGCTCGACC ACCTACGGGT CGACAACCGC
CAGCTCAGCG AGCGCCTCGA GCAGATCCGC TTCGGCGAGA TCATCGGCAC CAGCGCGCCC
ATGCAGCAGG TCTTCAAGAA GGTCGAAAAA GTCGCCGGCA CCGACATCTC GGTGCTCATC
ACCGGCGAGA CCGGCACCGG CAAAGAGCTC ATCGCGCGCG AGATCCACGC CCGCTCGCCG
CGCGCCAAGG CGCCGTTCGT GACCATCAAC TGCGGCGCCA TCCCCGAGAA CCTGCTCGAG
TCCGAACTCT TCGGCCACGT CAAGGGCGCG TTCACGGGCG CGGTGGCGAG CAAGCAGGGC
AAATTCCAGG CCGCGCACGG CGGCACCCTG TTCCTCGACG AGATCGGCGA GATGCCGCTC
AACCTGCAGG TCAAGCTGCT GCGCGCCATC CAGGAGAAGA TCGTCATCCG CGTCGGCGAG
ACCCGGGCCG AGCCGGTCGA CATCCGCATC CTCACCGCCA CCAACCGCAA GCTCGAGGAC
GAGATCGAGT CGGGCACCTT CCGCGAGGAT CTCTACTACC GGCTCAACGT GGTCAATATC
CACCTGCCGC CGCTGCGCGC GCGCGAGGAG GACGTGGTCG TCATCGGCCG CTACCTGCTG
GGCCGCTACG CCCAGGACTA CGGGTCCAAG GTCAAGGGCT TCTCGCCCAA CGCCACGGTG
GCGCTGCGCA AGTATCACTG GCCCGGCAAC ATCCGCGAGA TGGAGAACCG CATCAAGAAG
GCCCTGGTGC TGGCCGAGAC CACGGTCATC GGCCCCGACG ACCTCGGCCT CTCGGCCGAC
GTGCTGCCGC GCATCCTCAC GCTCAGCGAG GCCAAAGACC GCTTTCAGCG CGACTACATC
AACGAGATCC TGGCGCTCAA CAGCGGCAAC CGCACCAAGA CCGCGCGCGA TCTCGGCGTC
GATCCGCGCA CCATCTTCCG CCACCTCGAG AAAGAGGCCT CGGATGCGTG A
 
Protein sequence
MPSLRVQIPG HSPTVFHLYK KITSLGSAPE NDIVLPDALI LDAFAHILFD GQTYTIATLS 
RRSELVVNGK KRKKHKLSHE DKLVIGPIEM RFSLIDAQPP IEEEAAETVA DIDAYRKLYE
FSARLMEKHD LAELLDTLMD TVIEITNADK GFLILMEGEQ MQVKVARNLK RENIADAVSQ
LSDSIVAKVI KTLKAVIISD AMNDAEFSGS KSIMKLKLTS VICVPLLDGG KLTGLIYVGN
DSIVDLFQPD AMQALTVFAA QASLIIANAL LLDHLRVDNR QLSERLEQIR FGEIIGTSAP
MQQVFKKVEK VAGTDISVLI TGETGTGKEL IAREIHARSP RAKAPFVTIN CGAIPENLLE
SELFGHVKGA FTGAVASKQG KFQAAHGGTL FLDEIGEMPL NLQVKLLRAI QEKIVIRVGE
TRAEPVDIRI LTATNRKLED EIESGTFRED LYYRLNVVNI HLPPLRAREE DVVVIGRYLL
GRYAQDYGSK VKGFSPNATV ALRKYHWPGN IREMENRIKK ALVLAETTVI GPDDLGLSAD
VLPRILTLSE AKDRFQRDYI NEILALNSGN RTKTARDLGV DPRTIFRHLE KEASDA
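The relationship between the gene sequence and the protein sequence can be reproduced with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acid to every codon as the standard code (the two tables differ only in which codons may act as start codons), so a single lookup table suffices here. The helper names `translate` and `gc_content` are hypothetical, and the sketch assumes an unspliced, forward-oriented sequence as listed on this page.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Codon-to-amino-acid assignments are identical for tables 1 and 11.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCCTTCAC TACGCGTACA AATTCCAGGG"))  # MPSLRVQIPG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field of the record.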