Gene Hoch_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2040 
Symbol 
ID8544422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2815662 
End bp2816681 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content71% 
IMG OID646386743 
ProductCheB methylesterase 
Protein accessionYP_003266478 
Protein GI262195269 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG2201] Chemotaxis response regulator containing a CheY-like receiver domain and a methylesterase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.322604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.302776 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAGA CTTCTACCCA GCATGAGCGC GTGTTGTGCA TGGGCGCCTC GGCTGGCGGG 
TTGACGGCCT TGCGGCGGCT CGTCGCCGAG CTGCCGGCCG ACTTCCCCGC GCCCATCTGC
CTGGTCCAGC ACACGCCCTC CGATGGCCCG CGTCTGCTCG ACGGACTGCT CACCCAGGCC
GGCAAGCTCA GGGCGCGATT TGCCGAGGAT GGCGAGCCGC TCACGGCCGG CACCATCCAC
ATCGCCCCGC CCGGCATGCA CATGGTCATC GACGACGGCC GCTTGCGCCA CGTGCGCGGC
CCGCGCGAGA ACCTGGCGCG GCCGGCGATC GATCCGCTGT TCCGCTCCGC GGCGCTGCAT
TTCAAGCAGC AGACCATCGG CGTGCTGCTC AGCGGCATGC TCGACGATGG CGTGGCCGGG
CTCAGCGTCA TCGAGCGCTG CGGCGGCGTG GCCGTCATCC AGGACCCCGA GGACGCCGAG
GCAGCCGACA TGCCGCAGAA CGCGCTCGAC GCGATCGGCG AGCGGCTCGC GGCCGTGCTC
CCGGCCGATG CCCTGGGCCG CTATCTGCGC GAGCTGCGCA CCGTGGAGCC GCGCACCGGC
GCCAACTGCC CCGAGCACCT CGGCGCCGAG CACCGCATGT TCGTCGCCGC CTCGGGCATC
GATGTGGTGC CCATGATCGG CGACCCGGCG GCCCTGAGCT GTCCGACCTG CGGCGGGCCG
CTGTGGGAGA TGCCCGACGA AGACGTGCGC CGCTACCGCT GCCACGTGGC CCACGGCTTC
ACCACGCAGT GCCTGGGCGA GGAGCAGCGC ACGGGCATGG AAGAGGCGCT GTGGGCCGCT
GTCCGGACGC TCGACGAGCG CGTCAAGACG CTCGGCGTGA TGATCCAGGA CGCCGAGAAG
CGCGGCTATC GGCGCATCGT CGATATGTAT TCGGACGAGC GCAAGGAAGC CAAGCGCCAC
GCCGACGCGC TGCGCGAGCT GTTTCTCGGC AATCTGGACA AGTCGCCCAA GGGGAACTGA
 
Protein sequence
MSQTSTQHER VLCMGASAGG LTALRRLVAE LPADFPAPIC LVQHTPSDGP RLLDGLLTQA 
GKLRARFAED GEPLTAGTIH IAPPGMHMVI DDGRLRHVRG PRENLARPAI DPLFRSAALH
FKQQTIGVLL SGMLDDGVAG LSVIERCGGV AVIQDPEDAE AADMPQNALD AIGERLAAVL
PADALGRYLR ELRTVEPRTG ANCPEHLGAE HRMFVAASGI DVVPMIGDPA ALSCPTCGGP
LWEMPDEDVR RYRCHVAHGF TTQCLGEEQR TGMEEALWAA VRTLDERVKT LGVMIQDAEK
RGYRRIVDMY SDERKEAKRH ADALRELFLG NLDKSPKGN