Gene Hoch_2452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2452 
Symbol 
ID8544839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3380724 
End bp3381824 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content73% 
IMG OID646387152 
Producthistidine kinase 
Protein accessionYP_003266881 
Protein GI262195672 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.11773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCCC AGCCAGTGAG CGAAGCGATA GAGCCCGGAC CCGGGTCCGG GCCGCTGGCG 
CGTGTGGCGC TGTGTCTCGA CAACGCGGCC GAGCGCGAGG CCCTGGCCCA GTGGCTGCAA
GACGCGCCCA ATCTCAACCT GGCCACCGAC GTCGAGGCGC CCGACACCGA CGTGGTCCTC
GCCGACCCGC GCGGCCTCGG CATGCACCGC GACCGCCTGA CCGCGCTCCG GGCCGAGCAG
TATCCCCGGG TGCTGCCCGT GCTCCTGCTG GTGCCCGCGA ACCAGCCGCT CGACGCCCTG
AGCGCCGACC TGCTCGAGCT GAGTGACGAT CTCGTGCGCG TGCCGGTCTC GCCCGCCGAC
CTGCGCTTTC GCCTCAAGAG CGCGCTGCGC ACCCGCGAGA TGTCGCTCGC GCTCAGCCAG
AGCATTCAGT TCGAGCAGCG CCTGGTCGGC GTGGTCGGCC ACGACATGCG CTCGCCGCTG
TCGGTGCTGA GCATGGTCGC CGACATGCTC GGCGACGCCG ACACCGAGCT GCCCCCCCAC
CTGCGCCGCC TGGGCGGGCG GGTGAAGCGC GCGGCCACCA CGCTCACCCA CCTGGCCGAG
GATCTCCTGC TGGTGGCGCA CGGGCGCTCG GGCGCGCAGT TGAAGCTCGA GCGCGTGCCC
TGCGAACTCG AGCCGGTGCT GGCCGACGCC ATCCAGCTCA CGGCCAGCAG CTCGCGGGTA
ACGCTGAGCA GCGTGGGCGA CTGCGCCGCC GAGGTCGACG CGCAACGCGT GCAGCAGGCG
GTCATCAACT TGCTGCAGAA CGCCCTACGC CACGGTACCC CCGACGGTGA AGTCGCCGTG
CACGTCGACG GCAGCGCGCC CGACAGCGTC GCCATCGCGG TGAGCAACGA CGGCTCGCTC
GGCGACGTCG CCCCCGAGCA GTTGTTCGAC TCGTTTCACC AGGGCGCCAA GGCCAGCGGC
GGCGGCGTCG GCCTCGGCCT GTACATCGTG CGCCACCTCG CGCGCGCCCA CGGCGGCACC
ATCGAGGCCC GCAGCGCCGA GGGCACGGTC ACCTTCACGC TGCATCTACC CAGGCAGGCG
CCTCAGGGCA CGGGCGCGTA G
 
Protein sequence
MGSQPVSEAI EPGPGSGPLA RVALCLDNAA EREALAQWLQ DAPNLNLATD VEAPDTDVVL 
ADPRGLGMHR DRLTALRAEQ YPRVLPVLLL VPANQPLDAL SADLLELSDD LVRVPVSPAD
LRFRLKSALR TREMSLALSQ SIQFEQRLVG VVGHDMRSPL SVLSMVADML GDADTELPPH
LRRLGGRVKR AATTLTHLAE DLLLVAHGRS GAQLKLERVP CELEPVLADA IQLTASSSRV
TLSSVGDCAA EVDAQRVQQA VINLLQNALR HGTPDGEVAV HVDGSAPDSV AIAVSNDGSL
GDVAPEQLFD SFHQGAKASG GGVGLGLYIV RHLARAHGGT IEARSAEGTV TFTLHLPRQA
PQGTGA