Gene Hoch_5359 details

Gene Information

Locus tag: Hoch_5359
Symbol:
ID: 8547771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7373112
End bp: 7374338
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 72%
IMG OID: 646390032
Product: histidine kinase
Protein accession: YP_003269736
Protein GI: 262198527
COG category: [T] Signal transduction mechanisms
COG ID: [COG2205] Osmosensitive K+ channel histidine kinase
TIGRFAM ID:
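
The coordinates and lengths listed above are mutually consistent. The short Python sketch below checks the arithmetic. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the record itself does not state either convention.

```python
# Values copied from the record above.
start_bp = 7373112
end_bp = 7374338

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1227 408
```

Under these assumptions the computed values match the reported gene length (1227 bp) and protein length (408 aa).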


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCAG CGCCAAGCAA GCACCGCCCC GATTCCCAGT CCGCGCGTCC GAGTCGCCTC 
GCGCTGGTGT GCGACGCGCG CATTCCCGGC CGGATCACCG ATATCGCGTG CGACGAACGT
CCCGTGCTCG TCGCGCGCCC CGACCTCGAC GCCGACGCCG ACAGCGAGCG CATCCACGGC
CGCATGCTCA GCCGTTTCGT CGCGCCCGGC GGGATTCAGC ATCTATCGCA TCTGCTCGCC
CACGTGGCCA AACACGGCCA GGTCTGCGAG TGGCTGCTCG ACCTGCGCAG CGCCGCCGGA
CCGGTGCCCT ATCGCCTGTG GGTCGCCCAG GACGACGAGG CGCTGCTGGT CGTCGGCTCG
GCCGATACCG AGGCAAGCTT CGCCGGCATC GCCGCCGCCG TCGCGCCCGC GGTGGCAGCC
GTGAGCGCGC CCTCGGCCGC CGCGCTGCTC GAGCGCCTGC GCCGCCGCGC CGATGATCCG
CGGGCGCTGC TCGACGATCT GCGGGCGCAG ACCCGCGGGC TCCACGGCCA GGTGCAACCG
CCGCCCAACC TCAACAATCG GCTGCTGCGC ATGGCGGCCC ACGACCTGCG CAATCCCCTC
CTGGTCCTGT CCATCGGCTG CTCCTTCCTG CTCGCCGACG CCGACGGCCT CAGCGAGGAA
CACCGCAGCT TGCTGCAGGA GAACCTCGAC ACCTGCGACT TCATGAACCG CCTGATCGAC
GGCATCGTCG ATCTCGCTGA GGTGTCCAGC GGCCATCTGC AGCTCCAACG GGCGCCCGCG
GATCTGTACC GCCTGCTGGT CTCCGCGGTC GAACGACACG GCAGCCTGGC GCGCGAGCGC
GGCAGCACAC TCACCTGCGC TCCCACCTCC GATGCCGTGC ATCTGCCCGT GGACGAGGGC
CGCATCTCGC AGGTTTTCGG ACAGCTCATC TCCAACGCGC TGCTGCACTG CCCGCCCGGC
ACCAAGGTGC ACGTGGACAT CGCCTGCGGC GAACGCGAGG TCACGGTCAC CGTGAGCGAC
GACGGCCCCG GCATCCCCAC CGCGGTGCGC GAGCGGCTGT TCCGGCCCTT CGGCAAACCC
CACGGCGAGG TCGCGCCCAA GCACTACGGC GCCGGCGTGG GCCTGGCCGT GGCGCGTCGC
ATCGTCGAGG GCCACGACGG CTGCCTCGAC CTCATCTCCT CGGCCGACGC CGGCACGCGC
TTCCGCGTCC GCCTGCCGCG CACCTGA
 
Protein sequence
MTSAPSKHRP DSQSARPSRL ALVCDARIPG RITDIACDER PVLVARPDLD ADADSERIHG 
RMLSRFVAPG GIQHLSHLLA HVAKHGQVCE WLLDLRSAAG PVPYRLWVAQ DDEALLVVGS
ADTEASFAGI AAAVAPAVAA VSAPSAAALL ERLRRRADDP RALLDDLRAQ TRGLHGQVQP
PPNLNNRLLR MAAHDLRNPL LVLSIGCSFL LADADGLSEE HRSLLQENLD TCDFMNRLID
GIVDLAEVSS GHLQLQRAPA DLYRLLVSAV ERHGSLARER GSTLTCAPTS DAVHLPVDEG
RISQVFGQLI SNALLHCPPG TKVHVDIACG EREVTVTVSD DGPGIPTAVR ERLFRPFGKP
HGEVAPKHYG AGVGLAVARR IVEGHDGCLD LISSADAGTR FRVRLPRT
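
The relationship between the gene sequence, the GC content and the protein sequence can be illustrated with a small Python sketch. It is not the pipeline used to produce this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 for internal codons (table 11 differs in the start codons it permits). Only the first 30 bases of the gene sequence are used as input here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGACCTCAG CGCCAAGCAA GCACCGCCCC"
print(translate(prefix))  # MTSAPSKHRP
```

The ten residues produced from the prefix agree with the start of the protein sequence above. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content reported in the record.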