Gene Hoch_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0223 
Symbol 
ID8542602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp326567 
End bp328510 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content70% 
IMG OID646385019 
Producthistidine kinase 
Protein accessionYP_003264757 
Protein GI262193548 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGATT TCGAAGCGTC CCCGCCCGTG TTCGCAAGCT CCGGCCCCCG GAGCGTCCGC 
GCGAGCAGGC GTCGCCCGCT CATGGATGCG GACCAGAGCG GGGCCGAGGA GGCGTTCGAA
CTCGCGGCGG AGGAGCCGCA GGGGCCTGGC CCGGTGCGCG TGTTGGTCGC CGACGACAGC
GCCTCGATCC ACGAGGATTT CCGCAAGACC CTGCCGCCGC CGCGGCAGAG CGATCCTCTC
GACGAGCTGG AGGCGGTCCT GTTTTCGTCG TCGTCGTCTG CTGCCGCGAC CTCGCACTCT
GCGCCGCTGC CGAGCTTTGC CATCGACTCC GCCTTTCAGG GCGAAGAGGC GCTGGCCAAG
GTCGAGGCCG CGCGCGCCCG CGGCGAGCCC TACGCGCTGG GCTTCATCGA CATCCGCATG
CCGCCGGGCT GGGACGGCGT AGAGACGGCC CAGCGCATCC TGCGCGCCGA TCCGCACATT
CAGCTCGTGT TGTGCTCGGC GTATTCGGAC TATGCCTGGG AGGACCTGGC GGAGGCCGTG
GGCCACGGCG ATCGCGTCCT CATCCTCAAG AAGCCCTTTC GCACTATCGA GGTGCGTCAG
ATCGCCAGCT CGCTGGCCGC CAAGTGGCAG CTCGTGCGCG AGCGCGAACG CCAGATGCAG
GACCTCGAGG AGCGCGTGCA GGCGCGCACC GCCGAGCTGG CGCGCGCCAG CCAGGAGCTA
CAGCGCGAGA TGCTCGAGCG CGGCAAGGTC GAGGCGGCCC TGCGCACGGC CCAGCGCCTC
GAGGCGCTCG GGCGCATGGC GGCCGGCCTG GGCCACGAGA TCAACAACCC GCTCAACTTC
GTCTCGGGCA GCCTCGAGAT GCTCGACGCC GAGCTGATGC GCGTGCGCGG ACGTCTGCGC
GAAGACGAGT GGGCGCGCAT GGGCGAGATG CTGCACACGG CGGCCGCCGG CGTCGGCCGC
ATCGCCCAGA TCGTGAGCGG CATCCAGTTC TTCGACCGAC CGACCGAGAT CCAGCTCGAG
GTCGTCGATC TGTGGAAAGT GCTCACCTGG AGCGTCAAGA CCGTCGACGA TCGCCTCAGC
CCCGATCTCG AGCTGGTGCT CGACCTCGAC GACGTTCCCG CCGTGCTCGG CAAGCGCATC
GAGCTCGAGC AGGTCATCAA CCATCTGCTC GAGAACGCCA TCCAGGCCGT CGCCGCGGCG
CCGGCGCCCG CGTCTGGCCG CGAACACAGC GTGCGCGTGG CCGCGCGCTG CGAGCACGCG
CCAGGCGGGT CCGCGGGCGA GGTGGTCATC GAGATCGAGG ACACCGGCGA GGGCTTCCCG
GAGGGCGAGA TCGACAAGGT CTTCGAGCCC TTCTACACCA CGCGCTCGCC CGACCAGGGC
ACCGGCTTGG GGTTGGCGAT CTGCCGCACC ATCGTCACCG CGCTGGGCGG GAGCATCGCG
GCCGAGAACC CGAGCGAAGG CGGCGCGCTG GTCACTGTGC GGCTGCCGGC GGCGTCTCCC
GAGGCGATCC AGGCGGCCGC GGCCAAGCCC GCTGCGACAG CGCCCAAGCC GGTGCCGAAC
GGGCGCGCGC GGGTGCTGGT CATCGACGAC GAGCCGCTGA TGTTGCGCAT CATGAGCCAC
GCGCTGCGCG AGCACGAGGT GGTGACCGTG CAGAGCGCCG ACGACGCGCT GGAGCTTCTG
CAGCGCGAGG ATTTCGACAT CGTGTTCTGC GACGTGATGA TGCCGCGCAT GAACGGGCCG
CAGTTCTACG AGGCGCTGGC GCACCTGCAT CCGGGCCTGG AGCGGCGCAT CGTGTTCATC
ACCGGCGGCG CGCGCGACCC CGAGGCGCAG CGCTTCCTCG ACGGCCTCGA CAACGACTGC
TTGCAAAAGC CCATCCCGAC CGACCTGCTG CGCGCGCGCG TGGGCGAGAT GCTCGTGCGC
CTGGCCCGCA TGTCCGAAAA ATAG
 
Protein sequence
MSDFEASPPV FASSGPRSVR ASRRRPLMDA DQSGAEEAFE LAAEEPQGPG PVRVLVADDS 
ASIHEDFRKT LPPPRQSDPL DELEAVLFSS SSSAAATSHS APLPSFAIDS AFQGEEALAK
VEAARARGEP YALGFIDIRM PPGWDGVETA QRILRADPHI QLVLCSAYSD YAWEDLAEAV
GHGDRVLILK KPFRTIEVRQ IASSLAAKWQ LVRERERQMQ DLEERVQART AELARASQEL
QREMLERGKV EAALRTAQRL EALGRMAAGL GHEINNPLNF VSGSLEMLDA ELMRVRGRLR
EDEWARMGEM LHTAAAGVGR IAQIVSGIQF FDRPTEIQLE VVDLWKVLTW SVKTVDDRLS
PDLELVLDLD DVPAVLGKRI ELEQVINHLL ENAIQAVAAA PAPASGREHS VRVAARCEHA
PGGSAGEVVI EIEDTGEGFP EGEIDKVFEP FYTTRSPDQG TGLGLAICRT IVTALGGSIA
AENPSEGGAL VTVRLPAASP EAIQAAAAKP AATAPKPVPN GRARVLVIDD EPLMLRIMSH
ALREHEVVTV QSADDALELL QREDFDIVFC DVMMPRMNGP QFYEALAHLH PGLERRIVFI
TGGARDPEAQ RFLDGLDNDC LQKPIPTDLL RARVGEMLVR LARMSEK