Gene Hoch_5638 details


Gene Information

Locus tag: Hoch_5638
Symbol:
ID: 8548052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7740365
End bp: 7741987
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 67%
IMG OID: 646390306
Product: histidine kinase
Protein accession: YP_003270008
Protein GI: 262198799
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR01145] ATP synthase, F1 delta subunit
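The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 7740365
END_BP = 7741987


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1623 540
```

Both values agree with the Gene Length and Protein Length fields of the record.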


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACA CCCACGAGGT CTTGGCGCTA GCGGACAAGG CACGTGCATC GGACGATATC 
GACGCCTACG TACGAGCGTC CGGCCACAGC TTGGCGGCCG TGGTCCACTC GCTCGACGAT
GTGCGTGCGC GTCTGGGTTT GGAGGCGCTC AGCGGCGAGG ATCCCGCGGT CCTGATCGTC
GAGGCGGGAG CCTGGGACGA GCGCGCTGGC TTCGCCGACG CGTTGGTGTC CTTGGCGCGT
GAGGAGGGCG TCCCCGTGGT CTGGTTGCAG GACGACGGGC ACGAGGCCCC CGAGGCATTT
GCCCACAGCG AGGTGGTCGT CGCGAGCTCG GCGCCGCGCG AGCTGGGGCT GGCGGTATCG
CTGGCCATGG CGCAGTGGAG CGCCGGCCGC GCGCAGCGCA CCAGCCGCGA GCTGGGCGAG
GCTCTGCGCC AGAGCGAGGG CCGCTACGAG TCGCTGTTTC ACGACGCGCC GGTGTTCTTC
TGGGAAGAGG ACGTGTCGGC CGTGGATCGC CACCTGGCCG CGCTGCGCGC CGAGGGCGTC
ACCGACATCG CCGCCTACGC CAAGGAGAAT CCCGAGGCGG TGATGGGCTG GGTGTTGCAA
AACGAAGTCG TCAACGTCAA CGAGGCCGCG CTGCGCGAGT TCGGCACCGA CAGCGTCGAG
GACCACCGCG ACAGCGTCGG CGAGACGCTG ATGCCCGACA TATTCCCGCA GATCATCGCG
GCCGTGGTCG CCTACGCCGA GGGCAAGCTG TACTGGCAAT TCGAGGGCCG ATACCGCAAC
AAGAAGACCG GCGTGCCCTT CGTCGCGCTG TGCCGCTTCG TATTTCCGCG TCCGGGCATG
ACCTCGCGGC GCATGATCAT GTGCGGCATC GATATCACCG AGCGCAAACG CTCCGAAGAG
GAGATCAGCG AGCTCAACCG GGCGCTGTCG GAGCGCGCGC ACGAACTCGA GGCCATCAAC
CACGAGCTCG AGGCGTTCAG CTACTCGGTG TCGCACGATC TGCGGGCGCC GCTGCGCGCG
ATCGAGGGCT TCAGCCGGCT GCTCTTCGAC CGCTATCACG ATCAGCTCGA CGAGCGCGGC
CAGAATTACC TCACGCGTGT GCGCGAGGCC GGTCAGCGCA TGAACCTGCT GATCGAGGAC
CTGCTCAAGC TGTCGCGCAT GTCGCGCAGC GAGATGCACC TCGAATCCTG CGACCTCAGC
GCCATGGCCG AAGAGACCAT CAGCAACCTG CGGCAAATTT CACCGGAGCG CGAGGTCGAG
GTGCTCATCG CGCCCGAGGT GCGCGCCAAG GGCGACCCGA CGCTGCTGCG CGCGGTGCTC
GAGAATCTGC TCGGCAACGC CTGGAAATTC TCGGCCAAGC GCGAACACGC GCGCATCGAG
TTCGGCGTCA CGATGGAAAG CGGACGGGTG TCCTATTTCG TACGCGACAA CGGCGCTGGC
TTTGATATGG CGTATCTCGG CAAACTATTC AACGCGTTTC AACGCCTGCA CACGGCCACC
GAGTTCGAGG GCACGGGCAT CGGCCTGGCT ACGGTGCAGC GCATCGTGCG TCGCCACGGC
GGCGAGGTCT GGGCCAAGGG CGAGATCGAC GTGGGCGCGA CCTTCGGCTT CTCGCTCGGC
TGA
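The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is analysed. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first line of the sequence is used as input here, so the printed figure is not expected to match the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, kept in its block-of-ten layout.
first_line = "ATGACGAACA CCCACGAGGT CTTGGCGCTA GCGGACAAGG CACGTGCATC GGACGATATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_percent` would give the value to compare with the GC content field.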
 
Protein sequence
MTNTHEVLAL ADKARASDDI DAYVRASGHS LAAVVHSLDD VRARLGLEAL SGEDPAVLIV 
EAGAWDERAG FADALVSLAR EEGVPVVWLQ DDGHEAPEAF AHSEVVVASS APRELGLAVS
LAMAQWSAGR AQRTSRELGE ALRQSEGRYE SLFHDAPVFF WEEDVSAVDR HLAALRAEGV
TDIAAYAKEN PEAVMGWVLQ NEVVNVNEAA LREFGTDSVE DHRDSVGETL MPDIFPQIIA
AVVAYAEGKL YWQFEGRYRN KKTGVPFVAL CRFVFPRPGM TSRRMIMCGI DITERKRSEE
EISELNRALS ERAHELEAIN HELEAFSYSV SHDLRAPLRA IEGFSRLLFD RYHDQLDERG
QNYLTRVREA GQRMNLLIED LLKLSRMSRS EMHLESCDLS AMAEETISNL RQISPEREVE
VLIAPEVRAK GDPTLLRAVL ENLLGNAWKF SAKREHARIE FGVTMESGRV SYFVRDNGAG
FDMAYLGKLF NAFQRLHTAT EFEGTGIGLA TVQRIVRRHG GEVWAKGEID VGATFGFSLG
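The protein sequence can be reproduced from the gene sequence by translation. The sketch below builds a codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Special handling of alternative start codons is omitted, which is an assumption that holds here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
gene_start = "ATGACGAACA CCCACGAGGT CTTGGCGCTA GCGGACAAGG CACGTGCATC GGACGATATC"
print(translate(gene_start))  # prints: MTNTHEVLALADKARASDDI
```

The output matches the first 20 residues of the protein sequence in the record.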