Gene Hoch_4724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4724 
Symbol 
ID8547131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6453387 
End bp6455711 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content69% 
IMG OID646389398 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_003269107 
Protein GI262197898 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.960478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCC ACTCCGACGA CTCGTCGAGG GCGAACGCGT CCGCCGTATG GCAAGACCCG 
CGCGTCGACC TCAGCAACTG CGAGCGCGAG CCCGTGCACA CGCCGGGTCA TGTCCAGCCC
CACGGCGCCC TGCTGGCCGC CGATCCCAAC ACGCTGCTCA TCGAGCAGGT CAGCGACAAC
CTCGAGGCGC TCGGGCTGTT TTCCGGCGAG CTGTACGGAA AACCGCTGGA AGTCGTCCTC
CCCCCGGTGG CGGTCTCGGC CCTGCAGCGG CGCATCCTCG ACAACTCGCT CGACGGCCAG
GCGCGCTACG TGTACCGCTG GGAGCGCGAG GCCGAAGGCT CGGTCGACGT TTTGGCCCAC
GTGTACAAAG ACGTCTTGAT GGTCGAGCTG GAGGTGGGCG AGCCGACCTT TTTCGACCAG
CTCCCCAGCC ACGAGATCTT CAACTCGACG CTCGAGGCCT TTGAGCGCAC CACCACGGTG
CAGGCGCTGT GCGACTGCGC GGCCGACGAG TTCCGCTCGC TCACCGGCTA CGACCGGGTG
ATGATCTACC GCTTCGGCGC CGACGACAGC GGCCACGTGT TGGCCGAGAG TTCGGCGCCC
GAGCTCGAGC TCGAGTCGTA TCGCGACCTG CACTATCCGG CCTCGGACAT CCCGCGCCAG
GTGCGCGCGC TGTTCCTCGA GAAGCGCCTG CGCCTGCTCG CCGACAACCG CTACCAGCCC
GCCTTCATCA CCCCCGAGGT CAACCCGCGC ACCGGCAAGC TGCTCGACAT GAGCTTCGGC
GTGCTCCGCG GCTCCTCGGT CATGTACACC GAGTACCTCG AGAACATGGG CGTGCGGGCC
TCGCTCACCG TGGCCATCGT GCAGGAGAAC CGGCTCTGGG GTCTGGTCGC CTGCCACCAC
TACCGCGGCC CGCGCCACCT GCCCTTCGAC ATGCGCACCA CGGCCGAGTT CCTGGGCCGG
GCGCTGGCGC TGCAGATCTC GCACAAAGAG AAGCTCGAGC AGCGCCAGGA GCGCAGCCGC
ATGGAGGCCC AGCTCCACGA CATCGAGACC CACCTCTCGC CCAACGCCAC CTTGCTCGAG
GCGCTCACCG ACAGCGAGCC CGGCGTCGTC GGCCTGATCG CCGACAGCAC GGTCGTGGTC
GTGGTCGAGG GCGCCATCAA GACTTTCGGC ACCGAACTGC CGACCAGCCT GCTCCAGGCG
CTGTGCGGCT GGCTCACGCA GATCGGCGCC AGCGACGTCT ACGCCACCGA CTCGCTCACC
GCGGCCGGTT TCCCCAAAGC CGAGTCCATC CGCGAGCCCG CCAGCGGCCT CATGGCCATG
CCCATCGCGC GCAGCGAAGG CGAGTGGCTG CTGTGGCTGC GGCCCGAGCA GAACATGGTG
GTCAACTGGG CCGGCGACCC CACCAAGCCG GTGCTCTCGG GTCCCCACGG CGACCGGCTG
ATGCCGCGCA AATCCTTCGC CCTGTGGGAG GAGATCGTGC GCGGCCGCTC GCAGCCGTGG
ACGCCGCTCG AGCTCGAGAT GGTGCGCCGG CTGCGCAACG CGGTGGCCAC CGCGGCGCTG
CAACGCGACG CCCAGCTCCG CCGCCTCAAC GCCGAGCTGG CGCGCAGCAA CGAGGACCTC
GACGCCTTCG CCTACGTGGC CTCGCACGAC CTCAAAGAGC CGCTGCGCGC CATCGCCAAC
TACGCCGGCT TCCTCACCCA GGACTACGGC GAGCGCCTCG ACGACGAAGG CCGCGACATG
CTCGAGGCGC TGGTCCGGCT CAGCGATCGC CTGCGCCAGC TCATCGACTC GCTCCTGCGC
TTCTCGCGCA TGGGACGCGC CGGCATCCAC ATCACCGACT TCCCGCTCAC CCAGGTGATC
GACGAGATCC GCGAGGACCT GGTCGAGCTC ATCCGCAGCA AGCACGCCGA CATCCAGATC
GAAGGCACCC TGCCGCGCAT CCACGCCGAC CGCGCGCTGG CCCGCGAGAT GCTGTCGAAC
CTGATCTCGA ACGCCATCAA GTACACCGAC CGCGCCACGC CGCAGGTGTT CCTCAGCTAC
GAGCCCCCGG GCTCGCCCGA GCTGCCGGCC GAGGCCGGCG GTCGCGGCTG TATTCTGGTG
CGCGACACCG GCATCGGCAT CCCCGAAGAA CACCACGAAG ACGTCTTCCA GATCTTCCGC
CGCCTGCACC CGAGCAACGC CTACAGCGGC GGCACCGGCG CCGGCCTCAC CATCGTGAGG
CGCATCGTCC AACTGCACGA TGGCTGGGTC GGCTTCATCT CTCACCCCGG CCGGGGGACG
ACCTTCTTCG TGTCCCTGGC CCCGCAGCCC GAGGAGGCCG AATGA
 
Protein sequence
MNSHSDDSSR ANASAVWQDP RVDLSNCERE PVHTPGHVQP HGALLAADPN TLLIEQVSDN 
LEALGLFSGE LYGKPLEVVL PPVAVSALQR RILDNSLDGQ ARYVYRWERE AEGSVDVLAH
VYKDVLMVEL EVGEPTFFDQ LPSHEIFNST LEAFERTTTV QALCDCAADE FRSLTGYDRV
MIYRFGADDS GHVLAESSAP ELELESYRDL HYPASDIPRQ VRALFLEKRL RLLADNRYQP
AFITPEVNPR TGKLLDMSFG VLRGSSVMYT EYLENMGVRA SLTVAIVQEN RLWGLVACHH
YRGPRHLPFD MRTTAEFLGR ALALQISHKE KLEQRQERSR MEAQLHDIET HLSPNATLLE
ALTDSEPGVV GLIADSTVVV VVEGAIKTFG TELPTSLLQA LCGWLTQIGA SDVYATDSLT
AAGFPKAESI REPASGLMAM PIARSEGEWL LWLRPEQNMV VNWAGDPTKP VLSGPHGDRL
MPRKSFALWE EIVRGRSQPW TPLELEMVRR LRNAVATAAL QRDAQLRRLN AELARSNEDL
DAFAYVASHD LKEPLRAIAN YAGFLTQDYG ERLDDEGRDM LEALVRLSDR LRQLIDSLLR
FSRMGRAGIH ITDFPLTQVI DEIREDLVEL IRSKHADIQI EGTLPRIHAD RALAREMLSN
LISNAIKYTD RATPQVFLSY EPPGSPELPA EAGGRGCILV RDTGIGIPEE HHEDVFQIFR
RLHPSNAYSG GTGAGLTIVR RIVQLHDGWV GFISHPGRGT TFFVSLAPQP EEAE