Gene Hoch_4844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4844 
Symbol 
ID8547251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6627809 
End bp6629332 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content66% 
IMG OID646389517 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003269226 
Protein GI262198017 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0550144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.563731 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAA GTCTGTTGCA GCCACAAAGT CCGGACGAGC GGCGGAAACG CATGGAGTTG 
GTTCTGGAAG GGACTCGCCT GGGCATGTGG GATTGGAACC CGCAGACCAA CGAGGTGATC
TTCGACGAGC GCTGGGCGGC CATGCTCGGC CACTCGCTCG ATGACCTGGA ATTCACCTAC
GACGCCTGGT ACAGCCGCGT ACACCCGGAC GACGTCGAGG CGTGTCTGCG CGATATTCAG
GCCCATTTGA AGGGCGAGAC TGACTTCTAC GAGAACGTCC ACCGCATGCG CCACAAGGAC
GGGCACTGGG TGCACATCCT CGATCGCGGC CGCATCATGG ACCGCGATGA GCAGGGCCGA
CCCACCCGTT TTACCGGCGC GCATACCGAT ATTTCCGCGC AGCGCGAGGC GGAGCTGCGC
GCGCGTGAGC TGGCGCGGGC GCGGACGCAG TTTCTGGCGG TGATGTCGCA CGAGATCCGC
ACGCCGCTGC ACGGCATGCT GGGCATCACG CATCTGCTCA AGAAGACCGA GCTCTCGGAC
GAGCAGCAGC GTCTGCTCGA GATCGTCGAG AGCAGCGGCG AGAGCCTGCT GCTGGTCATC
AACGATATTC TGGATTTCGC CAAGGCCGAC GAGCGTCGGC TGTCGCTGTC GCCGCACGCC
TTCGAGGTGC GCGCGATGCT CACCGGCATC GCCAATCTGT TCGGGCCGCG GGCGAAGCAG
AAGGGGCTGC GCTTCTCGTG CACGGCGGCG CCCGGGTTGC AGGGCGCCGC GTGGGGCGAC
GGGCATCGGC TGCGCCAGAT CCTCATCAAC CTGGTGAGCA ACGCCATCAA GTTCACCGAG
CGCGGCGGCG TGGCCCTGAG CGCGCGCCGC GAGGGCGAGA GCTTGCATTT CGAGGTGGTC
GATACCGGCG TGGGCGTGGC CGATACCGAG CGCATCTTTC TCGCCTTCGA GCAGGAGGAC
GCCTCGATCA CGCGCCGCTA CGCGGGGACG GGCCTGGGTC TGGCCATCGT CCGTCTGCTG
GCCGAGCAGA TGGGCGGCGA GGTCGGCGTG TCATCGACGG TGGGCGAGGG CAGCCGCTTC
TGGCTGCGCG TGCCCATGCG CGAGACGCAG ATGCCGCAGA TGGAGGAAGT GAGCCGCGAC
CAGGTCGAGT CGTTGCCGGC GATGCGCGTG CTGGTCGCTG ACGACAACGC GATCAACCAG
ATGGTCATCC GCGGCATGCT CGCGGCCGGT GGGCATTTTT GCCAGACGGT GGACACTGGG
CGGCAGGCGC TGGCCTGCGT CGAGGACAGC GATTGGGACT GTATCTTCCT CGATCTGTAC
ATGCCCGACA TGGGCGGGGA AGAGGCTGCC GAGCGCATGC GGGCGGCCGG GGTGCGCACG
CGCATCGTCG CGGCTTCGGC CGATGCCAGC GTCGAGACCC AGGAGCGCTG CCGAGCCAAG
GGTATACAGG GCTTTCTCAG CAAGCCCTTC AAGCGTCTGC AGTTGCTCGA GGAGTTGCGA
CAGGCGCACG AAAGCGCGCC CTAG
 
Protein sequence
MSKSLLQPQS PDERRKRMEL VLEGTRLGMW DWNPQTNEVI FDERWAAMLG HSLDDLEFTY 
DAWYSRVHPD DVEACLRDIQ AHLKGETDFY ENVHRMRHKD GHWVHILDRG RIMDRDEQGR
PTRFTGAHTD ISAQREAELR ARELARARTQ FLAVMSHEIR TPLHGMLGIT HLLKKTELSD
EQQRLLEIVE SSGESLLLVI NDILDFAKAD ERRLSLSPHA FEVRAMLTGI ANLFGPRAKQ
KGLRFSCTAA PGLQGAAWGD GHRLRQILIN LVSNAIKFTE RGGVALSARR EGESLHFEVV
DTGVGVADTE RIFLAFEQED ASITRRYAGT GLGLAIVRLL AEQMGGEVGV SSTVGEGSRF
WLRVPMRETQ MPQMEEVSRD QVESLPAMRV LVADDNAINQ MVIRGMLAAG GHFCQTVDTG
RQALACVEDS DWDCIFLDLY MPDMGGEEAA ERMRAAGVRT RIVAASADAS VETQERCRAK
GIQGFLSKPF KRLQLLEELR QAHESAP