Gene Hoch_5333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5333 
Symbol 
ID8547745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7331646 
End bp7336700 
Gene Length5055 bp 
Protein Length1684 aa 
Translation table11 
GC content73% 
IMG OID646390007 
Productserine/threonine protein kinase with WD40 repeats 
Protein accessionYP_003269711 
Protein GI262198502 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.90192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA ACGACGGCGC CGCGGATACC AAGGACGCGT CCCCGGGCGA GGTCGCCCGG 
GCCGGGGCGC GGGTCTCTGG AGATCGCATC GACGGTAAGA CCGAGTACGG CGTAAACGCC
TCGTTGGCGG AAGCATCGAC CTTCATGTCG TCGTCGCCGA TGATGGCGGC GCTCGAGCCG
ACCATGACCG CGCCGATGCG GCGCAGCGGC TCCGGCGCGC GCGCGCGCAC GCCGCTCGAG
CCGATGGAGG CCTTTGGCCG CGGCTTTGGC CAATACGAGC TGATTCGTCC GCTCGGCCAC
GGCGGCATGG GCCAGGTCTT TCTCGGCCGC GACACCCGTC TGGGCCGCCT GGTCGCGCTC
AAGTTTCTGC GCTATCGCGA GGACGCTTCA GCCACGCGCT TCCTCGACGA GGCCCGGGCC
ACGGCGCGCT GCAACCACCC CCATATCGTC ACCATCTACG AGGTCGGCGA GCACCAGGGC
GCGCCCTACA TGGTGCTCGA ATACCTGCCG GGCCGAACCC TGCGCGAGTG GCTCACCCAG
CGCAGCGAGC GCCAGGGCAT TTTCGACCAG GCCGGGCACG CGGAGCGGCC GTTCGGCCTG
TCGCTGAGCT ACGCGGTCGA TCTCATGCTG CCGGTGGTGC GCGCGCTGGT GTACGCGCAC
GAGCGCGGCA TCGTGCACCG CGATCTCAAG CCCGAGAACA TCATGCTGGC CGAGGCCGGC
TCGCTCAAAG TGCTCGATTT CGGCATCGCC ATGGTCACCG AGGGCGCCGA GGCCGATGTC
AGCTTCGAGC AAGCCGAGGA CACGCCCGGC AGCGTGGCCA CCGCGGTCGA GGCCGGAACT
CCGTTCTACA TGGCGCCCGA GCAGTGGAGC GGCGACGAAG TCGATCACCG CTGCGACATC
TGGGCTGCGG GCCTGATCCT CTTCGAGCTG CTGTGCGGAG CCCATCCGCT GGCGCCGATC
ACGCGCGTCA AGCTGGTCAA CGTGCAGGAT CTCGACCTGC CCATGCCGCG GGTGCGCGAC
TTTCGCGACG ACCTCGGCAA GCTCGGGGCG ACCATCGACC GCTGTCTGAT CAAACGCCCC
GAGGACCGGC TGGGCAGCGC TAAAGAGCTG CTCGACGTGC TCGAGTCCAC GGTGCGGCCG
CACCGCGGCA GCGCCTACGA CCAGGACACG TCGCCGTATC CCGGCCTGGT GGCATACCAG
GAGAGCGACG CGGAGCGCTT TTTCGGACGC TCGCAAGCGG TTACCATGGT TGTCAACCGA
CTGTCCGAAA TTCCGCTCTT GGTGGTGCTG GGACCGTCGG GCGCCGGCAA GTCGTCGTTC
GTGCGCGCGG GCGTGGTGCC GGCGCTCGAC CGCACCGGCG ACGCCTGGGA ATCCTTCGTC
GTGCGTCCGG GAGCCCGGCC GGTGGCGGCG CTGGCTTCGC TGCTGCGGCG CCACGCCTGG
GACACCGGCA CCGAGGTCTC GGCCGGGGCG GCGCCGACCG AGCCGCTCGC GGGCGAAAAT
CCCATGCCGC CCGGCATCAG CTCAGAGGAG ATCGGCGCGC GTTTGCGGGC CGAGCCCGGT
TATCTGGGCG CGCAGTTGCG CGCGCGCGCC CGGCGCCGGC TGTCGCGGGT GGTGCTGTTC
GTCGATCAGT TCGAAGAGGT CTACACGCTG GCGTCGCTGG CCGAGCGCAA GGCGTTTTTC
GCCTGCCTGT CCGGCGTCGC CGACGACGTC GATGCGCCGC TGCGCGTGGT CATCGCGCTG
CGCTCGGACT TCCTCGACCT CACCACCGAC GCGCAGGCGG CGATCCCGGG CTTTCACCGC
GGCATCACGC TGCTGCCGCC CATGGACCGC CCGGCGCTGC GCGAGGCCCT GGTGCGCCCG
CTCGAGCCGC TGGCCTATCG CTTCGAGTCG TCCGAGCTGG TCGAGGATAT GCTCGACGCG
CTGGAGAACA CGGCCAGCGC GCTGCCGCTA TTGCAGTTCA CGGCCTCCAA GCTGTGGGAG
CTGCGCGATC CTCAGCAGCG CATGCTCACC CGCGGGAGCT ACGACACGCT CGGCGGCGTG
GCCGGCATTC TCGCCGTGCA CGCCGACGCG GTGCTGGCGA CCATGGCGGC GGGCGATCGC
GCTCTGGCCC GGGTGCTGCT GCTGCGCCTG GTGACGCCCG AGCGCACGCG CGCGGTGGTC
AGCATCAGCG AGCTCTACGA TCTGGCCGAG GGCACCTCGC GGCGCGGCGC GGAGATCGAC
AGTGTGTCCG AAGACGACGC GGGCGACGAG GGGGTGGCCG GCGAGCAGGC ACGCCCGGGT
GCCCGGGGCG TGCGCGCGCT CGGACCGGTG AGCTACGACG ACATCGACCG GGTGCTGGGC
CAGCTCGTCA ACGCCCGCCT GCTGGTGGTC GAGAATGACG ACATCGCCGC CAGCGCGGCC
GGGGGCGCGA CCGATATCTC GGTCGAGCTG GTGCACGAAT CGCTCATCGA CCGCTGGCCG
ACGCTGGCCA CCTGGGTGAG CGAGAGCTAC GATGACATGG CCTTTGTCGA GCGCCTGCGG
CGGGCGGCGC GCGAGTGGCG CGACCACGAG CGCGCGGACG AGCTTTTGTG GCAGGGCTCG
GCGGCCGAGC GGGCGTGGAC CTGGTATCAG CCCTACGCTG GCGAGCTGAC CCCGGTCGAG
CGCGAGTACA TCGAGGCGGT GCGGGCCCAC GCCGTGCGCG CGCAGCGGCA GCGGCGGCTG
TTCGTGGGCG GCGCGATCGC TGTGCTGCTA TTCTTCGCCG TGGCCATGAC CCTGCTGGCC
TGGCGCGAGC GGCAGGCCAA TCAGCTCGCG GCGCAGCAGG CCGAGCGCGC ACAGACCGAG
ACCGCGCGCG CGCGTGCGCA GGCGCGCATG GCTCGCGATG CCAGCCGCAT CGCGGTCGCG
CGCGAGCTCG AGGACGAGGA TCCGACCACG GTGCTGGCGC TGCTGCGCGA GGTCGAAGAG
CCGGAGCTGG CCCGCTCGTG GCCGATGCTG GTGAGTCGCG CGCTGCGCAC GGGTGTGGCC
CGCGCCGTGC TCAGCGGCCA CGAAGACCAG GTGTACGCGG CCGCCTTCAG CCCCGAGGGC
GAGCGCGTGG TCACCGCGGG ATGGGATGGC ACCGCGCGCA TCTGGGATGC CGATGGCGTC
GGCACACCGG TGGTGCTGCG CGGCCACACC GGGCGCATCA ACGCGGTGCA CTTCAGCCCC
GACGGCACCA GCGTGCTCAC CGCATCGGTC GATCACAGCG CCCGGGTGTG GAACGCCAAC
GGCGCGGGCG AGCCGCTGGT GCTCGAGGGC CACACCGATG AGGTGGTGAG CGCGGTGTTC
TCGCCCGACG GCGAGCGGGT GGCCACGGCG TCGGCGGACG GGCGCGCGCG GGTGTGGTCG
GTGCGCGCCG TGGTCGCCGG TCGCGCCAAG AGCGTGACCC TGCGCGGGCA CACCGGCCCG
GTGCGCGCGG TGGCGTTTTC GCCCGACGGC GAGCGGGTGG TGACGGCCTC GGCCGACGGC
ACCGCGCGGG TGTGGTCGGC CGACGGCACA GGCGCGGCCG TGGTCTTGCG CGGACACTCG
GACCAGATCC GCGCGGTGAG CTTTTCGCCC GACGGGGAGC GGGTGGTGAC GGCGTCGGCC
GACGGCACCG CGCGGGTGTG GTCGGCCGAC GGCAGCGGCG AGCCGGTGGT GCTGCGCGGG
CACCAGGGCT GGGTGGTGGA CGTGTGCTTC TCGCCCGACG GCGAGCGGGT GGCCACGGCG
TCCTTCGACA ACAGCGCGCG GGTGTGGTTG GCCGACGGCA GCGGCGAGCC GGTGGTGCTC
GCCGGTCACA CGCAGTCGGT GGCCTCGGTG CGCTTCTCGC CCGAGGGCGA GCGCGTGGTC
ACGGCCTCAT ACGACAAGAC CGCGCGCGCG TGGCCGGCCG ACGGTCTGGG CACCTCGGTG
CTGTTCCAGG GCCACGGCGG CCTGGTGCGA ACGGCGGCCT TTAGCGGCGA CGGCGAGCGT
GTGGTCACCG CGTCCGAAGA CGGCACGGCG CGGGTGTGGA AGGCGCGCGG GGTGCCGCAG
CCGCAGGTGG TGCACGCGCA CCAGGGCGCG GTGTACTCGA TGATGTTCAG CGCCGATGGC
GCGCAGCTCC TCAGCGCCTC GGCCGACGGC ACGGCGCGGC TGTGGCGCCT CGACGGCGGC
GACGCGCCCG TGGTCTTCGA GGGCCACGCC GGCGCGCTCA CCGGGGCGAT GTTCGACCCC
AGCGGAGAGC GTATCGTCAC CAGCTCCTTC GACAAGACCG CGCGGGTGTG GACCCTGGGC
AGCGACGCCG CGCCCGTGGT CCTCGAGGGG CACACCGGCT GGCTCAGTGA GGCGGTGTTT
TCGCCCGACG GCCGCTCGGT GGCGACGGCC TCCTCCGACG GCACCGTGCG GCTCTGGGAT
GCCGGCAGCG GACGCTCCAG CGCGGTGTTC CGCGGACACG CGGGCGAGGT CATGAACGTC
GGCTTCAGCC CCGACGGCGC GCGTCTGGTC TCGGCCTCGG CCGACCAGTC CGCGCGGGTG
TGGACGGTGG CCGAGCCCGA GGCGGAGCCG CTGGTTTTCG GCCATCCGTC CGTGGTCTAC
AGCGCCTCGT TCAGCGCCGA CGGCAGGTAT ATCGTGACCG CGGCCGACGA CGGCGTGGCC
CGGGTGTGGG CGGCAGACGG GCGCAGCCAG CCCAGGACCC TGCGCGGGCA CGCAGACAGT
CTCACCAGCG CCAGCTTCAG CCCCGACGGC CGGCGCGTGG TCACGGCCTC GCGCGACCGC
TCGGCGTGGA TCTGGGACCT CGAGGGCGAG GGCGCGCCGC TGGTCCTCGA CGGCCATCCC
GGCTGGGTCG GCCAGGCGGT GTTCAGCCCC GACGGTCGCC GGGTGGCGAC CTCGGCGTCG
GACGGTTCGA TCTGGCTATG GTCGGACGTC GCGCCGATCG CGGTGGACAC CGACGCGCTG
TGGCGGGCGA CGAGTTATTG CCTGCCCGTG GACGCGCGCC GCTCGCTGCT GAGCGCGGAC
GCCGAGAGCG CGGCCCAGGA TTTCTCCGCC TGCGTCGCCC GCGTCGCCCG CGTCGCCGAG
TCGGCTCGCG AGTGA
 
Protein sequence
MAENDGAADT KDASPGEVAR AGARVSGDRI DGKTEYGVNA SLAEASTFMS SSPMMAALEP 
TMTAPMRRSG SGARARTPLE PMEAFGRGFG QYELIRPLGH GGMGQVFLGR DTRLGRLVAL
KFLRYREDAS ATRFLDEARA TARCNHPHIV TIYEVGEHQG APYMVLEYLP GRTLREWLTQ
RSERQGIFDQ AGHAERPFGL SLSYAVDLML PVVRALVYAH ERGIVHRDLK PENIMLAEAG
SLKVLDFGIA MVTEGAEADV SFEQAEDTPG SVATAVEAGT PFYMAPEQWS GDEVDHRCDI
WAAGLILFEL LCGAHPLAPI TRVKLVNVQD LDLPMPRVRD FRDDLGKLGA TIDRCLIKRP
EDRLGSAKEL LDVLESTVRP HRGSAYDQDT SPYPGLVAYQ ESDAERFFGR SQAVTMVVNR
LSEIPLLVVL GPSGAGKSSF VRAGVVPALD RTGDAWESFV VRPGARPVAA LASLLRRHAW
DTGTEVSAGA APTEPLAGEN PMPPGISSEE IGARLRAEPG YLGAQLRARA RRRLSRVVLF
VDQFEEVYTL ASLAERKAFF ACLSGVADDV DAPLRVVIAL RSDFLDLTTD AQAAIPGFHR
GITLLPPMDR PALREALVRP LEPLAYRFES SELVEDMLDA LENTASALPL LQFTASKLWE
LRDPQQRMLT RGSYDTLGGV AGILAVHADA VLATMAAGDR ALARVLLLRL VTPERTRAVV
SISELYDLAE GTSRRGAEID SVSEDDAGDE GVAGEQARPG ARGVRALGPV SYDDIDRVLG
QLVNARLLVV ENDDIAASAA GGATDISVEL VHESLIDRWP TLATWVSESY DDMAFVERLR
RAAREWRDHE RADELLWQGS AAERAWTWYQ PYAGELTPVE REYIEAVRAH AVRAQRQRRL
FVGGAIAVLL FFAVAMTLLA WRERQANQLA AQQAERAQTE TARARAQARM ARDASRIAVA
RELEDEDPTT VLALLREVEE PELARSWPML VSRALRTGVA RAVLSGHEDQ VYAAAFSPEG
ERVVTAGWDG TARIWDADGV GTPVVLRGHT GRINAVHFSP DGTSVLTASV DHSARVWNAN
GAGEPLVLEG HTDEVVSAVF SPDGERVATA SADGRARVWS VRAVVAGRAK SVTLRGHTGP
VRAVAFSPDG ERVVTASADG TARVWSADGT GAAVVLRGHS DQIRAVSFSP DGERVVTASA
DGTARVWSAD GSGEPVVLRG HQGWVVDVCF SPDGERVATA SFDNSARVWL ADGSGEPVVL
AGHTQSVASV RFSPEGERVV TASYDKTARA WPADGLGTSV LFQGHGGLVR TAAFSGDGER
VVTASEDGTA RVWKARGVPQ PQVVHAHQGA VYSMMFSADG AQLLSASADG TARLWRLDGG
DAPVVFEGHA GALTGAMFDP SGERIVTSSF DKTARVWTLG SDAAPVVLEG HTGWLSEAVF
SPDGRSVATA SSDGTVRLWD AGSGRSSAVF RGHAGEVMNV GFSPDGARLV SASADQSARV
WTVAEPEAEP LVFGHPSVVY SASFSADGRY IVTAADDGVA RVWAADGRSQ PRTLRGHADS
LTSASFSPDG RRVVTASRDR SAWIWDLEGE GAPLVLDGHP GWVGQAVFSP DGRRVATSAS
DGSIWLWSDV APIAVDTDAL WRATSYCLPV DARRSLLSAD AESAAQDFSA CVARVARVAE
SARE