Gene Hoch_5384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5384 
Symbol 
ID8547796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7401368 
End bp7404697 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content71% 
IMG OID646390057 
Productserine/threonine protein kinase 
Protein accessionYP_003269761 
Protein GI262198552 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00425552 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.786393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGGAC GCGAGAACAG CGCTGTCGAC CCGCCTCGCG GCGAGGGCCG GGAGCCGCGT 
CGATCGAACC TCCACCGCGA CCCGGTCCGC GCCATGTCGG AGCGCGAGGG TTCGGGCCGC
GTGGCCCATC CTCACGCGCA CGAGGATGCA CGCAGCCGCA GCGGGCGCGC GCGTCTGGAC
ATGTCGCCGG GCGCCGCGCT CGGTCGCTAC GTCATCCTGC ACAGCCTCGG CGCTGGCGGT
ATGGGCGTCA TCTACAAGGC CTACGACACC GAGCTGGACC GCAACGTAGC GCTCAAGATC
CTGCGCATTC AGCCCGACAC CCGGCACTCG ATGGTCAACG CGCGCATCCG CTTGCAGCGC
GAGGCCCAGG CGCTGGCCCA GCTCTCCCAT CCCAACGTCA TCACCGTGTA CGACGTCGGC
ACCTTCGGCG AGCAGGTGTT CGTGGCCATG GAGCTGGTCG AGGGCCAGAC CCTGCGCGAG
TGGCTCGATG GCAAGAAGCG CACGCTGCCC GAGATCCTCG ACGTCTTCGT GTCCGCGGCG
CGCGGGCTCG CGGCCGCCCA CGAGGCCAAC CTGATCCACC GCGACTTCAA GCCGAGCAAC
GTGATGATCG GCAACGACGG CCGCCTGCGC ATCCTCGATT TCGGGTTGGC GCGCGGCACC
CAGCGCGGCC ACGTCGAGAG CGAGGAGGAC GCCTTCTCGG CGCTGGACAC GGGCTCGATG
GAGCAGTCGG GCGACGCCCT GCCCGCGGTC ATCACGGGCA ACTCCTCCGA GCGCCTGCTG
GGCTCGAACC TCACCGAGCT GGGCGCGGTC GTGGGCACGC CCGCGTACAT GTCGCCCGAG
CAGCACCTGG GCCGCGGCAT CGGCGCGCGC AGCGACCAGT TCGCCTTCTG CGTGACCCTC
TACCAGGCGC TCTACGGCCG CAAACCCTTC GCCGGCAAGA ACAGCGAGAG CATCAAGCGC
AAGGTCTTGG CCGGGCAGGT GATCGCGCCG CCCTCGGGCA CCCAGGTGCC GCGCTGGCTG
CACCGCATCG TGCTGCGCGG CCTCGAGGTC GACCCCGAGA AGCGCTATCC GTCGATGAAC
GCGCTGCTCG TCGACCTCGA CCGCGACCTG TCGCGCAAGC GCCGCTTCGC CGTGTTCGGT
GCGCTCGGCG CCGTGGTGCT CGGCGCGGGC GTGGCCGCGG CCCTGTACTT CAACCAGCAG
CGCAGCCAGC TCTGTCAGGG CAGCGAGACC CAGCTCGCGA GCATCTGGAA CCAGGGCGTG
GCCGATGAGA TCGCGACCTC GTTTTTGGGC AGCGGGCGCG CGCACGCTGC CGAGACCCAC
GAGCGCGTGG TGCGCCTGAT CGACGAGTAC GGCAACGAGT GGGTCGATGA GCGCACCGGC
GCGTGCGAGG CCACGCACAA GCGCGGCGAG CAGTCCGAGC AGGTGCTCGA TCTGCGCATG
GTGTGTCTAG CGAGCCGCCT GCGCCGTCTG GACGTCCTGG TCCACACCCT CGCGCACCAG
TCGGAGAGCA CCATCGTGGA CGACGCCATC GCCGCCGTGC TGGCGCTGCC CAGCATCGAT
AGCTGCGCCA ACGTCGCGGC CCTGGCGCGC GCCTACCCGC TGCCGACCAA CCCGGAAGAG
GGGCGGGCGG TGAAGAAGCT CGAGGCGCAG CTCGACCAGA TCGAGACCAG CCTCGACATC
GGTCAGGGCA GCAAGCTGCT CAATGAGGCC CTGGAGGTCA AGGCGCGCAG CGATGAGCTG
GCCTATCCGC CGGTTCAGAG TCGCGCGTAC CTGCTGCTCG CGGCCGTGCA GGAGCTGCAC
GACCAGTACG AGGCCGCCGA AGCCAGCCTG CACGCAGCCG CTCGCGCCGC CGCGCACGCC
CACGACGACG AGAGCACGGC GCGGGCGTGG ATCGACCTGG TCAAGATCAT CGGCTTCCGC
CAGGGCCGCG TGGACGACGC CCTGGCCGTG TTGCCCTTCG CCCGCACCGC GATCGTGCGC
GCCGAAGACG ACGCCCTGCT CGGCGCCCGC CTGGCGCGCA GTCTGGGCTG GGTGTACAGC
CTGTCGGGCG ACTACGGCAA AGCCCAGAGC GAGTACGAGC AGGGGCTCAG GATCCTGCAG
AACCGCTACG GCGCCAATTC GCTCGAGGCC ACGCGCTCGC GCCTGGCGCT GTCGGCGAGC
TACGGCCTGG CCGGCGTGCT GTCCGAGCAA GCCGAGTACG AGCAGGCGCT GGCGCACTAC
GAGCGCATCC GCGACATCGT CGAGACCCAC TTCGGCGCCG AGCACATGAT CATGGTGAGC
GTTCTCGGCC CGCTGGCCGA AAATCTCAGC CACATGGGCC GACTCGAAGA GGCTACCGTG
CAGTATCGGC GCGCGCTGGA CGTCGCGCGC AAGAACGGCG GCGGCGAGAA CGCGCGCGCG
GCCCTGCTGG AGAAGTTCGC CCAGCACCTG GTGCGCCAGC GCAAGCTCGA CGAAGCGCGC
GAGCGGCTCG AGCAGGCGCT CAGCCTGCGC ACCGATGTCC TCGGCCCGGG CCACGTGCTC
ACCGCCGAGA TCCTGCGCGG TATCGTGCAC GTGCTCACGC TGCAGGGCGC CTACGGCGAG
GCCGAAGATG TGCTCGAGCG CGCGGGCAAG ATCTTCACCA GCACGGTCGG CACCGGCCAC
CCTGGCTACG CCGGCTGGCT GGTGCTGCGC GGCGATCTGT TCGCGGCGCG CGAGCTGCTC
GAGCAGGCCC AGGCCGACTA CGAGGCGGCG ATCGAGTTGA TGGCGGCTTC GCTGGGGGAG
ACGCATCCGA TCACGGCCGA GGCGCTCACC GGCCTGGGCA CGCTGATGCA GCGTCGCGCT
CAGGAGGCCG AGTCGCTCGC CTACCACACG CAGGCCCGCG ACATCCTGGC CAAGATCTAC
GGTGGCACCC ACCTGCGCGT GGGCGAGGCC GAGCTCGAGG TCGCGCGCGG CCTGCTGCTG
CTCGGGCGCT ACGAAGACGC CCGCGCGTCG CTCGAACACG CGCTCGGTGG CTTCGAGGCG
GCGCGCGCGA CCCGGCCGGA GCTGTTGGCG CAGGCCTACA CCGGCCTGGC CGAGAGCGCG
CTCGGCGTGG GCGATCCCGC GGCTGCGCGC ATCCACGCTG AGCAGGCGCT GGCGCTGTGG
GGCCGGCGCG ACGGCGGCGA CAGCGGCGAT GCCTTCTGGG CGCGCTTTCT CCTCGCTCGC
GCCAAATGGG AGCTCGAACT CGACCGCGCG AGCGCTCGAC AGGCCGCCGA GGCCGCGCTG
ATGGATCTCC AGCAACGCGA GCAAACGGCG CGCAGCGCTG CGATTCGCGC GTGGTTGAGC
AAGAGCGCCG ACGCCGGCGC GCGCCGCTGA
 
Protein sequence
MGGRENSAVD PPRGEGREPR RSNLHRDPVR AMSEREGSGR VAHPHAHEDA RSRSGRARLD 
MSPGAALGRY VILHSLGAGG MGVIYKAYDT ELDRNVALKI LRIQPDTRHS MVNARIRLQR
EAQALAQLSH PNVITVYDVG TFGEQVFVAM ELVEGQTLRE WLDGKKRTLP EILDVFVSAA
RGLAAAHEAN LIHRDFKPSN VMIGNDGRLR ILDFGLARGT QRGHVESEED AFSALDTGSM
EQSGDALPAV ITGNSSERLL GSNLTELGAV VGTPAYMSPE QHLGRGIGAR SDQFAFCVTL
YQALYGRKPF AGKNSESIKR KVLAGQVIAP PSGTQVPRWL HRIVLRGLEV DPEKRYPSMN
ALLVDLDRDL SRKRRFAVFG ALGAVVLGAG VAAALYFNQQ RSQLCQGSET QLASIWNQGV
ADEIATSFLG SGRAHAAETH ERVVRLIDEY GNEWVDERTG ACEATHKRGE QSEQVLDLRM
VCLASRLRRL DVLVHTLAHQ SESTIVDDAI AAVLALPSID SCANVAALAR AYPLPTNPEE
GRAVKKLEAQ LDQIETSLDI GQGSKLLNEA LEVKARSDEL AYPPVQSRAY LLLAAVQELH
DQYEAAEASL HAAARAAAHA HDDESTARAW IDLVKIIGFR QGRVDDALAV LPFARTAIVR
AEDDALLGAR LARSLGWVYS LSGDYGKAQS EYEQGLRILQ NRYGANSLEA TRSRLALSAS
YGLAGVLSEQ AEYEQALAHY ERIRDIVETH FGAEHMIMVS VLGPLAENLS HMGRLEEATV
QYRRALDVAR KNGGGENARA ALLEKFAQHL VRQRKLDEAR ERLEQALSLR TDVLGPGHVL
TAEILRGIVH VLTLQGAYGE AEDVLERAGK IFTSTVGTGH PGYAGWLVLR GDLFAARELL
EQAQADYEAA IELMAASLGE THPITAEALT GLGTLMQRRA QEAESLAYHT QARDILAKIY
GGTHLRVGEA ELEVARGLLL LGRYEDARAS LEHALGGFEA ARATRPELLA QAYTGLAESA
LGVGDPAAAR IHAEQALALW GRRDGGDSGD AFWARFLLAR AKWELELDRA SARQAAEAAL
MDLQQREQTA RSAAIRAWLS KSADAGARR