Gene Hoch_4114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4114 
Symbol 
ID8546516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5659890 
End bp5661431 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content70% 
IMG OID646388791 
Productserine/threonine protein kinase 
Protein accessionYP_003268505 
Protein GI262197296 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0600748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0166963 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCG CGGATTTAGG CCCTCGCTAT CACGTCATGT CGCTCCTCGG CGAGGGGGCG 
ATGGGGCAGG TGTACCTGGC GCGGCACAAG GAGCTCGGGC GCATCGAGGC CATCAAGGTG
CTCAAGCCGC AGGTGGCGCT CGACGAGCGC TTCGTGGCCC GCTTTCGCCG CGAGGCGCGC
GCGACCAACC GGCTGCAGCA CACCAACATC GTGGCCATGC ATGACTTCGG CCGGCTGCCG
GACGGGCGCT TCTACCTGTC GATGGAGTTC GCCGACGGCG AGTCGCTGTC GACCACCATG
ACCGAAGAGG GTCCGTTTCC GTTCGCGCGC GCGCTGCGCA TCCTCATGCA GCTCACGGCC
GCGGTCGAGC ACGCGCACTC GCGCGGCGTG GTCCATCGCG ACCTCAAGCC GGCCAACATC
ATGCTGGTCA AGCACCGCGG CGTCGGCGAT CTGCTCAAGA TCCTCGATTT CGGCATCTCC
AAGATCATCT CGCCCGACTA CAACGAGAGC ATCCTGGTGA GCCAGGACGG CATCGTCTTC
GGCACGCCCC TGTACATGGC GCCGGAGCAG TTCTATCGCC AGCCCAACGA CCCGCGCAGC
GACATCTACG CCATCGGCTG TGTCGGCTAC GAGCTGGTCA CCGGCTCGCC GCCCTTCACC
GGCAAGATCC CCGAGGTGGT GCGCGCCCAC GTCGAGAAGC CGCCGCCGTA CCCGAGCACG
GACGCGCCGC TCGGCGACGT GCCGCCCGAG TTCGACCATG TGATCACGCA CTGCATGGAG
AAGGCGCCCG GCCAGCGCTA CCAGAGCGCG GGTGAGCTGT TGCGCGACCT GGTGCAGCTC
GAGCCGACCT TTCTCGGCGG CCAGCGCCTC GACGACGGCA CCCTGGTGCC CGAGCTGGGC
GGCGGGCGAT TCGACGCCTC GCTCGACGGC GCGACCCTGG CCGTGACCAC GCAGATCGCC
GAGCCGCTGC TCGAGTCGTA CACCGATCTG GTGCTGGGCG CGGACGAGAC CGCCCAGGCC
GAGCGCGACG AGGCGCTGCG CGAGCTGGTC GAGTGCCTGC TCGACCACGG CGCCAACGAC
GCCCGGCTCA CCATCGGCCT GGCTGACCTC GACGGCATCG ATCACGACAT CGTGCAGTGC
GACACCCGCG TGCACGAGCT GCGCGGCCGC GAGGCCCGGG TCGAGCAGAG CACGCGCGAG
CGCGAGGGCC GGCTGCGCTT CGCCATCGGC GAGCTGATCT TCGATCGCGA CCAGGGCGCC
CACGAGCGCC GCGCCGACCT CGACTTCCAG ATCCGCGAGC TCGAGCAGCG TCTGGGCGAG
GCGGTGCGCG AGACCGAGGC CGAGCTGGTC CGCATCGGCG ACGAGACCAT CGCGCAGGTG
GCCGAGCGCG CCAACCTCGA GGAGCGCCGG CGCGCTCTGC TCACGCAGCT CGAGACCATC
GTCGAGGAGC TGGTGCCGCA CTTCGACGAC AACCTGGCCG TGGCCCCGTA CCTCGACCGC
TTTTTTTCCA TCCGCGACCG CGCCGACGCG CGCCACCTCT GA
 
Protein sequence
MEFADLGPRY HVMSLLGEGA MGQVYLARHK ELGRIEAIKV LKPQVALDER FVARFRREAR 
ATNRLQHTNI VAMHDFGRLP DGRFYLSMEF ADGESLSTTM TEEGPFPFAR ALRILMQLTA
AVEHAHSRGV VHRDLKPANI MLVKHRGVGD LLKILDFGIS KIISPDYNES ILVSQDGIVF
GTPLYMAPEQ FYRQPNDPRS DIYAIGCVGY ELVTGSPPFT GKIPEVVRAH VEKPPPYPST
DAPLGDVPPE FDHVITHCME KAPGQRYQSA GELLRDLVQL EPTFLGGQRL DDGTLVPELG
GGRFDASLDG ATLAVTTQIA EPLLESYTDL VLGADETAQA ERDEALRELV ECLLDHGAND
ARLTIGLADL DGIDHDIVQC DTRVHELRGR EARVEQSTRE REGRLRFAIG ELIFDRDQGA
HERRADLDFQ IRELEQRLGE AVRETEAELV RIGDETIAQV AERANLEERR RALLTQLETI
VEELVPHFDD NLAVAPYLDR FFSIRDRADA RHL