Gene Hoch_4058 details


Gene Information

Locus tag: Hoch_4058
Symbol:
ID: 8546459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5571894
End bp: 5573552
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 74%
IMG OID: 646388735
Product: serine/threonine protein kinase
Protein accession: YP_003268450
Protein GI: 262197241
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
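
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5571894
end_bp = 5573552
gene_length_bp = 1659
protein_length_aa = 552

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not
# counted in the protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(gene_length_bp % 3 == 0)                    # True
print(expected_protein_aa == protein_length_aa)   # True
```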


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.229235
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCCGCA AGGGCTTTGC CCGCTGTCCG CTCGACGGCG CGCCGCTGTC GCCGCTCAGC 
GCCGATCCCC TCGAGGGCAC GGTCTTCGCC GATCGCTACG TGATCGAGGA GTGCGTGGGC
GAGGGCGGCA TGGGCCGCGT GTACCGCGCC AGCCACCGCC ACATGAGCCG GCAGTTCGCG
GTCAAGGTGC TGTTCGGCGA GCACGCGGCC GAGCAGGAGA TGCAGGCGCG CTTCGCCCGC
GAGGCCGAGG CCGCCTGCCG CCTCAGCCAC CCCAACGTGG TCACCGTCAT CGACTTTGGC
GAGACCGAGC AGGGCCTGTT CTACCTGGTC ATGGACTGGG TCGAGGGCCG CCGGCTCACC
GACCTCATCA CCCACGAGGC GCCCATGCCG CGCGAGCGCG TGGCCGACAT CACCCGGCAG
ATCCTCGAGG GCCTCACCCA CGCCCACGAG CAGGGCCTGG TGCACCGCGA CCTCAAGTCC
GAAAACCTCA TCGTCAGCGT CGAGAGCGAG CGCGAGCTGG TGCGCATCGT GGATTTCGGA
CTCGCGGTCG GCGTCGAGTC GGAGAGCGAC GCCCGGCTCA CGGCCGAGGG CACGGTCTAC
GGCACGCCCG CGTACATGTC GCCCGAGCAG GCCTGCGGCA TGGCGCTCGA CGGCCGCACC
GACCTGTTCA GCCTGGGCGT GCTGGTCTAC GAGATGCTCT GCGGCGAGCT GCCCTTCACC
GGCGTGCCCA TGGACATCGT GCGCCAGAAC ATGAACGACG AGCCGCCGTG GATCGACGCC
CGCGTGCCCG GCCTCAGCGT CGACCGCAGC CTCGAGAGCC TGGCCCGCAA GCTCATGGCC
AAGCGACCCC AGGACCGCTA CCAGAGCGCG CGCGAGGTGC TCGCCGTGCT GGCCGAGCTC
AACGGCGCCT GGACCCGGCC CGGCAGCATG CGCGAGCGCG CGCGCCACGA CACCGTGGTG
GCCGGCGGCC CCTCGCGTCG GCGCTGGCCG GTGATCGCGG GCGTCGCCGG CGTCCTGGTG
CTGCTCAGCG GCCTCGGCTT CGTGCTCAGC TCGATGTCGC CAGCCGGCGA GGGCGAGGCC
GTGGTCGCCC GAGCGGCTGA CGGCGCCGGG GGCGAGCCCG CGGCCGCCAT GAGCATCGCC
GAGGCAGCGG CGGCCGAGGC AGCGGCGGCC GAAGCGGCCA AGGCTGCAGC GGCGGCAGAG
GCGGCCGAAG CCCTGGCCGC GGCCAACGCC GGCGAGCCCG CGGCTACCGC CGCCGATGCC
GCCAGCGCTG AAGCCGCCAG CGCCGATGCC GCCAGCGCCG ATGCCGACGC GGATGCCAAC
GACGACGAAG CGCGCGAGGA GGCCCGCCGC CGCCGCCGCG CTCGTCGCGA GCGGGGGCGG
GCCGAGGGCG AGGCTGAGGG CGAGGCCGAG AGCGGGCGCG AGGCGCCGGC CGAGCGATCC
GGCCACACCG CGGCCGAGGT GTCCAAGCTC TACACCCAGG TCGGCGCGCT GGTGGACAAG
CTGGCCAAGA CCCGCGACGA CGCCGCCGCC AAGGCGCTGG TGGCCGAGTA CTTCGCCATC
CCGGCCGGCT CGGCGCAGCT CAACGCGGCC CTGCGCGAGG ACGTCTGGCG CCGGCTCACG
CGCCTGCGCG CCAAGGTCCG CAGCGCGCTC GCGCGCTGA
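
The reported GC content can be recomputed from the gene sequence. The helper below is a minimal sketch: `gc_content` is a hypothetical name, and the function simply strips the whitespace used for 10-base grouping before counting G and C. Running it on the full sequence gives a value to compare with the 74% in the record.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove spaces and line breaks used for 10-base grouping.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)

# First 10-base block of the gene sequence: 3 G + 3 C out of 10 bases.
print(gc_content("GTGTTCCGCA"))  # 0.6
```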
 
Protein sequence
MFRKGFARCP LDGAPLSPLS ADPLEGTVFA DRYVIEECVG EGGMGRVYRA SHRHMSRQFA 
VKVLFGEHAA EQEMQARFAR EAEAACRLSH PNVVTVIDFG ETEQGLFYLV MDWVEGRRLT
DLITHEAPMP RERVADITRQ ILEGLTHAHE QGLVHRDLKS ENLIVSVESE RELVRIVDFG
LAVGVESESD ARLTAEGTVY GTPAYMSPEQ ACGMALDGRT DLFSLGVLVY EMLCGELPFT
GVPMDIVRQN MNDEPPWIDA RVPGLSVDRS LESLARKLMA KRPQDRYQSA REVLAVLAEL
NGAWTRPGSM RERARHDTVV AGGPSRRRWP VIAGVAGVLV LLSGLGFVLS SMSPAGEGEA
VVARAADGAG GEPAAAMSIA EAAAAEAAAA EAAKAAAAAE AAEALAAANA GEPAATAADA
ASAEAASADA ASADADADAN DDEAREEARR RRRARRERGR AEGEAEGEAE SGREAPAERS
GHTAAEVSKL YTQVGALVDK LAKTRDDAAA KALVAEYFAI PAGSAQLNAA LREDVWRRLT
RLRAKVRSAL AR
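
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation is given below, without external libraries. Table 11 uses the same codon-to-amino-acid assignments as the standard code but permits alternative start codons such as GTG, which is read as methionine at the first position. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are illustrative assumptions, not part of the record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which table 11 shares).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

def translate_cds(seq: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        # An alternative start codon at position 0 is read as Met.
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("GTGTTCCGCA AGGGCTTTGC CCGCTGTCCG"))  # MFRKGFARCP
```

Passing the full gene sequence to `translate_cds` gives a result that can be compared directly with the protein sequence listed in the record.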