Gene Hoch_3789 details

Gene Information

Locus tag: Hoch_3789
Symbol:
ID: 8546182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5207914
End bp: 5209605
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 73%
IMG OID: 646388459
Product: serine/threonine protein kinase
Protein accession: YP_003268182
Protein GI: 262196973
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
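
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and if the final codon is a stop codon that is not counted in the protein length. The record does not state either convention, so both are assumptions here. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(5207914, 5209605)
print(length_bp)                  # 1692
print(protein_length(length_bp))  # 563
```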


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.365732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0988778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGCG AAAAACTCGG CAGCTATCGC GTGGTGCAGA AGATCGGCGA GGGCGGTATG 
GGCGCCGTCT ACGTCGGCGA GCACGAGCTG CTGTCGCGCA AGGCGGCGAT CAAGGTGCTG
CTGCCGATGC TGTCGAATGA CGAGGAGCTG GTGCAGCGCT TCTTCAACGA GGCCAAAGCC
GCCACCGTCA TCAACCACGC CGGCATCGTC CAGGTGTTCG ACTTCGGCAC CACGGCCAGC
GGCCAGGCCT TCCTGGTCAT GGAGTTCCTC GAGGGCGAGG CGCTCGAGCA GCGTCTGGAA
CGGCTCGGCC GCCTGCCCGT GGCCGACGCG CTGCGCATCG TGCGCCACTG CGCGGGCGCC
CTGCACGCGG CCCACGAGGT CGGCATCGTC CACCGCGACC TCAAGCCCGA CAACATCTTC
CTGGTGCCCG ACCCGCAGGT CGAGGGCGGC GAGCGCTCCA AGATCCTCGA CTTCGGCATC
GCCAAGCTCA CCGATGAGCG CCAGAGCGGC TCGGTCAAGA CCCGCACCGG CAGCCTGATG
GGCACGCCCA CCTACATGGC GCCCGAGCAG TGCCGCGGCG CCGGCGAGGT CGACCTGCGC
GCCGACATCT ACTCGATGGG CTGCGTGCTC TTCCACCTGG TGAGCGGGCG GCCGCCCTTC
GTGGGCGCCG GCGTCGGCGA GGTGCTGGCC GCTCACCTGC GCGAGCCCGC GCCCCCGGTG
CGCGCCTTTG CGCCCGACGT GCCGCCCGCG GTCGAGGCCC TGATCGCGCG CACCTTGGCC
AAAGAGCCGG GCGATCGCTT CCCCGACATG AAGGCCTTCT CCGCGGCCAT CACCGCCGCC
GCGCACGGCC AGATGCCGCA GCCGGTCGGC GGCGCGCCGC TGGGGCCGCC GCAGACCCTG
GCCGTCGGCG GCGGCGCGCT CGGCTCCGGG CAGGGGCCGG TGCCGGGCTC GTATCCCGGT
CAGGGGCCGG TGCCGGGCTC GTATCCCGGT CAAGGGCCGG TGCCGGGCTC GTATCCCGGC
CAGGGCATGC ACAGCGGCCA CGGCTCGTAT CCCGGCCAGG GGCAGGGCTC CGGCGTGATG
CCGGGCGTCA CCTCCAATCA GCAGGGTCCG CGGCCGGTGA CCGGCTCGAC CCTGGGCGCG
GCCGCGGCCC AGAACCTCAC CCAGGGCACC ATGCCCGGGG CCCGGCGCAA GCGCACGGCC
TTGATCGCGG GCGCCGCCGT GGCCACGCTC GCGGGCGTAC TCGCGGCCGT GCTGGTGGTG
GGCTCGGGCG GCGATGACGA CGAAGAGCCG ACCCAGCTCG CCGCCATCGA GACCCCGGCC
GACGCCGCCG AGGTCGCGCT CGCGACCCCG ATCGACGCGG CCGTGGCCGT GGCCGTCGAG
GTCGAGGAGC CCGACGCCGC GCCGCCGCAG ATCGCCATCG AAGTCAAGAC CCGCCCCCCG
GGAGCCACCG TGTTCCTGGG CGACAGCGAC GAGCCGCTGG GGACCACGCC GTACACCTAT
GAAGCTGCGG CCAGCTCCGA GGTGCTCAGC TTCCGCCTCG AGCGCGACGG CTACGAGACC
GAGGAGGTCG AGTTCGAGGG CGACACCAAC CAGAGCTTGC GGCTGAGCCT CTCGCGCAAG
CGCCGAACGC CGACCAAGCG GCCTCCGAAG AAGCCGGACT CGCCCGGCGG CGATGAGCTG
CTGTATCGTT AG
 
Protein sequence
MIGEKLGSYR VVQKIGEGGM GAVYVGEHEL LSRKAAIKVL LPMLSNDEEL VQRFFNEAKA 
ATVINHAGIV QVFDFGTTAS GQAFLVMEFL EGEALEQRLE RLGRLPVADA LRIVRHCAGA
LHAAHEVGIV HRDLKPDNIF LVPDPQVEGG ERSKILDFGI AKLTDERQSG SVKTRTGSLM
GTPTYMAPEQ CRGAGEVDLR ADIYSMGCVL FHLVSGRPPF VGAGVGEVLA AHLREPAPPV
RAFAPDVPPA VEALIARTLA KEPGDRFPDM KAFSAAITAA AHGQMPQPVG GAPLGPPQTL
AVGGGALGSG QGPVPGSYPG QGPVPGSYPG QGPVPGSYPG QGMHSGHGSY PGQGQGSGVM
PGVTSNQQGP RPVTGSTLGA AAAQNLTQGT MPGARRKRTA LIAGAAVATL AGVLAAVLVV
GSGGDDDEEP TQLAAIETPA DAAEVALATP IDAAVAVAVE VEEPDAAPPQ IAIEVKTRPP
GATVFLGDSD EPLGTTPYTY EAAASSEVLS FRLERDGYET EEVEFEGDTN QSLRLSLSRK
RRTPTKRPPK KPDSPGGDEL LYR
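
The protein sequence can be checked against a translation of the gene sequence, and the GC content field can presumably be recomputed from the same string. The sketch below assumes the sequence is pasted as whitespace-separated blocks, as shown above, and uses the standard codon assignments, which translation table 11 shares for amino acids. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGATCGGCG AAAAACTCGG CAGCTATCGC"
print(translate(fragment))  # MIGEKLGSYR
print(f"{gc_content(fragment):.0%}")
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence, after applying `clean` to both, would confirm that the two blocks belong together.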