Gene Hoch_3747 details


Gene Information

Locus tag: Hoch_3747
Symbol:
ID: 8546140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5150367
End bp: 5152226
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 71%
IMG OID: 646388417
Product: serine/threonine protein kinase
Protein accession: YP_003268140
Protein GI: 262196931
COG category:
    [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
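
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable and function names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 5150367
end_bp = 5152226
gene_length_bp = 1860
protein_length_aa = 619

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS consists of whole codons, the last of which is a
# stop codon that does not contribute a residue to the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, codons, expected_protein_length, record_is_consistent())
```

Under these assumptions the span is 1860 bp, which corresponds to 620 codons: 619 residues plus the stop codon.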


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.350974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.548904
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATCG CGGTCGGCCA GCGTGTCGAC AAGTACGAGG TCGCCGAGCA GGTCGGTCAG 
GGCGGCATGG CCGTGGTCTA TCGCGGCGTC GATCGCTCGC TCGAGCGCGT CGTCGCCATC
AAGGTGCTGC ACCAGCATCT GGCCGGTCAC GAGGAGGCGC GCGCGCGCTT TGCCCGCGAG
GCCCGCGCCG TGGCCAAGCT GCGCCACGAG AACATCCTCG AGATCTACGA CTTCGCCGAC
GACGACGCGC GCGACAGCTA CATCGTCACC GAGTTCATCG ACGGCCCCAC GCTCACCGAA
TCGCTCGCCG ATCACCTGCC CCGCTACCCC GAGATCGGCG CCATGGTGAT GACCCAGGTG
TGCCGCGCGC TGGCCCACGC GCACAGCCTG GGCATCCTGC ACCGCGACGT GAAGCCCGAG
AACATCATGA TCCGCACCGA CGGCGTGGTG AAGCTCACCG ACTTCGGCAT CGCCCAGATG
CTCGACGCGC AGCGCATGAC GGTGACCGGC CAGCTCCTGG GATCGCCCGC GTACATGTCG
CCCGAGCACA TCGAGGGCCA GCCGCTCGAC TTCCGCACCG ACATCTTCGC CGCCGGCGTG
GTGCTGTATC AGCTCGTGGT CGGCGAGCTG CCGTTCACCG GGCGCAACCC CCACGAGCTG
CTCAAGCGCA TCAGCGACGG CGTCTACCGC GACCCGCGCC AGGCCAATCC CCTGGTCGGC
AACGAGCTCG GGCGCATCAT CGACACCGCG CTGGCGCGCG CCAAAGAGGA CCGCTATCGC
GACATCACCG AGATGCTGAG CGCACTCGAG CGCTACCTCA AGCACTCGGG GATCGACGAG
CCGCGCCAGG AGCTTTCGCG CTATTTCGAC GCCCCGGTCG CCTATGAGCT GGCCCTGCGC
GAGCGCCTGC TGGCCGCGCT GGTGCAGCGC GGGCGCGAGC TGGCAGGCAG CGAGCGCGTC
GCCGCCCTGG ACGTATTCAA CCGCGCGCTG ACCATCGACC CGGACAACGC CGACGTGCTC
GCCCAGGTGC ACGCGCTGTC GCGGCGCCAG CGCACCCGGC GCATGCTCGC GCTGTTCGGC
GGCGTGCTGG CGGCCGCCGC GCTGCTGCTC GGCGCCGTGC AGGTGTTGCG CGAGGCGCCG
GCGCCCGAGC CCGCGCGCGT CCTGCCCCCG GCCCAGGTCG GCCCGGCCAC CGAACCCGCT
GCCGGCAGCG AGCTCCCGGC GCTGGCGGGC GACAGCGCGA TCGCGGCGGT CCCGGCCGAC
GCTGGCATGG ACGCCAGCGC CGCCAGCGCC AGTACCAGCC CCGGGCTCGG CTCGCAGAGC
CCGCCGCAGC GCCCCGGCAG CAGACGACCC GGCGTCGGCC GCGAGCAGTC GCCCGACGCC
GCCCCGCGCC GGGATCCGCC AGTCGCCAGC GCCCCGCCGG GCCCGGCCAC GCGCGCATTC
ACGCTCAACG TCTCGCCGCT CAAATCCGAG TACCGCGTCG ACGATGGCCC CTGGCTGCCG
ATCGCGCGCA GCCGCGCCAC CCTGGAGCTG GACCGCGGCA GCCACGTGGT CGAAGTGCGC
AACACCGCCT GCTGCGAATC CGACCAGCAG ATCATCGCCG CCGATGCGCC CGGCGGCGTG
CTCGACTTCA CCCTGGGCTA CTTGCCGGCT ATGATCGTGC CGAAATGCCC CGTCGTCGCG
GTCGGCGTCC AGGTAGACGG GCAACCCGCT CGACTGGACC GAAAACATCC GATTTTCTTT
ACCCGAAGCC TCGGTCAACG CGCCGTCGTG ATCACCTTCT TCTCCGATGA TGGCACTGAC
GAACACACCG TTCAGGTGCA GTACAACGAG ACCAAGGTAG TCACGTGCGC CTTCCCCTGA
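
A GC content value like the one listed under Gene Information can be recomputed from a gene sequence printed in space-separated blocks of ten bases. The following is a minimal sketch, not the method the database used. The function name `gc_content` is hypothetical, and only the first two blocks of the sequence are used as sample input.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two ten-base blocks of the gene sequence, used only as sample input.
sample = "ATGAGCATCG CGGTCGGCCA"
print(f"{gc_content(sample):.2%}")
```

Passing the full gene sequence, with its line breaks and spaces left in place, gives the value for the whole gene.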
 
Protein sequence
MSIAVGQRVD KYEVAEQVGQ GGMAVVYRGV DRSLERVVAI KVLHQHLAGH EEARARFARE 
ARAVAKLRHE NILEIYDFAD DDARDSYIVT EFIDGPTLTE SLADHLPRYP EIGAMVMTQV
CRALAHAHSL GILHRDVKPE NIMIRTDGVV KLTDFGIAQM LDAQRMTVTG QLLGSPAYMS
PEHIEGQPLD FRTDIFAAGV VLYQLVVGEL PFTGRNPHEL LKRISDGVYR DPRQANPLVG
NELGRIIDTA LARAKEDRYR DITEMLSALE RYLKHSGIDE PRQELSRYFD APVAYELALR
ERLLAALVQR GRELAGSERV AALDVFNRAL TIDPDNADVL AQVHALSRRQ RTRRMLALFG
GVLAAAALLL GAVQVLREAP APEPARVLPP AQVGPATEPA AGSELPALAG DSAIAAVPAD
AGMDASAASA STSPGLGSQS PPQRPGSRRP GVGREQSPDA APRRDPPVAS APPGPATRAF
TLNVSPLKSE YRVDDGPWLP IARSRATLEL DRGSHVVEVR NTACCESDQQ IIAADAPGGV
LDFTLGYLPA MIVPKCPVVA VGVQVDGQPA RLDRKHPIFF TRSLGQRAVV ITFFSDDGTD
EHTVQVQYNE TKVVTCAFP
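
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below is a hedged illustration using only the Python standard library. It builds the codon table from the conventional TCAG ordering and assumes that translation table 11 assigns the same amino acids to elongation codons as the standard code. Alternative start codons, which table 11 permits, are not handled, since this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three ten-base blocks of the gene sequence.
print(translate("ATGAGCATCG CGGTCGGCCA GCGTGTCGAC"))  # MSIAVGQRVD
```

The printed residues match the first ten residues of the protein sequence above. Translating the final three blocks of the gene sequence yields the last nine residues, followed by the TGA stop codon.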