Gene Hoch_4959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4959 
Symbol 
ID8547367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6835347 
End bp6838481 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content70% 
IMG OID646389633 
Productserine/threonine protein kinase with TPR repeats 
Protein accessionYP_003269341 
Protein GI262198132 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGC CCGTGGGCAT GTCGAACAAT CAGCACATTC AAGCCCAGTT CCCCCGTAAG 
TTCGGTAACT ACCACCTGCT AGCGCCGTTG GCGCAGGGGG GCATGGGCGC TCTCTATCTG
GCGGTCAAAG GCGACCGCGG TCTGGAGCGT TTGCTCGTCA TCAAGACCGT GCTGCCGCAC
CTGGCCGACC AGGAGTATTT GGCGCGCTTC CGCGACGAGG CCAAGGTCGT GGTGCAGCTC
TCGCACGGCA ACCTGATCCC GGTGTTTGAC GCCGGACAGG TGGGCGGCGA GGTGTTCCTG
GCCATGGAGT TCATCGAGGG GCGCGATCTG CGGGCGGTGT GGAATCGCTG CGCCAAGAAG
CAGGTCGCCT TTCCCCTCGA CGTCGCCGTG TACATCGTCA AGGAGCTGTG CCGCGGCCTG
TCGTACGCGC ACTCGTTCCG CGACCTCAAG CTGGTGCATC GCGACGTCTC GCCGCCGAAC
ATCATCATCT CGTTCTCGGG CGAGGTGAAG CTCACCGATT TCGGACTGGC CTCGTCCACG
CTCAAGCTCG AGAAGACCGC GCCCGGCATC ATCTACGGCA AGGTCGCGTA CATGTCGCCG
GAGCAGGCGC GCGGCGAGGA TCTCGACGGC CGCTCCGACA TCTACGCGGC CGCGATCGTG
CTGTGGGAGA TGCTCACCGG GCGGCAGCTT TTCCCGCCGG GCAAGGACCA GCCGCAGGAT
CTGATCAAGC GCGCGCGCAA CCCGCGCGTG CCGCTGCCCT CGCAGCGTGC CCCGCGCGTG
CCCACGACGC TCGACGACAT CTGCCTCAAG GCGCTGTGTG CGCGCCGCGA AGACCGCTAC
GCCACCGGTG ACGAGTTCCG CGAGGCGCTG TCGACCTGGC TGGCGGCGGA GCATCCCAAG
ACCGACGCCT CGAAGATCGA GACGTTCTTG CACGACCTCT TCGCCGAGGA CATCGAGCGC
GAGCGCCGCG AGCGCGAGGA GTTGCTCGGC AAGATCCGCA ACCGCGTGCA GACCATGCCG
CCCAACGACG AGCTGCGCCG CGCGCTCGAG CAGTCGGGCG ACATCGCCGC CGCGGACGTC
CGCCGCGCGC CTGGCAGCCG CGGCCGCCGG GCCAGCGATC ACGGCGAGCA GACCGAGGAG
CGCCGTCAGG TTCCCGATCG CCGCAAGGGC GCGCAGACCT CGCGGCCGCA GCTCACGCAT
CCCGGGCACG GGCCGATGCG GGCGCCCAGT CAGCCGCCGC CGCCGAGCGA GGGCGCCGGC
GAGAGCAACC CGTCCAACGC GCTGATGCGC ACCAAGCCGC TCGACGCCAA CGAGGTCGTC
GGGCAGGTGA TCGACGGGCG CTATTCGATC AAGAAGCTGG TGGGCGAGGG CGGCATGGGC
CGGGTGTATC TGGCCGAGCA CGTCGAGATC GGTCGGCGCG TGGCCATGAA GATCTTGCAT
CCGGTGTACA GCCACATGCC GGACCTGGTC GAGCGCTTCC GCCGCGAGGC GCGCGCGGCC
TCGCGCATCG GCCATCCGCA CATCGTCGAT GTCACCGACT CGGGGCGCAC GCGCGAGGGC
TCGGTGTACT TCGTGATGGA GTACCTCGAG GGCGTCGATG TGGGCACGGT CATCGAGCGC
GAGGGCGCGC TCGACGTGCG CCGGGCGCTG CGTGTGACCA CGCAGATCTG CCGGGCCCTG
GCCGCGGCGC ACGACGCCGG CATCATCCAT CGCGACCTCA AGCCCGAGAA CATCTTCCTC
ACCATGCGCG ACGGCACCTC GGACTTCGTC AAGGTGTTGG ACTTCGGCAT CGCCAAGAGC
ACCGAGGCCG AGCGCAATCG CACCCGGCGC CTCACCAGCC CGGGCATGGC CATGGGCACG
CCCGAGTACA TGGCGCCCGA GCAGGCGGCC GGCAAGCCGG CGGACGAGCG CTGCGACGTG
TACGCGGTGG GCGCGATCCT CTATGAGGCG CTCACGGGCG AGCCTCCGTA CGAGGGCGAC
AACTTCATGG AGGTCTTGAC CAAGAAGGCC ATGAACGAGC CCCCGCCCGT GCGCGAAGTG
CGGCCCGAGG TGCCCGAGCA GGTCGCCGAG CTGGTCAAGC GCGCGATGGC GCGCGACCCC
GAGGCCCGCC TGGCCTCCAT GGATGTCTTT GAGTACGAGC TGACCAAGTG CCTGGCGGGT
CGCGGTGAGG CCGTGGCCGG GATCCTCGGC ATGCGCACGG ACAACGAGCT GGTGGCCAGC
CTCAACCCCG GGCTGGCGCT GCCGCCGGCG CCGGCCGAGG TCGAGCGCAA GCTGGGCGGC
CGAGCGCCGA GCCCGACGGA CGCGGTGCTG GTGGCGCCCG GCGAAGAGGT GATCGACGCC
GCGCTTGTCG ATGTCGCGGC CAAGCCGGCG TCGTCGACCG CGCGTCCCGA CACCGACGCC
GGCCACACCA CCGTGCTCAC GCCGGTCACC GGGCGTTCGT CGTCGAGCAT CATGCGCTGG
ACCGCGGCGG GCGTGCTCGG CCTGTTGCTG GTCGGCGGGC TGATCTACGT CGCCTCCGAG
GAGGACGACG AGCGCGCGCG GACCGAGGTT CCGATCGCCT CGGCTCGCGG GCTCGACGAC
GACCCCGCCG CCGAGCCGGA CACAGCGCCG CCCGACGACG AGGGCATCGA GATCGTCGAG
ATGGAGCCCG ACGAGGTCGA GCCTGACGAG GGCACGCCGG ACGAGGCCGA GACCGAGACC
GCGGCCAATG GCCGCGATAC GAGCGGGCAG GGCAACGGCG GCTCCGAGAC CGAAGCGACG
GGGAGCGAGA GCGAGACCGC GCAGCGCAGC GAGGGCATGA GCAAGGCCGA GGCCGAGTCG
CTCTTCGCCC AGGCCGAGCG CAAGCGCCTC GGCGATCCCA ACGGCGCGGT GCAACTCTAC
AAGCAGGCGG CCAAGCATCG GGCCTTCCGT CGGCGCGCAT ACGTGCGCAT GGCCGAGGTG
TCGTTCGACC GCAAAGATTG GGACGGCGCG ATCGGCTACG CGCAGCGTGC TGGCGGCGTG
CAGGCCGATC GCATTCTCGG CAACGCGTAT CTGCGCAAGG GCGACGTCAA CAAGGCGCGG
CGCCACTACG AAGCCGTGCT GGCGCGCAAT CCCAACGACA CCGCGATTCG CGCGCTGCTC
GAGCGGCTCA ACTGA
 
Protein sequence
MTAPVGMSNN QHIQAQFPRK FGNYHLLAPL AQGGMGALYL AVKGDRGLER LLVIKTVLPH 
LADQEYLARF RDEAKVVVQL SHGNLIPVFD AGQVGGEVFL AMEFIEGRDL RAVWNRCAKK
QVAFPLDVAV YIVKELCRGL SYAHSFRDLK LVHRDVSPPN IIISFSGEVK LTDFGLASST
LKLEKTAPGI IYGKVAYMSP EQARGEDLDG RSDIYAAAIV LWEMLTGRQL FPPGKDQPQD
LIKRARNPRV PLPSQRAPRV PTTLDDICLK ALCARREDRY ATGDEFREAL STWLAAEHPK
TDASKIETFL HDLFAEDIER ERREREELLG KIRNRVQTMP PNDELRRALE QSGDIAAADV
RRAPGSRGRR ASDHGEQTEE RRQVPDRRKG AQTSRPQLTH PGHGPMRAPS QPPPPSEGAG
ESNPSNALMR TKPLDANEVV GQVIDGRYSI KKLVGEGGMG RVYLAEHVEI GRRVAMKILH
PVYSHMPDLV ERFRREARAA SRIGHPHIVD VTDSGRTREG SVYFVMEYLE GVDVGTVIER
EGALDVRRAL RVTTQICRAL AAAHDAGIIH RDLKPENIFL TMRDGTSDFV KVLDFGIAKS
TEAERNRTRR LTSPGMAMGT PEYMAPEQAA GKPADERCDV YAVGAILYEA LTGEPPYEGD
NFMEVLTKKA MNEPPPVREV RPEVPEQVAE LVKRAMARDP EARLASMDVF EYELTKCLAG
RGEAVAGILG MRTDNELVAS LNPGLALPPA PAEVERKLGG RAPSPTDAVL VAPGEEVIDA
ALVDVAAKPA SSTARPDTDA GHTTVLTPVT GRSSSSIMRW TAAGVLGLLL VGGLIYVASE
EDDERARTEV PIASARGLDD DPAAEPDTAP PDDEGIEIVE MEPDEVEPDE GTPDEAETET
AANGRDTSGQ GNGGSETEAT GSESETAQRS EGMSKAEAES LFAQAERKRL GDPNGAVQLY
KQAAKHRAFR RRAYVRMAEV SFDRKDWDGA IGYAQRAGGV QADRILGNAY LRKGDVNKAR
RHYEAVLARN PNDTAIRALL ERLN