Gene Hoch_5349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5349 
Symbol 
ID8547761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7361348 
End bp7364398 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content69% 
IMG OID646390022 
Productserine/threonine protein kinase with TPR repeats 
Protein accessionYP_003269726 
Protein GI262198517 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.553326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCCC GCTTCCCCAA CCAGCCTCGT GACCGCCAAC CCGGCACGCC GACGCAGATC 
GGCGAAGAGG CCGAGTACAC TGACAAGATG GCGGCGGCGC CCACCGATGA TTTCCTGCCC
GAGGCTCTCC CCACCGATCG CACCCTGCAA TCGGCCGCGG ACACCGGCGC GTTGGCCACC
CTGAAGGGGT CGCCGCTCGG CCACCTGGGA CGCTACGAGC TGCTCGAGTA CAAAGCCGCC
GGCGCCATGG GCGAAATCTA CATCGCCTAC GATGCCCAGC TCGACCGCAA GCTGGCGCTC
AAAGTGTTGC GCCCCGAGGT CAGGCGCCGC AGCGACCACG CCTCGCAGCG GCTGCTGCGC
GAGGCTCAGA GCCTGGCTCA GGTGTCGCAT CCCAACGTGG TGCCGGTCTA CGAGGCCGGC
GAGATCGACG GTCGGGTCTT CGTCGCCATG GAGCTGGTCG CCGGCGAGCC GCTCGGCCGC
TGGCTGGCCA AGCACGCGCC GCCCGCCGCG CGCTTCTGGC GCTCCATCCT CGACACCTTC
CTCGAGGCCG GTCGCGGGCT GGCCGCCGTC CACGAGGCCG GCTTGCTGCA CCGGGACTTC
AAACCCGACA ACGTGCTGGT CGGCGATGAC GGCCGCGTGC GCGTGGTCGA CTTCGGTCTG
GCGCGGGCCA GCAACACCAC GGCCAGCATC GACGAACGCT CCACCACCTC CATGGAACTC
GAACCCCATG CGAGCGCGCA GCGCCCCCGC GCATGGACCG CCGATCTCAC CCGCAGCGGC
GTGGTCGTCG GCACGCCGGC GTACATGGCG CCCGAGCAGA TCAGAGGCGG CACGGTCGAC
GCGCGCAGCG ATCAGTTCAG CTACTGCGTG GCCCTGTACG AAGCCCTCTA CGGCCACCGA
CCCTTCCGCG GCGAGGACTT CACCAGCCTG CGTGCCTCGG TGCTAAGCGG CGAAGTCCTG
GCGCCTTCGG CGTCGGCCAA GGTGCCCTCG TGGGTGTGGC GCGTGCTGCT GCGCGGCCTC
GCGACGGAGC CCGAGCAGCG CTATCCCGAC ATGCACGCGC TGCTCTCGGC CCTGTCCGCG
GATCCGGCGC GCAAACGGCG CCAGCGGGGC ACCGCGGTCG CGCTGCTGTT TGGCGGCACC
GCGGCCGGCG TGCTGGCGAC CATGCTCGCC AGCGAGCACC GCGACGAGCG CTGTACCGCC
GCAGCCGAGA GCGCACTAGA GGAAGTGTGG ACGGAAGCCA CCCGCAACCA GGTGCGCGCC
GCCTTCGCCG CATCCACCCT GCCCTACGCG GCCACCGCGT GGCAGAGCGT CGACCGCGGT
ATTCAGGACT ATATCCGCCG CTGGCAGCAC AGCTCGATCG AGCTATGCGA ATCCGACGCC
GAAGACGGCC TCGACGGCAC CGCGGACGGC CACGCTCCCA CCACGCTCTG TCTCGATACG
CAAGCAGAGC GCCTGCAGCT ACTGGTCACC GAGCTGGTTC ACGCCGACGC CCACCTGGTC
GAAAACGCCT CCTCTGCGCT CGCGTTCCTC GGACAACCCG AGGCCTGTGG GACCCAATTC
AGCAGTCTGC CGCCAGCGCC CGACGCGCAA ACCCGGGACA AGCTGTCCCA GATCCGCACG
GCCATGAGCA GCGCGCGCCT GCAGGCTATC GCCGGCTCCT TCGACCCGGC TCTGCGTCAG
CTCGACGACC AGATCGAAGC CGCCGAGTCC CTGTCCTACG AGCCGGTGCT GGCCGAGGCT
CTGTACCACG CCGGCGACGC GCGCCTCGAA CGCGGACGCG AGGACGAAAT AGAGACCGGC
ACCGAGCTGC TCAAGCGAGC GCTCGACATC GCCGAGAGCA GCGGCAACGA CGCCCTCGCC
ACCGACATCT GGAACGCGCT GGCGCGACGC CGCGAGGCCA CCTTGCCGCC CGAGCAGATC
GCCTTCTGGT CGCGTCGGGC GCTGGCGCTC ACCAAGCGCA TGAGCAGCTA TGACCAGCGC
CGCGCGCAAG CGCTGCGCAA TCTGGGCACG GCCCTGTATC GCTCGCAGCA GTACACCGAG
GCTGAAGTCT ACCAGCGCCG GGCCATCGAC CTAGCGCGCA GCAACAACGC CTCGCCGCTG
CTGAGCGCCG ATATGCTGCA CGCGCTCGCC AACACGCTGC ACGCGCTCGC CAAGTACGAC
GACGCCCGCG TGCACTACGA AGAGGCGCTG GCGCTGGCCG ACGGCGAGCT CGGTCGCGGT
CACCCCAAAG TGCAGGCGCT GCGCTTCGAC TTTGCCGATT TCCTCATCGA GACCGCCGAG
CTCAGCGGCG ACGACGGCAA AGGCGCGCTC GCGCTCGACC AGGCGCGTAC CTTCCTCGAT
ACCGCCCGCA CCATCCGCGC CCAGGTCTAC GGCCCGCAGA GCCCGCAGGT CGCCGCGGTC
CACGTGGCGC TGGCCAAGCT CGAGACCAAG AGCGGCGCCC TCGATGAGGC CGCGTACCAC
GCCGGACGCG CCATCGATTT CTATCGTGCG CACTACGGCG AACACGCGCC GCAGCTCGCC
GAAGCGCTGT CGCAGCTGGG ATTCGTCCAG TTCCGCAGCC GGCGCTACAA GGAAGCCCTC
GTCGCCTGGC AGCAGGAGAG CGCCCTGCGC GAGCAAAGCG GCGCCCTCCC CATCGTGCGC
GGCCTCAACC AGAGCAACAT CGCCGAGGCC TTGGTCCACC TGCGCCGCTA CGACGAGGCC
TTCGCCGCCT TCGAGCGCGC CCAAGAATAC TACGGCGCCG AAGACGATAT CTCCCCGATT
TACTCCGGCC TGGTCGAAAA AGGCCGCGGC CAAGCACTGC TGGGACAGGG TCGCGCCGCG
GCCGCGGTCC CGCACCTCGA GGCCGCACTC GCCGTCTTCC ACGAGCACCC TTTCGACGTG
CTCGAAGACG CGGATACAGC CTGGAGCCTG GCGCGTGCGC TGCGCATCAG CGCACCCACA
CGCCTCGAAG AAGCGCGTTC CTTGGCAACC AAAGCCGAGG ACATCTATCG CGCCAACGAG
GCCACTCGCG ACATAGCAGA CGAAATCCGC ACCTGGCTCG ACTCTCTCTG A
 
Protein sequence
MSSRFPNQPR DRQPGTPTQI GEEAEYTDKM AAAPTDDFLP EALPTDRTLQ SAADTGALAT 
LKGSPLGHLG RYELLEYKAA GAMGEIYIAY DAQLDRKLAL KVLRPEVRRR SDHASQRLLR
EAQSLAQVSH PNVVPVYEAG EIDGRVFVAM ELVAGEPLGR WLAKHAPPAA RFWRSILDTF
LEAGRGLAAV HEAGLLHRDF KPDNVLVGDD GRVRVVDFGL ARASNTTASI DERSTTSMEL
EPHASAQRPR AWTADLTRSG VVVGTPAYMA PEQIRGGTVD ARSDQFSYCV ALYEALYGHR
PFRGEDFTSL RASVLSGEVL APSASAKVPS WVWRVLLRGL ATEPEQRYPD MHALLSALSA
DPARKRRQRG TAVALLFGGT AAGVLATMLA SEHRDERCTA AAESALEEVW TEATRNQVRA
AFAASTLPYA ATAWQSVDRG IQDYIRRWQH SSIELCESDA EDGLDGTADG HAPTTLCLDT
QAERLQLLVT ELVHADAHLV ENASSALAFL GQPEACGTQF SSLPPAPDAQ TRDKLSQIRT
AMSSARLQAI AGSFDPALRQ LDDQIEAAES LSYEPVLAEA LYHAGDARLE RGREDEIETG
TELLKRALDI AESSGNDALA TDIWNALARR REATLPPEQI AFWSRRALAL TKRMSSYDQR
RAQALRNLGT ALYRSQQYTE AEVYQRRAID LARSNNASPL LSADMLHALA NTLHALAKYD
DARVHYEEAL ALADGELGRG HPKVQALRFD FADFLIETAE LSGDDGKGAL ALDQARTFLD
TARTIRAQVY GPQSPQVAAV HVALAKLETK SGALDEAAYH AGRAIDFYRA HYGEHAPQLA
EALSQLGFVQ FRSRRYKEAL VAWQQESALR EQSGALPIVR GLNQSNIAEA LVHLRRYDEA
FAAFERAQEY YGAEDDISPI YSGLVEKGRG QALLGQGRAA AAVPHLEAAL AVFHEHPFDV
LEDADTAWSL ARALRISAPT RLEEARSLAT KAEDIYRANE ATRDIADEIR TWLDSL