Gene Hoch_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0074 
Symbol 
ID8542445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp112151 
End bp115054 
Gene Length2904 bp 
Protein Length967 aa 
Translation table11 
GC content73% 
IMG OID646384862 
Productserine/threonine protein kinase with TPR repeats 
Protein accessionYP_003264608 
Protein GI262193399 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCACTA GCCCGCAAAG CACCGTCTGC ACCCACTGCG GCCTCGCGCT CGACCACGAC 
GCCCTGTTCT GTCCCAAGTG CGGCACCGCA CGCAGCAAGG ACCTGAACGG CGACCCGCTG
ATGGGTACCG TGGTCGGCGA CCGCTTCCTC ATCCTCGAGC GCCTCGGCCA CGGCGGCTCG
GGGACCATCT ACCGCGCCGA GCACGTCACC CTGCGCCGCA AAGTGGCGGT CAAGGTGTTG
CATCACGAGC TGTCGCGCGA CGACCTGGCG GTCGAACGCT TCCGCCGCGA AGCCACGACC
GTGAGCGAGC TCGACAACGA TCACATCGTC GGCATCTACG ATTTCGGACG CACGAACGAC
GGCCGCCTGT ACCTGGCCAT GGAGATGCTC GAGGGCGAGA CCCTGCAGGA CGTGCTGGCG
CGCGATCGCG AGCTGCCGCC GGTGCGCGCT GTCGATGTGC TGCTGCAGCT CGGCGAGGCG
CTGTCCGAGG CCCACGCCAT CGGCTACGTG CATCGCGACC TGCGGCCGCG CAACATCTGG
CTGACCGAGC GGCGCGGGCG CGAGAACTTC GTCAAGCTGC TCGACTTCGG TCTGGCCAAG
CTGGTCGAAA ACGAGGGCCG CGCGGCCACG ACCAGCCTGG GCATGACCTT TGGCGATCCC
AAGTACATGT CGCCGGAGCA GGCCCGCGGC GACGTCGTCG ACCGCCGCGC CGACATCTAC
TCGATGGGCT GCATCGCCTA CGAGATGCTG GTGGGCGCGC CGCCGTTCGA CGGACAGGTG
TTCGAGATCC TCAGCCAGCA CACCAACGCC GCGCCCATGG CGCCGCGCGA GCTGCGCCCC
GAGCTGCCCG AGTGGCTCAG CCAGGTGGTG CTGCAGATGC TGGCCAAACG GCCAGACGAG
CGCTTTTCGA CCGTGATGGC GCTTATCGAG GCGCTGCGTC AGCGCACCGA ATCGCCGGCC
ATGCCCGCGC CTTTTCCGCT GGCCGACGAG GACGCCGAGA GCGCCGACGA CGAGGGCGAG
GGCGCGCCGA GCAAGAGCCG CGCGCGCACT GGTACCATCA GCTACGGCGA GGGCGGCGGT
GAGGATGGCG ACGACGACGG CGAGAGCGAC GGCGACCGAC CCACGACCGG GGCGCGCGGA
CGGCGCGCGG TGACCGCCAG CGGCCGCGCC AAATCGCGCA ACACGCGCAC CGCGCGCAAG
ACCAGCCCCG GCGTCCCGCC AGCGAGCAGC TCGGCCAGCC ATCTGCCGGC GATGGCGATT
CCCGCCGTGC CCATGCCGAG CGTCGAAGGT GGCCGCGGCG CGTCGGCCGG GGGCGCCGAG
GACGCGGCCG GGACGCCGGC GCACACGCCG GAGACGTCTG CGCCCAGCGC TAAAAGCGGA
CGCGCGGACG AGAACGTAGC GGCGGGCACT GACGGCGCGG TGAGCGCGCC GCACAAGCAC
GAATCCACCG CGCCCATCGG GGTACCGGCA GCGTCCGAGG CGGCGCCCGC GAAGACGCCG
GCAGTGTCCG CGAAGACGCC GGCAGCGCCC GCGAAGACGC CGGGCGCGAG CACGGGCGCC
GAGGCCACAG CCGCGAGCCG CAGCGCCTCG CCCGAGCCCG AGCGCAGCGC GCGCTCGCGT
CCGCCGACGC CAGTCCCGGG CGCGGGACGC GCGGCCACGC CATCCGCGCC CGCGGCCAAG
AGTTCGGAGG CGGGCGCCGG CAAGAGCCGC TCGCCGGTCT CCGAGAGCAC GGTCGAGCTG
CCGGCGGCAG CGTCTGCGAA GCCAGCGGCG TCCGCGAAGC CGGCAGCGTC TGCGAAGCCG
AGCGAGAGCC CTGCGAAGCC GGCGGCGTCC GCCAAACCGG CGGCGTCCGC CAAACCGGCG
GCGTCCGCCA AACCGGCGGC GTCCGCCAAA CCGGCGGCGT CCGCCAAACC GGCGGCGGCC
GCGAGTCCGA AGACGGGTAC GGGCACGGAG CCCGAGGGCG GCGATCTGAG CGGCGATTGG
TTCATGGACG CGGGCGGCGA TACCGGCGCC GCCGCCAGCG CCGCCCGGGA ACCCGGTGAC
GGCGACAGCG GCGTTCTCGC CGACGACGTG GCTTCGTTCA CGCCGCGGAG CAAGCGCCAG
CGACAGGCCT TCTTGGCCGT GGGCGGCATC GTGCTGCTGC TGCTGATCGT CGGGCTGCTG
TGGGCCTCGG GCGGTGACGA TGAGGGCGGT GACGAGGGCG CTGCCGTGGC CGTCGATGCC
GGCGCGCCGA GCGCGGCCGA GATCTTGTCG GCGCAGGCCG AAGCCGCAGC GACAGCCGAA
GCGACAGCCG ACGCCGGCGC GAGCCCGGCC GACGCCGCGC CCGCCGATGC CGCGCCTGCA
GCGAGCAGCG CGACTCCGTC GCCGCCGACG AGCCCGGAAC CCAGCCCTGA GGCCGGCGGC
GACCAGAACC CCGAGCGCAC CGCGGCGCGC ACTTCCGAGC GCACCACGGC GCGCGATGAG
CGTCCGTCTC GCCCGGCCCG GCCCGCGGCC GGATCGTCGG ACGAGGGCGA CGAGGGCGAC
GACGCGCCGG CCGACGACAA CGCGCGCCAG GCCGCGTTCT ACGTCAAGCT CGGCAACTCC
AAGCTCAAGG GCGGCAATCC CCTGGGCGCA GCCGGCGACT TCAAAAAGGC CCTCGACCTC
GACCCGAGAA ACGTCGACGC CACCCTCGGA CAGGCGCAGA TCGCCTACAA CCAGGGTTTA
TACGGCAAGG CGATCCCCCT GTTCGAGAAG GCCGCGCGCA TGCGTCCGCG CAGCGCCGAA
GTACAGATTC TGCTCGGTCA GGCGTATTTG GCAGCCGGCA ACAAATCCAA AGCGGCGAGC
AGCTTTCGCC GCGCGCTCCA GCTCCGTCCC GGCGACGCGC GCGCCGAGCG CGGTTTCACC
GAAGCCACCG GCAACCCCCC GTAG
 
Protein sequence
MPTSPQSTVC THCGLALDHD ALFCPKCGTA RSKDLNGDPL MGTVVGDRFL ILERLGHGGS 
GTIYRAEHVT LRRKVAVKVL HHELSRDDLA VERFRREATT VSELDNDHIV GIYDFGRTND
GRLYLAMEML EGETLQDVLA RDRELPPVRA VDVLLQLGEA LSEAHAIGYV HRDLRPRNIW
LTERRGRENF VKLLDFGLAK LVENEGRAAT TSLGMTFGDP KYMSPEQARG DVVDRRADIY
SMGCIAYEML VGAPPFDGQV FEILSQHTNA APMAPRELRP ELPEWLSQVV LQMLAKRPDE
RFSTVMALIE ALRQRTESPA MPAPFPLADE DAESADDEGE GAPSKSRART GTISYGEGGG
EDGDDDGESD GDRPTTGARG RRAVTASGRA KSRNTRTARK TSPGVPPASS SASHLPAMAI
PAVPMPSVEG GRGASAGGAE DAAGTPAHTP ETSAPSAKSG RADENVAAGT DGAVSAPHKH
ESTAPIGVPA ASEAAPAKTP AVSAKTPAAP AKTPGASTGA EATAASRSAS PEPERSARSR
PPTPVPGAGR AATPSAPAAK SSEAGAGKSR SPVSESTVEL PAAASAKPAA SAKPAASAKP
SESPAKPAAS AKPAASAKPA ASAKPAASAK PAASAKPAAA ASPKTGTGTE PEGGDLSGDW
FMDAGGDTGA AASAAREPGD GDSGVLADDV ASFTPRSKRQ RQAFLAVGGI VLLLLIVGLL
WASGGDDEGG DEGAAVAVDA GAPSAAEILS AQAEAAATAE ATADAGASPA DAAPADAAPA
ASSATPSPPT SPEPSPEAGG DQNPERTAAR TSERTTARDE RPSRPARPAA GSSDEGDEGD
DAPADDNARQ AAFYVKLGNS KLKGGNPLGA AGDFKKALDL DPRNVDATLG QAQIAYNQGL
YGKAIPLFEK AARMRPRSAE VQILLGQAYL AAGNKSKAAS SFRRALQLRP GDARAERGFT
EATGNPP