Gene Hoch_2501 details


Gene Information

Locus tag: Hoch_2501
Symbol:
ID: 8544888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3443639
End bp: 3446749
Gene Length: 3111 bp
Protein Length: 1036 aa
Translation table: 11
GC content: 74%
IMG OID: 646387201
Product: serine/threonine protein kinase with TPR repeats
Protein accession: YP_003266930
Protein GI: 262195721
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
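
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3443639, 3446749)
print(length)                  # 3111
print(protein_length(length))  # 1036
```

Under these assumptions both values agree with the Gene Length and Protein Length fields in the record.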


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.120058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTAGGAT GGTCGTTCCT CGAACGGCTG TTTCTCATGT CGGTCTCCGA ATCCAAGCGT 
CATCAGCGCC GGGGTGACGC CACGCCGCCC CCCGAGCCCG CGTCCACCGC CCTCCGTGAG
GGGACGGGCG AGGGCGTCGG CGTCGGCGAC AGCGAGGGCG AGAGCGTGGG CGTGGGCGTG
GGCGTGGGCG CGAGCGCCAG CGGCGACGCC AGCGCGGTGG GCACCGGCGA TGACAGCGCT
GCTTCCGACG GGGTAGAGGC GGCGGACGAC GACACCGGCA GCGCCGCCAG CGCCACCGAT
ATCGGCGTGT CGGTGCCGCA GCCGGACCGC GACGAGGTCA CCCGCCGCGA CCTGTTGCCG
GCCAGCGAGT TTGCGGTCAC GCAGGCGACC ATCGAGGCCT TGGCGCCGGC CGGCGAGGCC
GGGCACGAAG CCGGGCGCGA GGCCGGAGCC GGGGCCGGGA CCGGGGCCGC ACCCGACCCC
GAGCGGCGGC GCGCGCGGGC GCATCCGCGA GCGTCGGTCG CGGCGCCCGA TGTGACCCTG
GCGCCGGCCA TGCGCATCGG CCGCTACAGC ATCCTGCGCG AGCTGGGGCG CGGGGGCATG
GGCATCACCT ACGTCGGCTA CGACGAGGAG CTCGAGCGCA AGGTGGCCAT CAAGCTGGTG
CGCCCCGAGC TCGGCGGCAG CGGCGAGTCC GAGGTGCGCC TGCGCCGCGA GGCTCAGGCG
CTGGCCCGGC TGGCGCATCC CAATATCGTC TCGGTGTACG AAGTCGGCCA CTACCGCGGC
CAGACCTACC TGGTGATGGA GCTCATCGAG GGCCAGAGCG CGTCGGCCTG GCTCAAGCAG
TCGGCGCGTA CCTGGCCCGA GGTGCTCGAG GTCTTTGTCC AGGTCGGCCG CGGCCTGCAG
GCAGCGCACG CGGCCGGCCT CATCCACCGC GACGTCAAGC CCGCCAACAT GATCCTCGGC
GAGGACGGGC GCGTGCGCAT CCTCGATTTC GGTCTCGCCC AGCTCGACGG CGAGCCCGCC
GAGATGGCGA GTTCGGACAC CTTGTCGGTC TCCGAGTTCG AGACCTCGCG CAGCAGCAGC
GGCGAGCGCA CCGCGCTCAC GGCCTTTGGC ACCACCGTGG GCACGCCCGC GTACATGGCG
CCCGAGCAGA TCCGCGGCGA GGTCGCCGAC GCGCGCAGCG ACCAGTTCTC GTTCTGCGTG
AGCCTGTTCG AGGCGGTCTA CGGCGAGCGC CCGTTCTCGG GCTCGAGCAT GAGCGTGCTG
CACAGCGAGG TCGAGCGCGA GGCCATCCCC GAGGTCATCG GCAAGAGCTC GGTGCCCGGC
TGGCTGCACG CGACCATCGT GCGCGGTCTG GCGCCGCGGC CCAAGGACCG CTGGCCGAGC
ATGGACGCGC TGCTCGAGGC GCTCAAGCAC GAGCCCGGGC GCGCGCGCCG GCGCTGGTTC
GCGATCGCGG CGCTGGCGGT CGGCGTGCTC GGGCTGGGCT CGGCGGGCGT GGCCGGCTAC
CTGGCCATGG AGAGTCGCCG CGAGGCGCTG TGCAGCGGCG CGCAGGCGCA GATGGACGAG
GTCTGGAACC GCGACACGCG CGCGGCCATC GAGCGCGCCA TGGACGCCAC CGGCGTGCCC
TACGCGGCCG CGAGCTGGCG GCGCACCGAG AGCTTGCTCG ACGACTACGC CGCGCGCTGG
GTGAGCGCGC ACACCGACGC CTGCGAGGCC ACCGCGGTGC GCGAGGAGCA GAGCCGGGCG
GTGCTCGACC AGCGCATGCA CTGCCTGGCC GGGCGCCGGC GCAGCCTGGG CGCGCTCGCG
GCCGAGTTGC AGCGCATCGA CGCGGTCTCG GTGGCCAACG CCGCGCAGGC CGCCAGCCGC
CTGCCGTGGA TCGCCTCCTG CGCCAACGCC GATTACCTGA GCGAGCAGGT GCGGCCGCCG
GACAGTCCCG AGGTCGCGCG CGAGGTCGAG GCGCTGCAGG GGCTGCTCTC GCAGGCCGAA
CAGCTCGACG AGCTGGGACG CTACGGCGAG GGTCTGGCCA TCGCGCAGAG CGCGCTCGAG
CGCGCCATCG CCTCCGAGTA CCAGCCGATC CTGGCCCGCT CGCACCTGCA CGTCGGCGTG
CTGCAGCTCC GCGAGGGGCT CTACGAGCAG GCCGAGAGCC ACCTCAGCGA GGCCCACTAC
ATCGCGCGCG CGGCCGGCGA TCACGAGGTC GCGCTGCGCG CCGCCATCGA GCTGGTGTAC
GTGGTCGGCT TTCGACTGTC GCGCTTTGCC GAGGGGCTCA CCTGGAGCCG CCACGCCGAG
GCCGAGCTGC CCTGGGTGGG ATCGCGCGTG GCCGAGGCGC TGCTGCTGCA GCGCGTGGGC
GAGCTGTTGA CCATGCAGGG CGCGTACGAA GAGGCGCTCG CGCGGCTGCA GCGCGCGCTC
GGTCTGCTCG AGGACGCGCT CGGCCGCGAG CATCCCTCGG TGGCCGACGC CGCGGTCACG
CTGGCGATGA CCTACCACCG CCAGGGCCGC TTTGCCGAGG CGCGCGCGCG CTACCTGCGC
GCGCTGGCGA TTCAGAAGCA GGCGCTCGGC GAGGACCACC CGCAGATCGC GCGCACGCAC
AACAACCTCG GCCTGGCGCT GCGCGAGCAG GGCCGCTACG ACGAGGCCGC GGCCGAGTTC
GAGCGCGCCG CGGCCATCTG GCGCGTGCTG TTCGGGCCGG TGCACCCCTC GACCGCGGCG
CCGCTCAACA ACCTGGGCAC CGTGCGCCTG CGCCAGGGCG AGCTGGGCGA GGCGCTCAAT
CACTACCAGC AGGCGCTCGG GAGCTTCGAG GAGACCCTGG CTCCCGATCA CCCCAACCTG
GCCTATCCGC TGCTCGGCAT GGCGGTGGTC TATCTGCGCC AGAACCAGCC GGCGCGCGCG
CTGCCGCTGG CCCAGCGCGC GCTGCGCGTC CGCGAGGCCC GGGGCGTGGC CGCCGCCGAG
ATCGGCGAAG CTCGCTTCCT GGTCGCGCAG GCGCTCATGG GCCAGGGCGA GCGCAGCAAG
GCGCTGGCGC TGGCCCGCGA CGCGCGCGAC ACCCTGCGCG CCAGCGACGG CGTCTCGCCC
TACGTCAAGC TCGACGAGGT CGAGGCCTGG CTGAGCGAAC ACGGCCGCTG A
 
Protein sequence
MLGWSFLERL FLMSVSESKR HQRRGDATPP PEPASTALRE GTGEGVGVGD SEGESVGVGV 
GVGASASGDA SAVGTGDDSA ASDGVEAADD DTGSAASATD IGVSVPQPDR DEVTRRDLLP
ASEFAVTQAT IEALAPAGEA GHEAGREAGA GAGTGAAPDP ERRRARAHPR ASVAAPDVTL
APAMRIGRYS ILRELGRGGM GITYVGYDEE LERKVAIKLV RPELGGSGES EVRLRREAQA
LARLAHPNIV SVYEVGHYRG QTYLVMELIE GQSASAWLKQ SARTWPEVLE VFVQVGRGLQ
AAHAAGLIHR DVKPANMILG EDGRVRILDF GLAQLDGEPA EMASSDTLSV SEFETSRSSS
GERTALTAFG TTVGTPAYMA PEQIRGEVAD ARSDQFSFCV SLFEAVYGER PFSGSSMSVL
HSEVEREAIP EVIGKSSVPG WLHATIVRGL APRPKDRWPS MDALLEALKH EPGRARRRWF
AIAALAVGVL GLGSAGVAGY LAMESRREAL CSGAQAQMDE VWNRDTRAAI ERAMDATGVP
YAAASWRRTE SLLDDYAARW VSAHTDACEA TAVREEQSRA VLDQRMHCLA GRRRSLGALA
AELQRIDAVS VANAAQAASR LPWIASCANA DYLSEQVRPP DSPEVAREVE ALQGLLSQAE
QLDELGRYGE GLAIAQSALE RAIASEYQPI LARSHLHVGV LQLREGLYEQ AESHLSEAHY
IARAAGDHEV ALRAAIELVY VVGFRLSRFA EGLTWSRHAE AELPWVGSRV AEALLLQRVG
ELLTMQGAYE EALARLQRAL GLLEDALGRE HPSVADAAVT LAMTYHRQGR FAEARARYLR
ALAIQKQALG EDHPQIARTH NNLGLALREQ GRYDEAAAEF ERAAAIWRVL FGPVHPSTAA
PLNNLGTVRL RQGELGEALN HYQQALGSFE ETLAPDHPNL AYPLLGMAVV YLRQNQPARA
LPLAQRALRV REARGVAAAE IGEARFLVAQ ALMGQGERSK ALALARDARD TLRASDGVSP
YVKLDEVEAW LSEHGR
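
The gene sequence begins with GTG while the protein begins with M, which fits translation table 11, where GTG can act as a start codon and is then read as methionine. The sketch below shows one way to reproduce the protein from the gene sequence and to compute GC content. The function names are hypothetical, the start-codon set lists only three common bacterial start codons (the full table 11 list is longer), and the sequence blocks are assumed to be separated by whitespace only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same amino-acid assignments as the standard code; it differs mainly
# in which codons may start translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; a start codon in first position is read as M."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("GTGTTAGGAT GGTCGTTCCT CGAACGGCTG"))  # MLGWSFLERL
```

Passing the complete gene sequence to `translate` should yield the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field in the record.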