Gene Hoch_5302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5302 
Symbol 
ID8547714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7290999 
End bp7293917 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content71% 
IMG OID646389976 
Productserine/threonine protein kinase 
Protein accessionYP_003269680 
Protein GI262198471 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGATC ACCAGGCGCT GGGCGCGGCC TCCACTCGCA GGCTGCCGGG GGGAAACGGC 
CGCGTCACCG AGGTCGTCGG CCCCGAGGAC TGGGACGATG GATGGGGTGG ACGGACCGAA
GCTTCGCAGC GCGTGCTCAG CGTCCCCGTG GGCACGCGGG TCAATCAATA CGAAATCATC
CGCGAGCTCG GCCGCGGCGG CATGGGCACC GTGCACCTGG CCCGCGACAC CAAACTGGGC
CGCCGCGTGG CCATCAAATT CCTGCACAGC CAGCAGGCGG CGCTCACCGA GCGCTTCCTC
ATCGAGGCCC GCACGACCGC GCAGTGCAGC CACGAGAACA TCGTGGTCAT CCACGAGGTC
AGCGAGCACC ACGGCCAGCC CTTCATGGTC CTCGAGTATC TGCAGGGCCA GTCGCTGCGC
GAGCTGATGA GCGCCGACGC GCTCTCTCCC GGACGCGCGG TCGAACTCGT GGTGCCGATC
GTGCGTGCCC TGGTGTGCGC GCACGCGCTC GATATCGTCC ACCGCGACCT CAAGCCCGAA
AATGTGTTCG TCACCGAGTC GGGCACCATC AAAGTGCTCG ATTTCGGCAT CGCCAAGCTG
TTGCTCTCGG ACGAAACCGT CCGCGACGCG GTCGAGCGCG GGGCGGGTGG CGAACGCGAG
GGCCGCGACC GCGCCCCCGA TATGCTCACC GGCGCGGGCG CCATCCTCGG CACGCTGCCG
TACATGGCAC CCGAGCAGTG GGGCGCAAGC ACGGTCGATC ATCGCAGCGA TCTGTGGGCT
GTGGGCATCA TCCTCTACGA GATGCTCGGC GGTCGCCACC CGCTGGCGCC GCTCACCCGC
GACAAGCTCA CCGACATCGC CGATCTCGGC ACGCCCATGC CCAAGCTGCG CGACTCGGGC
GCCGATGTGC CCGACGCCCT GGCCCGCCTG GTCGATCACT GCCTGGTCAA GCCCAAGCAG
TATCGCGTGG GCAGCGCCAA GCTCTTGCTC GAGGCGCTCG AGCCGCTGCT ACCGGGACGC
CAGGGCCGCC CGCTGGCCGT CGATGAGTGC CCCTACCCCG GCCTCACCGC CTTCCAGGAG
AGCGACGCCG ACCGCTTCTT CGGCCGCTCG CGCGACGTCG CCAACGCGCT CTCGCGACTC
GCCGGCCACC CCATGCTCGG CGTCGTCGGG CCCTCGGGCG TGGGCAAATC CTCGTTCGTG
CGCGCCGGCC TGATCCCGGC CCTCAAGCGT TCGGACCAAC CCTGGGAGAC CTTCATCATC
CGCCCGGGCC GGTATCCGCT GGCGGCGCTG GCCAACTTGC TGCAGCCGCT GCGCCGCAGC
CAGACCGAGG ACGGCGACCC GGCCACCGAG CACCGAGGGC TTATCGACCG ATTGTCCGAG
GAACCCGGCT ACCTCGGCGC GACGCTGCGC CACCGCGCGC GTATGCGCGG CGAGCGCATT
CTGCTGGTCA TCGACCAGTT CGAGGAGCTG TACACGCTGG TATCCGACCC CGAGCAGCGC
AAGGCCTTTA CCGCCTGCCT CAGCGGCGTG GCCGACGACA GCGCCGCGCC GCTGCGGGTG
ATCACCTCGC TGCGCTCGGA TTTCATCGAC CGCGTGGTCG AAGACCGCCA CTTCATGGCC
GCGCTCACCT CCAACCTGAT GCTGCTCGCC CCGCCGGACC GCGACGCGCT CGAGGAGGCC
TTGCTGCACC CGGCCGAGCT GGTCAACTAC CGCTTCGAGA CCCGCGATAT GGTCGAGCAC
ATGCTCAACA CCCTGGCCGC GACACCGAGC GCGCTTCCGC TGATGCAGTT CGCGGCCATC
AAGCTGTGGG AGGCCCGCGA CCGCGAGCAC CACACCTTCA CCGCCGCGAG CTACGACAGC
ATCGGCGGCA TCGGCGGCGC GCTGGCCAGC CATGCCGACG CGGTGGTCGC CGGTCTGCCA
CGCGCCGATC AGGCGCTCGC GCGGACCATG TTTCAGCACC TGGTCACGCC CGAGCGCACG
CGCGCCATCG CCTCGCGCAG CGACCTCCTG CCGCTGGCGC CCGACCCCGA GCAGGTGCAG
CGTCTGCTCG ATCGCCTGGT CGCCGCGCGG CTCTTGATCG TACAGACGGG CGACGACGCC
GAGGGCGTGA GCGTCGAGAT CGTCCACGAG TCGCTGGTGC ATAGCTGGCC GCGACTGCGA
CGCTGGCTCG ACGAGCACGA AGAGGACGCC GTGGTGCTCG AGCAGTTGCG CACCGCGGCC
AAGCAGTGGG AAGCCAAGAA ACGACCCCAG GGTCTGCTGT GGCGGGGCGA AACCCTCAAG
GAGGCGCGCC GCTGGCACCG CCGCTACCAG GGCACACTCA CGCCGCTACA GCGCGCGTAC
CTGCAGGCAG CGTGCGCGCT CGCCGACCGC GCCAAGCGCC GCGTGCGCCT GGGCATCGCC
GGGGCCATGG GCTTTTTCGT CCTGCTGGCC GCGGCCTCGA CCATCGCCCT GGTGCGCATC
CAGCGCGCCG AACGGGCCGC GCAGCAGCAG GCGCAGGCGG CGCGCACGGC CGCCCAGCAG
GTGAGCGAGC AGCTCTCGCT GGTGCGCGCC AAGGAGCGCG AGCGCAGCGC GGCCAAGGTC
AAGCAGGCCG AGGCCGAAGC CGAGGCCGAC CGCGCCCAGG GCGAGGTGCT GCGCAAGCAG
GCCGCGCTCG AACAGGCCAA CGAGCGGCTG CGCGGCGCGC TGAGCGAGGC CGAGGAGGCG
CGGCGACGCG CCGAGAACGA ATCGCTGCGC GCGCGCCACG CCCTGGCCGA GACCGAGCGC
GCCAAGGATC TGGCCGAATC CGAAGAGGCC CGCGCCCACA CGGCCATCGA GGCCGAGCGC
CAGGCGCGCG CCGAGCTGCA GCGCCTGCTG CGCAGAGAGC GCGAGCGCAC CGAGCGCCTG
CAGAAGAAGC TCGGCACCTT CGACAGCACG CTCAAATGA
 
Protein sequence
MSDHQALGAA STRRLPGGNG RVTEVVGPED WDDGWGGRTE ASQRVLSVPV GTRVNQYEII 
RELGRGGMGT VHLARDTKLG RRVAIKFLHS QQAALTERFL IEARTTAQCS HENIVVIHEV
SEHHGQPFMV LEYLQGQSLR ELMSADALSP GRAVELVVPI VRALVCAHAL DIVHRDLKPE
NVFVTESGTI KVLDFGIAKL LLSDETVRDA VERGAGGERE GRDRAPDMLT GAGAILGTLP
YMAPEQWGAS TVDHRSDLWA VGIILYEMLG GRHPLAPLTR DKLTDIADLG TPMPKLRDSG
ADVPDALARL VDHCLVKPKQ YRVGSAKLLL EALEPLLPGR QGRPLAVDEC PYPGLTAFQE
SDADRFFGRS RDVANALSRL AGHPMLGVVG PSGVGKSSFV RAGLIPALKR SDQPWETFII
RPGRYPLAAL ANLLQPLRRS QTEDGDPATE HRGLIDRLSE EPGYLGATLR HRARMRGERI
LLVIDQFEEL YTLVSDPEQR KAFTACLSGV ADDSAAPLRV ITSLRSDFID RVVEDRHFMA
ALTSNLMLLA PPDRDALEEA LLHPAELVNY RFETRDMVEH MLNTLAATPS ALPLMQFAAI
KLWEARDREH HTFTAASYDS IGGIGGALAS HADAVVAGLP RADQALARTM FQHLVTPERT
RAIASRSDLL PLAPDPEQVQ RLLDRLVAAR LLIVQTGDDA EGVSVEIVHE SLVHSWPRLR
RWLDEHEEDA VVLEQLRTAA KQWEAKKRPQ GLLWRGETLK EARRWHRRYQ GTLTPLQRAY
LQAACALADR AKRRVRLGIA GAMGFFVLLA AASTIALVRI QRAERAAQQQ AQAARTAAQQ
VSEQLSLVRA KERERSAAKV KQAEAEAEAD RAQGEVLRKQ AALEQANERL RGALSEAEEA
RRRAENESLR ARHALAETER AKDLAESEEA RAHTAIEAER QARAELQRLL RRERERTERL
QKKLGTFDST LK