Gene Hoch_5102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5102 
Symbol 
ID8547513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7033208 
End bp7035097 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content72% 
IMG OID646389778 
Productserine/threonine protein kinase 
Protein accessionYP_003269483 
Protein GI262198274 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.916534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGG GCCCCAAGCC GACCTCCCCC CCGCAATCTC CGCCGCCGGC GGAGCACGTC 
CCCGAGGTGG TGGGCAACTA CCGCATCCTG CGCGAGCTGG GTCGCGGCGG CATGGGCACC
GTGTACACGG CCGAGCACAA GCTGCTGGGC CGCGCGGCCG CGATCAAGCT GCTGGCGCCG
CGCTACGTCG ACCATCCCGA GGTCATGATG CGCTTCTTCA GCGAGGCGCA GGCCGCAGCC
GCGGCCAAGA ACCCGGGCAT CGTCGAGATC TACGACTTCG GCGAGCTGTC CGACGGCGGC
GGCGCCTTCA TCGCCATGGA GCTGCTCGAG GGCGAGGGCC TGGACGCGCG CCTGCGCCGC
AACGGCCGTA TCCCGCTGGC CCAGGCGCTG CTGTTCACCT CGCAGATCGC CAGCGCCCTG
GCCGCCGCCC ACGCCAACGG CATCGTCCAC CGCGACCTCA AACCCGCCAA CCTCTTCGTC
GTCCGCGACC CCCAGGTCGT CGGCGGCGAG CGCATCAAGA TCCTCGATTT CGGCATCGCC
AAGGTGTCGC TGGCGGTCGC CGAATCCGAG ATCGAAAACC AGGCCGAGGT CGAGACCCAC
AACCTCACCC GCGACGGCTC GGTCATGGGC ACGCCCACGT ACATGGCGCC CGAGCAGTGC
CGCGGCCAGC TCGACCTCGA CGCCCGCGCC GACCTGTACT CGCTGGGCTG CATCCTGTTC
GAGATGCTGT GCGGGCGCCC GCCCTTCACC GGCTCCTCGA GCATCGACCT GATGTCCGGG
CACCTGCGCG ACGCGCCGCC GCGGCCGAGC AGCATCGAGC CCGCCATCGG CCCCGAGCTC
GACGCCCTCA TCCTCAGCCT GCTGGCCAAA GACCCCAAAG ATCGCTTCCA GCGCGCCAGC
GATCTCGAGC GCGCGCTGCA CATGCTGCTC TCGGGCAACC CCGCCGGCGT CATCGCCTCG
CTCGGCGGCG AGCCGCCGGC ACCCGCGCGC GCGCCCTGGT ATCTGCGCCC GGGCGTGCTG
GTGCCCATGA TCCTGGCCAT GGCCGGCGGC GGCGTCGCCC TCGCGCTGCT GCAGCGCCCG
GCTGACGACA CCGGCGACGA CCCGCGCCGC CAGCCCGTGG CCCAGGTCGC CATCGACGCC
GCGCCGCCGC CGCCGCCCGA GCCGCCCGGC TTCTACCAGG CCGAGGGCCC GGACCAAGAG
CCCGAGGCCG TGGTCGCCGC GCGCGCGAGC GGCGACGCCG TGATCTGGCG CGTCGACAGC
ACCCCGCGCG ACGCTCAGGT GCTGTACCAG GACCAGCTCG TCGGCGACAC CAAGAGCCCG
CTCTTCGTGG TCATCGAGCG CCAGGGGCGC AGCGAGGGGC GCAGCGAACG CCTGCTGGTC
AAGCGCTTCG GCTTCGTCGA ACAGACCGTC GATCTCGACA CCGACAGCGG CGGCGTGCAC
CACGTCGAGC TGGTCAAGAA GATCGAGCTC ACCATCCGCT CGCAGCCCAG CGGCGCGCTC
ATCTACGGCG ACGACGACGA GGTCGCCGGC CGCACCCCGG GCATGGTGTT CTCGCCCCCG
GGCGAGGAGC CGCTGATGTT CACACTCAAA GCCGAGGGCT ACGCCGACGA GCCCATCGAG
ATCGTGCCCG ACGAGAACAA GGCGATCTCG GTCAAGATGG CGCCGCTGGT CACCCTGCGC
ATCGAATCCG AGCCCATGGG CGCCGAGGTC TGGCGCGATG GCGCCCGGCT CGGCGAGACC
CCCCTCGAGG ACCGCGTGGC CCGCGGCCGC GAACCGCTTA CCTACCGCAT TGCGTACACC
GGCTACCGGG ACGAGGAGTT GCAGATGATC CCGCGCCGCG ACGGCGAACG CCGCGTCACC
CTGCAAACCC TGGCCGCCGA CGCGGAATGA
 
Protein sequence
MTTGPKPTSP PQSPPPAEHV PEVVGNYRIL RELGRGGMGT VYTAEHKLLG RAAAIKLLAP 
RYVDHPEVMM RFFSEAQAAA AAKNPGIVEI YDFGELSDGG GAFIAMELLE GEGLDARLRR
NGRIPLAQAL LFTSQIASAL AAAHANGIVH RDLKPANLFV VRDPQVVGGE RIKILDFGIA
KVSLAVAESE IENQAEVETH NLTRDGSVMG TPTYMAPEQC RGQLDLDARA DLYSLGCILF
EMLCGRPPFT GSSSIDLMSG HLRDAPPRPS SIEPAIGPEL DALILSLLAK DPKDRFQRAS
DLERALHMLL SGNPAGVIAS LGGEPPAPAR APWYLRPGVL VPMILAMAGG GVALALLQRP
ADDTGDDPRR QPVAQVAIDA APPPPPEPPG FYQAEGPDQE PEAVVAARAS GDAVIWRVDS
TPRDAQVLYQ DQLVGDTKSP LFVVIERQGR SEGRSERLLV KRFGFVEQTV DLDTDSGGVH
HVELVKKIEL TIRSQPSGAL IYGDDDEVAG RTPGMVFSPP GEEPLMFTLK AEGYADEPIE
IVPDENKAIS VKMAPLVTLR IESEPMGAEV WRDGARLGET PLEDRVARGR EPLTYRIAYT
GYRDEELQMI PRRDGERRVT LQTLAADAE