Gene Hoch_3878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3878 
Symbol 
ID8546274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5335997 
End bp5337187 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content71% 
IMG OID646388550 
Productserine/threonine protein kinase 
Protein accessionYP_003268270 
Protein GI262197061 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.683762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.402263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAAAG CCCAGCGTCA GAAGAGTGGT TACTTCGCTA CTGCCGTCAG CACCTTTGAT 
TGTTCGGTGG CGGCGGCCCG GCGTGACGCG CCCATGCAGG CGCCATTTCT GGCCCACGGC
AGCCCCCGCG ACGGCCTGGG CGGGACCGCG CCCACGCAGA CCCCGAGCCG TGGGGAACCG
ACACGGGTGC TGCAGCCCGG CACGGTCATC GCCGGCCTGT ATCGCATCGA CGATCTGCTC
GGCACCGGCG GCATGGGCCA GGTCTACGCC GCCACGCAGC TCCTCACCGG CGTGCACTAC
GCGCTCAAGG TGTGTCACCC GGGCATCCCG GCCGAGATCC TCGACGCCGA GGTGCGCGCG
CTCGAGGCGG TCCTGCATCC CGGCATCGTG CGCGTGCACG CCACCGGTAG CCACGAGGGC
ATCCCGTTCC TGATCATGGA GCGCATCTAC GGCAGCACTC TGCGCCAGCA CCTGCACGAG
GCCGAGGCCG AGAACCTCGC GTGCGGTCGC GCCGCGCCGC GCATGCCGGT CGAGCGTGTC
ATTTCGATCC TGGCGAGCAT CGCCGACGCG CTGGCCGTGC TGCACGAGCA CGGCTTCGTC
CACCGCGACC TCAAGCCCTC GAACATCATG CTCACCAGCG ACGATCGCCC GGTGCTGCTC
GATCTCGGCG TCTCCTGCCA GAGCATCGAG GCCGAGCACG AGCGCCGCCT GGCCGGCTCG
CCGCACTACA TCGCGCCCGA GGTCATCACC GCGTCGATCG CCAAGCATCA GGCGCCGTGC
ATCGACATCT ACGCGCTCGG CGTCATCGCC TTCGAGATGC TCACCGGCGC GCGCCCCTTC
GACAGCCACA CGCAGCTCGA TCCCCTGCGC CAGCAGCTCC ACGCGGTCCC GCCGCGGGTG
AGCGAGCTGG TCGCCGAGGT GCCCCAGGGC CTCGAGCACC TCATCGAGGA GATGCTGCGC
AAAGAGGCCG ACGAGCGCCC GCGCTCGGCC CGCGTGGTGG CCGCGCGTCT GCGCGCGCTG
CAGCACGCGG CCAACGCCAC TCGCCTGGTG CGCAACGCGC GTCGCCGCCG CCCGAGTATG
CGCCGCCGTC GCGTCACCGA ATCGGTCGTG CTCTCGAACT CGCTGTGCGG TCGCCCGACG
CTGCCGCCGC GCCCCATGCG CCGCGTGCGC CGCAGCCCCA AGGCCGAGTG A
 
Protein sequence
MWKAQRQKSG YFATAVSTFD CSVAAARRDA PMQAPFLAHG SPRDGLGGTA PTQTPSRGEP 
TRVLQPGTVI AGLYRIDDLL GTGGMGQVYA ATQLLTGVHY ALKVCHPGIP AEILDAEVRA
LEAVLHPGIV RVHATGSHEG IPFLIMERIY GSTLRQHLHE AEAENLACGR AAPRMPVERV
ISILASIADA LAVLHEHGFV HRDLKPSNIM LTSDDRPVLL DLGVSCQSIE AEHERRLAGS
PHYIAPEVIT ASIAKHQAPC IDIYALGVIA FEMLTGARPF DSHTQLDPLR QQLHAVPPRV
SELVAEVPQG LEHLIEEMLR KEADERPRSA RVVAARLRAL QHAANATRLV RNARRRRPSM
RRRRVTESVV LSNSLCGRPT LPPRPMRRVR RSPKAE