Gene Hoch_0201 details


Gene Information

Locus tag: Hoch_0201
Symbol:
ID: 8542580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 300514
End bp: 302703
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 73%
IMG OID: 646384997
Product: serine/threonine protein kinase
Protein accession: YP_003264735
Protein GI: 262193526
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
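
The coordinate and length fields above are consistent with one another, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and that the reported gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record for Hoch_0201.
length_bp = gene_length(300514, 302703)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2190 729
```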


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGATA CGCGCATCGG CGGCTACCGC ATCACCAAGC GAATCGCCAA GGGCGGCATG 
GGCGAGGTCT ACCTCGCGCG CCACGAGCTC ATGGAGCGCG AGGCCGCGAT CAAGGTCCTG
CACCCGGACC TGAGCGGCAA CGAGCAGCAG GTCAACCGCT TCCTCAACGA GGCCCGCGCC
ACCGCCAGCA TTCGCCACCC GGGCATCGTC GAGATCTTCG ACGTCGGCCA CGAGGGCGCG
CGCGCGTACA TCGTCATGGA GTACCTGCGC GGCGAGATGC TGGCCTCGCG GCTGGCGCGC
ACGCGCATCG AGATCGACAA AGCGCTGCAG TTCACGCGGC AGATCGCGGG CGCCCTGGGC
GCGGCCCACG CCTGCGGCAT CATCCACCGC GACCTCAAGC CCGAGAATAT CTTCGTCGTG
CCCGACCCCG ACGTCATCGG CGGCGAGCGC ACCAAGATCC TCGACTTCGG CATCGCCAAG
CTGCTCGAGT CGCGCGGCGG CGTGCACACC GTGCAGGGCA CGATGTTCGG CACGCCCGCG
TACATGGCGC CCGAGCAGTG CGAGGACGCC GCCCAGGTCG ACCGCCGCGC CGACCTCTAC
GCCCTGGGCT GCATCCTGTA CGAGTTTCTG TGCGGCACGC CGCCCTTTGG CCGCGGCGGC
ATCGAGCTGG TCGCCGCCCA CCTGCGCGAT GTCCCCACGC CCGTGCGCGA CCGCGAGCCG
CGCGTGTCCG AGGCGCTCGA CGCCGTGGTC ATGCGCCTGC TGGCCAAAGA CCCCGAGCAG
CGCTACGGCA CCTGCGAGGC GCTGGTGCGC GCGCTCGACC AGGTACCGAA CGCGCGCACG
CCGCAGCCCG AGCTCGCCTT CGCCGAGACC TCGCAGGGCA TCGCCGGCGA TATCCAGCAC
GCGGCCACGG CCACGGCCCA GCCCGGCGTC GGACGCGTGT CCGTCAAAAA CCCGGGGACC
GAACCCGGCG CGATCATGAA CCCGGGCACG GCCGCGGGCA CCAGCGTGGC CGCGGCCACC
AGCCCTGGCC AGGACGGCAA CGCGTCCACC CACGGGGGCG CCGGCGCGGC TCCCACCCAG
TCCGGCGCGA GCGCCAGCCC GGCCGCGGCC GAACCCGTGA ACCTGCCCTC GAGCGGTCCG
CAGCCGAGCG GGCGCGGACG CGGCCCGGGA CTGTGGCTGG CCGCGCTTGT CATCCTGAGC
GTGGTCGGCG GCCTGGTGTG GTGGCAAAGC AGCGGCTCGC AGACGCCGAG CGCGCCGGTC
GAAGCCGGCC CCAGCCCCGA GGAGAACCTC GAGCGCGCGC GCCAGGCCAT GAGCGCGCGC
GATTGGCAGT CGGCCGAGTT CTCGGCCAAC GAGGCGCTGC GTGCGCTCGA GAAGCGCGGC
GCCGAGGGCG ACGACGCGCT GATCGAAGAG ATCTCGGCGC TCCGCCGCCT GGCCCAGGAC
GAGCAGCACA ACCAGTACGA ATTCGAAAAG TTCCAGCGCG CCTTCGAAGC CGACGAGCTG
CTCGACGCGG TCTCGATCCT CGAGGACATC ACAGCCGAGA GCGTGTACCG GCCCGAGGCC
GAGGCCCTGC TCGCGCCCGC GCGCGAGGCC TGGCTGGTCG AGCAGCGCGC GGCCGCCGCC
GAGGCCGCCG AACGCGGTCG CTGCCGGTCC GTGCGGCAGA TCAACCAGGC CGCCCACCAG
CTCTTCCCGG GCGGCGACCC CGAGATCGAA GAGCAGCTCG CGAACTGTCG CGAGCAGCGC
CGCGACGACG ACGACGACAA AGCCGAAGAG CCCGCCAAGC CGAGCCGGCC CACCCGTCCT
CGCCGACCGC GCCAGGGAGG CGGCTCCGGC TCGGGCTCCG GCGGCTCCGG CTCAGGCTCC
GGCGGCTCCG ATGGCTCTGG CGGTGATTCC GGTGGTGATC CCGGTGGCGA CTCGGGCGGC
GACGCCGAGA CCCCGCCTCC GACCGACCCG CTCGGCGGGT TCCGCGCCCT GGTGCTGGTG
GTGCAGCGAC GCATCGTCCG CTGCGCGAGC AACAACGACG TGAGCGGAGA CCACGTGTTT
ATCGAGGTCA TCATCGCCGC GGACGGCACG GTGACGCTCG TGCCCGACAG CGACAACCAG
GCCTTCGCCA CCTGCGTGCG CAAGATCGCC ATCCCGCCGG TGCCCGGCAT GCCGCCGGTG
CGCCAAACCG TCAGGGTACC GCTGTCATGA
 
Protein sequence
MLDTRIGGYR ITKRIAKGGM GEVYLARHEL MEREAAIKVL HPDLSGNEQQ VNRFLNEARA 
TASIRHPGIV EIFDVGHEGA RAYIVMEYLR GEMLASRLAR TRIEIDKALQ FTRQIAGALG
AAHACGIIHR DLKPENIFVV PDPDVIGGER TKILDFGIAK LLESRGGVHT VQGTMFGTPA
YMAPEQCEDA AQVDRRADLY ALGCILYEFL CGTPPFGRGG IELVAAHLRD VPTPVRDREP
RVSEALDAVV MRLLAKDPEQ RYGTCEALVR ALDQVPNART PQPELAFAET SQGIAGDIQH
AATATAQPGV GRVSVKNPGT EPGAIMNPGT AAGTSVAAAT SPGQDGNAST HGGAGAAPTQ
SGASASPAAA EPVNLPSSGP QPSGRGRGPG LWLAALVILS VVGGLVWWQS SGSQTPSAPV
EAGPSPEENL ERARQAMSAR DWQSAEFSAN EALRALEKRG AEGDDALIEE ISALRRLAQD
EQHNQYEFEK FQRAFEADEL LDAVSILEDI TAESVYRPEA EALLAPAREA WLVEQRAAAA
EAAERGRCRS VRQINQAAHQ LFPGGDPEIE EQLANCREQR RDDDDDKAEE PAKPSRPTRP
RRPRQGGGSG SGSGGSGSGS GGSDGSGGDS GGDPGGDSGG DAETPPPTDP LGGFRALVLV
VQRRIVRCAS NNDVSGDHVF IEVIIAADGT VTLVPDSDNQ AFATCVRKIA IPPVPGMPPV
RQTVRVPLS
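
The gene and protein sequences are printed in space-separated blocks of ten, so the whitespace has to be stripped before any analysis. The sketch below is a hedged illustration: it cleans a row, translates it, and computes a GC fraction. All function names are hypothetical. The codon map is built from the standard code, on the assumption that translation table 11 shares its codon assignments and differs only in permitted start codons. Only the first and last rows of the gene sequence are used here, to keep the example short.

```python
import re

# Standard codon assignments in TCAG order; table 11 is assumed to match these.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used on the page."""
    return re.sub(r"\s+", "", seq).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence as shown in the record.
first_row = "ATGCTCGATA CGCGCATCGG CGGCTACCGC ATCACCAAGC GAATCGCCAA GGGCGGCATG"
last_row = "CGCCAAACCG TCAGGGTACC GCTGTCATGA"

print(translate(first_row))  # MLDTRIGGYRITKRIAKGGM
print(translate(last_row))   # RQTVRVPLS
```

Both results match the start and end of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the reported GC content.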