Gene Hoch_5101 details

Gene Information

Locus tag: Hoch_5101
Symbol:
ID: 8547512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7029995
End bp: 7032985
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 11
GC content: 72%
IMG OID: 646389777
Product: serine/threonine protein kinase
Protein accession: YP_003269482
Protein GI: 262198273
COG category:
    [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
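
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are inclusive and that the last codon of the CDS is a stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 7029995
end_bp = 7032985

# Assumption: both coordinates are inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 2991 bp
print(protein_length, "aa")  # 996 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.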


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAATA AGGTCATCGG CAGCTATCGC ATCACGCAGG AGTTGGGGGC AGGCGGTATG 
GGCGCCGTGT ACGCGGCCGA ACACACACTG CTCGGTCGCC GGGCGGCGAT CAAAGTGCTG
CTGCCGCAGA TGTCGCGCGA AGCGGAAATC GTCAACCGCT TCCTGAACGA GGCCAAGGCC
GCCTCGGCCA TCAAGCACCC GGGCATCGTC CAGATTTACG ACCTCGGCCA TCAGGAAGAT
GGCTCGGCCT ATATCGCCAT GGAGTACCTC GAGGGCGAGG GCCTCGAGGT GCGCATCCGC
CGCCTCGGCC GCTTGCCCGT GCAGCAGGCG CTGCGCTTCA CCGGCCAGAT CGCCAGCGCG
CTCGCGGCCG TGCACGAGCG CCGCATCTAC CACCGCGACC TCAAGCCCGG CAACGTATTC
CTGGTTCCCG ACAAGCAGGT CACCGGCGGC GAGCGCATCA AGCTGCTCGA CTTCGGCATC
GCCAAGCTGC ACGACCCCGA CCCCGGCGCC CAGGCCACGC GCGCGGGCGC GCTGATGGGC
TCGCCCTCGT ACATGTCGCC CGAGCAGTGC CGGGGCGCCG GCGAAGTCGA TCACCGCGCC
GACCTGTACT CGCTCGGCTG CATCCTCTAC GAGATGCTGT GCGGCCGGCC GCCCTTCGAC
GGCGGCGGCG GCGGCGTGAT GGCCATCCTC AGCGCCCAGC TCCGCGACGC CCCGCCCCTG
CCCAGCCAGT CGCGGCCCGA GCTCACGCCC GAGATCGACG CGCTGGTGCT CGGCCTGATG
GCCAAAGACC CCGACCTGCG CCCGCCCAGC GCGGCGCATC TGGCCCACGC CATCACCCAG
CTCACCGGCG AGCAGATGTC GTTTGCCGCC ATCGGCAGCG CCTTCGACCC CGGCGCGGCC
CCCGGCGCGG ACGACGACTT CGGCGGCGCC ACCATCGTCG ACATGCAGCT GCCGTCGACC
CTGGCCGCCA CCCAGCCCTC GCAGCAGACC GCGCAGGCCG CGACCCGGCC GAGCCACACC
GGCACTGGCA CTGGCACCGG CCTCGGCACC GGGCCGCACA CCAGCAACAC GCCGCTCATC
GAGCAGCTCG CCGGCAACAC CACGGGCGCG CGCAGCCGGC GCTCGACCGG CCTCATCACC
GGCCACGTGT CCGATCAAGG AACCAGCCCG ACCGGCAACA TGGCGGCCGC CGGCCAGACT
CGCGCCGGCG ACAGCCGCGT GCAGACCGCG CTCGGCCAGC GCGCCGGCGG CTCGCGTCGC
CTGCTGTGGG TGGTGGTGGC GCTGCTCCTG GTCGGCGGCC TCAGCGTCGC GCTCACGCAG
ATCGGGGGCT CGGACAAGCG CGCCGAGGTC CCCACCGACA GCGAGCAGCC CGAGGAGCCG
GTGGTGGCCA TGAACGACGA GCCGCCGCCG CCGCCCGAGC CCACCTTGGA CTTCTTCCCG
CTGGCCGAAG AGGAGCAGCG CCGCCGCCGC GAGCAGCCGC GGCCCGAGCC CATCCGCCTG
GCCGACCTGT TCATCCCGCT CGAGGAGGGC GGCGGCGTCA ACCTGCCCAA GCTGTCGATC
CCGAGCTTCA CCTGGGGCTC GGTGCTGTCC GGCAACTTCT TCGACAACCT GGTCGAATCC
ATCACCGAGG TGTTCGTCGA GCCGCCGCGC CAGGTGGTCT GGCGCATCGC CACCGAGCCC
GCGGGCGCCC AGGTGCTGCG CAACGGCAGC GAGGTGGTGG GCACCACCGA GGAGCCCTTT
GGCATCTCGC TCGAGGAGAC CCCGGGCTTC TCCGAGACCT TCATCCTGCG CCTCGATCGC
TACCACGATC ACGAGCTGAC CCTCAGCGGC GAGGAGGACT TCGACGAGGT CGTGGCCCTC
GAGCCCAAGG TCTACGCCAC CGTGGTCTCG CAGCCCGCGG GCGCCGAGGT GCTCGACGCC
GCCGGCGCCG TGCTCGGCAC CACCCCGCTC GAGCTCGAGC TGCCGCGCGC CCAGGACGAG
GGCGCGGCCG CGGGCAAGAC CGTGACCCTG CGCATGGAGC GCTTCCTCGA CACGCCCATC
GAGCTACGCG GCGAGCAGAC CTTCGAGGAG AAGGTCGCGC TGCCGCCGCG CGTCTACGCC
ACCGTGGTCT CGCGCCCCGA AGGCGCCGAG GTGCTCGACG CCAGCGGCGC CGCGCTCGGC
ACCACGCCGC TCGAGATCGA GCTGATGCCC GACGAGGCCG GCCAGCCCAC ACCGCAGACC
GTGACCCTGC GCATGGAGCG CTATGAGGAC GCCGAGCTCG AGCTGGCCGG CGAGCGCAGC
TTCCGCGAGA CCGTGCGTCT CGAGGCCAAG GTGTTCGCGA CCGTGAACTC GCAGCCCGAG
GGCGCGCAGG TGGTCGACGC CGCCGGCGAG CTGCTCGGCA CCACGCCGCT CGAGTTCGAG
CTGCCGCGCA GCGAGCAGCC GCTCGAGGTG ACGCTCAAGC TCGACGATCA CGTCGATACC
AGCGCCGTGC TGCGCGGCAA CCGCAGCTTC ACCAAGCGCG TCACCCTGCG CCCGCTGCCG
CGGGCCACCC TGGTCTCGCA GCCCGAGGGC GCGACCATCT ACGACGCCGC CGGTCAGCGC
GTCGGCACCG CGCCGCTCGA GCTCAAGCTC CCGGGCACCG GCGACGCGCT GGTGTACACC
ATGAAGCTCG AGGGCTACCG CGACGCCACG CTCGAGGTCG ATCCCCGGCG CGGCCGCAAG
ATCGTCACCA AGCTGAGCCG CGATCTCGGC ACCACCATGG TGAACATCAG CTCCGAGCCC
AGCGGCGCCG AGGTCTTCCG CGGCAACAAG CGCATCGGCG AGACCCCGCT CACCGACGAG
GTCCCGGGCC AGACCGGCAA GCTGCGCTAC ACGCTCAAGC TGCCCGGCTT CCAGACCCGC
AACATCGCGG TGGCCGGCGA TGAGAACAGC GAGACCTCGG TGCAGCTCAA GCGCTGCGCG
CCGCAGCGCC GCGGCACGCT GGGGCCGGTC TCTGTCTACG GCGGCTGCTG A
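
A GC fraction can be computed from a sequence laid out in space-separated blocks, as above. The following is a minimal sketch; the record does not say how its 72% figure was calculated or rounded, so the function name `gc_content` and the choice to count only the letters G and C are assumptions made for illustration.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (block separators, line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# The first two 10-base blocks of the gene sequence above.
first_blocks = "ATGACGAATA AGGTCATCGG"
print(gc_content(first_blocks))  # 0.45 (9 of 20 bases are G or C)
```

Passing the full gene sequence to the same function would give the whole-gene value to compare with the GC content field.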
 
Protein sequence
MTNKVIGSYR ITQELGAGGM GAVYAAEHTL LGRRAAIKVL LPQMSREAEI VNRFLNEAKA 
ASAIKHPGIV QIYDLGHQED GSAYIAMEYL EGEGLEVRIR RLGRLPVQQA LRFTGQIASA
LAAVHERRIY HRDLKPGNVF LVPDKQVTGG ERIKLLDFGI AKLHDPDPGA QATRAGALMG
SPSYMSPEQC RGAGEVDHRA DLYSLGCILY EMLCGRPPFD GGGGGVMAIL SAQLRDAPPL
PSQSRPELTP EIDALVLGLM AKDPDLRPPS AAHLAHAITQ LTGEQMSFAA IGSAFDPGAA
PGADDDFGGA TIVDMQLPST LAATQPSQQT AQAATRPSHT GTGTGTGLGT GPHTSNTPLI
EQLAGNTTGA RSRRSTGLIT GHVSDQGTSP TGNMAAAGQT RAGDSRVQTA LGQRAGGSRR
LLWVVVALLL VGGLSVALTQ IGGSDKRAEV PTDSEQPEEP VVAMNDEPPP PPEPTLDFFP
LAEEEQRRRR EQPRPEPIRL ADLFIPLEEG GGVNLPKLSI PSFTWGSVLS GNFFDNLVES
ITEVFVEPPR QVVWRIATEP AGAQVLRNGS EVVGTTEEPF GISLEETPGF SETFILRLDR
YHDHELTLSG EEDFDEVVAL EPKVYATVVS QPAGAEVLDA AGAVLGTTPL ELELPRAQDE
GAAAGKTVTL RMERFLDTPI ELRGEQTFEE KVALPPRVYA TVVSRPEGAE VLDASGAALG
TTPLEIELMP DEAGQPTPQT VTLRMERYED AELELAGERS FRETVRLEAK VFATVNSQPE
GAQVVDAAGE LLGTTPLEFE LPRSEQPLEV TLKLDDHVDT SAVLRGNRSF TKRVTLRPLP
RATLVSQPEG ATIYDAAGQR VGTAPLELKL PGTGDALVYT MKLEGYRDAT LEVDPRRGRK
IVTKLSRDLG TTMVNISSEP SGAEVFRGNK RIGETPLTDE VPGQTGKLRY TLKLPGFQTR
NIAVAGDENS ETSVQLKRCA PQRRGTLGPV SVYGGC
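
The protein sequence can be related to the gene sequence by codon-wise translation. The following is a minimal sketch that uses the standard codon assignments, which translation table 11 shares (the two tables differ in which codons may act as start codons); the names `CODON_TABLE` and `translate` are illustrative, and reading stops at the first stop codon by assumption.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACGAATA AGGTCATCGG CAGCTATCGC"))  # MTNKVIGSYR
```

The output matches the first block of the protein sequence listed above.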