Gene Hoch_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1422 
Symbol 
ID8543804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1918100 
End bp1921240 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content70% 
IMG OID646386134 
Productserine/threonine protein kinase 
Protein accessionYP_003265869 
Protein GI262194660 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTGC GCAAGCACCA GCGCCGAGCC GGCAACGAGA CCTCGAGCTC CCTCGATACG 
CTTCAGGAGG CGTCGCCGCG CTCCGCTTGT GCGGCACCCG GGGAGTCCGT CACCGCGATA
TCGCTGGCGG CGACCGTGAG CGAGGACATC GAGAATCGCG CGGACGGCGC GGGCGCCGGC
AAGGACGAGC CCACCCTGGC ATTGGACGAA CTGAGCGGCG ACGGCACCGA GCATGCGCGT
CCGATCGTTG AGGCGCTGCG CAAGTCGCCG CATTCGCAGG GAGGGCAGCG AATAGGCCGC
TACGAGCTGA TCCGCTCGCT GGGCCGCGGT GGCATGGGCG AGGTGTTTCT GGCGCGCGAT
CTGCGGCTCG GGCGCCTGGT CGCGCTCAAG CGCCTGCACG CGCCGGGGAC CGGCCTGGTC
GAGCGTTTCC TGCGCGAGGC GCGGACCACG GCGCGGTGCA CGCACGAGAA CATCGTGGTC
ATTCACGAAG TCGGCGAGCA CGGCGGCTAT CCATTCATGG TGCTCGAGTA TCTCGAGGGC
CACACGCTGC GCGAGTGGAT GAACGCTCGC GTACGGCGCA TGAACGAGCC GGGACCGCTG
CCGCCGGCGC GCGCGGTCGA GCTGATGCTG CCGGTGGTGC GCGCGCTCAG CTACGCTCAC
GCCCGCGGCG TGGTGCACCG CGATCTCAAG CCCGAGAACG TCATGCTCAC CCGCAGCGGC
ACCATCAAGG TGCTCGACTT CGGCATCGCC AAGCTGCTGT CGGTGGCCCG CGGCGAGGAG
GAACCTGCCG ACGGTGTACC CGTGGACGTC GGCGAGTTCC ATTCCGCCCG CGTGTCCGGG
ATGATGCCGG CCTACAGCAG CGCGCGCATC GGCACGCTGC CGTATATGTC GCCCGAGCAG
ATGAACGCCG GCTTGATCGA CCACCGCAGC GATCTCTGGG CCGTGGGCAT CATGCTGTTC
GAGCTGGTGA CCGGCCGCCA CCCCATGGCG GATGATTCGC GCGCGCAGCT CCTGCGCATC
GCCGAGCTCG ACGAACCCAT GCCCAGCGTG CTCGAGGTGA TGCCCGAGCT GGTGCTCGAG
ATGCGTGCGC TGGCCAGTAT CATCGACCGC TGTCTGATCA AGAATCGGGC GCATCGCACG
GCCAAGGCGC GCGTCCTGCT GGCCGAACTC GAGGCGCTGG CCACCGGGCG GCGCGCGCGG
CTCACGAACG AGCACAGTAA TCCCTTTGCC GGTCTGGCTG CATTTCAAGA GACGGACGCC
GGCCGCTTTT TCGGACGCGA CTGCGATATC AATCAGGTGG TCACCGACTT GCGCAGCCGG
CCGCTGGTGG CCGTGGTCGG TCCCTCGGGC GCGGGTAAGT CGTCGCTGGT ACGCGCCGGT
ATGATCCCGC GGCTCAAGCA ATCGGGGGAG GGCTGGGATG CCCACGTGTT GCGTCCGGGC
CGCGAACCGC TCAGCGCGCT CGGCGGCCTG CTGGCGGCGC TGTGTCAGGA CGTGGGCGAG
CCGCTCGAAG TGCAGGGAGG GGCGGGCGAG ATTGTTACCA GCGACGAGAT TCTGCTCAAC
GGCACGCTGC CGGCCGTGGC CTTTCGCGAG CGTCTGCGCG CCGAGCCTGG CGCGTTCGGC
GCGCTGCTGC GCTCGTGGGC GCGGCGCACG CGGCGGCAGG CCGTGGTCTT CGTCGACCAG
TTCGAGGAAC TCTACACCCT GGGCGCCGAC GCCGAAGAGC GCGCCGTGTT TACCGCCTGC
CTCGACGGGG CGGCCGACGA CGCCAGCTCG CCAGTGCGCG TGGTGTTGGC CCTGCGTGCA
GATTTTTTCG AACGATTGGC CGATAATCGC GTGCTCCTCG ACAAGGTGAG CCGCAGCATG
TGTTTCGTGT CGCCGCTGGA TCGCGACGGT CTGCGCGAGG CCCTGACCCG GCCGCTCGAG
GCCGCCGAGC ACAGCTACGA GGACAACGCT TTGGTCGAGG AGATGCTCGA CGTCATCGAC
GGCACGGCCT CTGCGCTGCC GCTGCTGCAG TTTGCCGCGC GCGCGCTGTG GAACCGGCGC
GACCCCAAGC GTCGCCAGCT CACGCGCGCC GGCTACATGG AGATGGGCGG AATGGCCGGT
GCGTTGGCCG TGCACGCCGA CAACGTCCTG GCCTCGATGA GTCAGGCCGC CCGCGGCCAG
GCCAAAGCCG TGTTTCTGCG CCTGGTCACG CTCGCGCGGA CCCGGGCGCC AGCGACCATG
AGCGAGCTGC GCGAACTCCC CGGCGACGCA GAGACCCTGG AAGCCGTGGT CGCCCGCCTG
GTCGACGCGC GCCTGCTGGC GGTGGAGGGC GGCCGTGCGG ATACCGACGG CACGGTCGAA
ATCGTCCACG AGTGCCTGAT CGAGAGCTGG CCGACGCTGA GCACCTGGCT CGACGAGTCG
CGCGAGGACG CCGCGTTTCG CGCCCGTCTG CGCGCGGCTG CCCGGCAGTG GTGCGAAAAC
GGCCGCCGCG AGGGGCTGCT GTGGCGCGGC GAGCCGGCCC GCGAAGCCGC GCTGTGGCGG
GCGCGTTACG GGCGCGAGCT GCCCAAGACC GAGCGCGCCT ATCTCAGCGC AGTGCTGGCC
CTGGAGCGGC GCCAGCGGCG GTGGGGCCTG GTCGTCTGCA CGGCGGTGTT TGCCACCGTG
CTCGTGCTGC TCGGCGCCGC GTTGACCATG GACCGGCTCC GCAGCAACTC GCACGAGGCC
AGGGCGCAGG CCAGCTATGC TGAGGATCGG GCCGAGATTG CCAAGAAGCA ACGCACCCTC
GCCGAGCAGC GCGCCAACGA GCTGGCGCTC AAGAAGCGGG AACTCGACGA TAGCCTGGAG
GCCACGGAGA GGGCCCGGCG CGCAGCCGAA GACGCGCGAG CAACCGCCGA GAGGGCAAAA
GACGCCGAGG CGCGCGCCCG CAGGTCGACC GAGGCCGCGC TGCGCAAGGC CGAGCTGGCG
CGTCGCCACG CCGAGACCGT GTCGGGGCGG TTGCGGGAGG CCGAGGCGGA GCTCCGCACA
GCGCTCGACC GCGCCGAGCA GTCCGCCGAG CGCGAGCGCG AGCTCAGAGA ACGGCTCGAG
GACGTGATCC GGCGCGCCCT GGGCGCCGAC CTCGACGAGC ATATCCCGGG CATGGGGAGC
CAGAGCGAGG AGACGATATG A
 
Protein sequence
MTVRKHQRRA GNETSSSLDT LQEASPRSAC AAPGESVTAI SLAATVSEDI ENRADGAGAG 
KDEPTLALDE LSGDGTEHAR PIVEALRKSP HSQGGQRIGR YELIRSLGRG GMGEVFLARD
LRLGRLVALK RLHAPGTGLV ERFLREARTT ARCTHENIVV IHEVGEHGGY PFMVLEYLEG
HTLREWMNAR VRRMNEPGPL PPARAVELML PVVRALSYAH ARGVVHRDLK PENVMLTRSG
TIKVLDFGIA KLLSVARGEE EPADGVPVDV GEFHSARVSG MMPAYSSARI GTLPYMSPEQ
MNAGLIDHRS DLWAVGIMLF ELVTGRHPMA DDSRAQLLRI AELDEPMPSV LEVMPELVLE
MRALASIIDR CLIKNRAHRT AKARVLLAEL EALATGRRAR LTNEHSNPFA GLAAFQETDA
GRFFGRDCDI NQVVTDLRSR PLVAVVGPSG AGKSSLVRAG MIPRLKQSGE GWDAHVLRPG
REPLSALGGL LAALCQDVGE PLEVQGGAGE IVTSDEILLN GTLPAVAFRE RLRAEPGAFG
ALLRSWARRT RRQAVVFVDQ FEELYTLGAD AEERAVFTAC LDGAADDASS PVRVVLALRA
DFFERLADNR VLLDKVSRSM CFVSPLDRDG LREALTRPLE AAEHSYEDNA LVEEMLDVID
GTASALPLLQ FAARALWNRR DPKRRQLTRA GYMEMGGMAG ALAVHADNVL ASMSQAARGQ
AKAVFLRLVT LARTRAPATM SELRELPGDA ETLEAVVARL VDARLLAVEG GRADTDGTVE
IVHECLIESW PTLSTWLDES REDAAFRARL RAAARQWCEN GRREGLLWRG EPAREAALWR
ARYGRELPKT ERAYLSAVLA LERRQRRWGL VVCTAVFATV LVLLGAALTM DRLRSNSHEA
RAQASYAEDR AEIAKKQRTL AEQRANELAL KKRELDDSLE ATERARRAAE DARATAERAK
DAEARARRST EAALRKAELA RRHAETVSGR LREAEAELRT ALDRAEQSAE RERELRERLE
DVIRRALGAD LDEHIPGMGS QSEETI