Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_5349 |
Symbol | |
ID | 8547761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 7361348 |
End bp | 7364398 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646390022 |
Product | serine/threonine protein kinase with TPR repeats |
Protein accession | YP_003269726 |
Protein GI | 262198517 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.553326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCCC GCTTCCCCAA CCAGCCTCGT GACCGCCAAC CCGGCACGCC GACGCAGATC GGCGAAGAGG CCGAGTACAC TGACAAGATG GCGGCGGCGC CCACCGATGA TTTCCTGCCC GAGGCTCTCC CCACCGATCG CACCCTGCAA TCGGCCGCGG ACACCGGCGC GTTGGCCACC CTGAAGGGGT CGCCGCTCGG CCACCTGGGA CGCTACGAGC TGCTCGAGTA CAAAGCCGCC GGCGCCATGG GCGAAATCTA CATCGCCTAC GATGCCCAGC TCGACCGCAA GCTGGCGCTC AAAGTGTTGC GCCCCGAGGT CAGGCGCCGC AGCGACCACG CCTCGCAGCG GCTGCTGCGC GAGGCTCAGA GCCTGGCTCA GGTGTCGCAT CCCAACGTGG TGCCGGTCTA CGAGGCCGGC GAGATCGACG GTCGGGTCTT CGTCGCCATG GAGCTGGTCG CCGGCGAGCC GCTCGGCCGC TGGCTGGCCA AGCACGCGCC GCCCGCCGCG CGCTTCTGGC GCTCCATCCT CGACACCTTC CTCGAGGCCG GTCGCGGGCT GGCCGCCGTC CACGAGGCCG GCTTGCTGCA CCGGGACTTC AAACCCGACA ACGTGCTGGT CGGCGATGAC GGCCGCGTGC GCGTGGTCGA CTTCGGTCTG GCGCGGGCCA GCAACACCAC GGCCAGCATC GACGAACGCT CCACCACCTC CATGGAACTC GAACCCCATG CGAGCGCGCA GCGCCCCCGC GCATGGACCG CCGATCTCAC CCGCAGCGGC GTGGTCGTCG GCACGCCGGC GTACATGGCG CCCGAGCAGA TCAGAGGCGG CACGGTCGAC GCGCGCAGCG ATCAGTTCAG CTACTGCGTG GCCCTGTACG AAGCCCTCTA CGGCCACCGA CCCTTCCGCG GCGAGGACTT CACCAGCCTG CGTGCCTCGG TGCTAAGCGG CGAAGTCCTG GCGCCTTCGG CGTCGGCCAA GGTGCCCTCG TGGGTGTGGC GCGTGCTGCT GCGCGGCCTC GCGACGGAGC CCGAGCAGCG CTATCCCGAC ATGCACGCGC TGCTCTCGGC CCTGTCCGCG GATCCGGCGC GCAAACGGCG CCAGCGGGGC ACCGCGGTCG CGCTGCTGTT TGGCGGCACC GCGGCCGGCG TGCTGGCGAC CATGCTCGCC AGCGAGCACC GCGACGAGCG CTGTACCGCC GCAGCCGAGA GCGCACTAGA GGAAGTGTGG ACGGAAGCCA CCCGCAACCA GGTGCGCGCC GCCTTCGCCG CATCCACCCT GCCCTACGCG GCCACCGCGT GGCAGAGCGT CGACCGCGGT ATTCAGGACT ATATCCGCCG CTGGCAGCAC AGCTCGATCG AGCTATGCGA ATCCGACGCC GAAGACGGCC TCGACGGCAC CGCGGACGGC CACGCTCCCA CCACGCTCTG TCTCGATACG CAAGCAGAGC GCCTGCAGCT ACTGGTCACC GAGCTGGTTC ACGCCGACGC CCACCTGGTC GAAAACGCCT CCTCTGCGCT CGCGTTCCTC GGACAACCCG AGGCCTGTGG GACCCAATTC AGCAGTCTGC CGCCAGCGCC CGACGCGCAA ACCCGGGACA AGCTGTCCCA GATCCGCACG GCCATGAGCA GCGCGCGCCT GCAGGCTATC GCCGGCTCCT TCGACCCGGC TCTGCGTCAG CTCGACGACC AGATCGAAGC CGCCGAGTCC CTGTCCTACG AGCCGGTGCT GGCCGAGGCT CTGTACCACG CCGGCGACGC GCGCCTCGAA CGCGGACGCG AGGACGAAAT AGAGACCGGC ACCGAGCTGC TCAAGCGAGC GCTCGACATC GCCGAGAGCA GCGGCAACGA CGCCCTCGCC ACCGACATCT GGAACGCGCT GGCGCGACGC CGCGAGGCCA CCTTGCCGCC CGAGCAGATC GCCTTCTGGT CGCGTCGGGC GCTGGCGCTC ACCAAGCGCA TGAGCAGCTA TGACCAGCGC CGCGCGCAAG CGCTGCGCAA TCTGGGCACG GCCCTGTATC GCTCGCAGCA GTACACCGAG GCTGAAGTCT ACCAGCGCCG GGCCATCGAC CTAGCGCGCA GCAACAACGC CTCGCCGCTG CTGAGCGCCG ATATGCTGCA CGCGCTCGCC AACACGCTGC ACGCGCTCGC CAAGTACGAC GACGCCCGCG TGCACTACGA AGAGGCGCTG GCGCTGGCCG ACGGCGAGCT CGGTCGCGGT CACCCCAAAG TGCAGGCGCT GCGCTTCGAC TTTGCCGATT TCCTCATCGA GACCGCCGAG CTCAGCGGCG ACGACGGCAA AGGCGCGCTC GCGCTCGACC AGGCGCGTAC CTTCCTCGAT ACCGCCCGCA CCATCCGCGC CCAGGTCTAC GGCCCGCAGA GCCCGCAGGT CGCCGCGGTC CACGTGGCGC TGGCCAAGCT CGAGACCAAG AGCGGCGCCC TCGATGAGGC CGCGTACCAC GCCGGACGCG CCATCGATTT CTATCGTGCG CACTACGGCG AACACGCGCC GCAGCTCGCC GAAGCGCTGT CGCAGCTGGG ATTCGTCCAG TTCCGCAGCC GGCGCTACAA GGAAGCCCTC GTCGCCTGGC AGCAGGAGAG CGCCCTGCGC GAGCAAAGCG GCGCCCTCCC CATCGTGCGC GGCCTCAACC AGAGCAACAT CGCCGAGGCC TTGGTCCACC TGCGCCGCTA CGACGAGGCC TTCGCCGCCT TCGAGCGCGC CCAAGAATAC TACGGCGCCG AAGACGATAT CTCCCCGATT TACTCCGGCC TGGTCGAAAA AGGCCGCGGC CAAGCACTGC TGGGACAGGG TCGCGCCGCG GCCGCGGTCC CGCACCTCGA GGCCGCACTC GCCGTCTTCC ACGAGCACCC TTTCGACGTG CTCGAAGACG CGGATACAGC CTGGAGCCTG GCGCGTGCGC TGCGCATCAG CGCACCCACA CGCCTCGAAG AAGCGCGTTC CTTGGCAACC AAAGCCGAGG ACATCTATCG CGCCAACGAG GCCACTCGCG ACATAGCAGA CGAAATCCGC ACCTGGCTCG ACTCTCTCTG A
|
Protein sequence | MSSRFPNQPR DRQPGTPTQI GEEAEYTDKM AAAPTDDFLP EALPTDRTLQ SAADTGALAT LKGSPLGHLG RYELLEYKAA GAMGEIYIAY DAQLDRKLAL KVLRPEVRRR SDHASQRLLR EAQSLAQVSH PNVVPVYEAG EIDGRVFVAM ELVAGEPLGR WLAKHAPPAA RFWRSILDTF LEAGRGLAAV HEAGLLHRDF KPDNVLVGDD GRVRVVDFGL ARASNTTASI DERSTTSMEL EPHASAQRPR AWTADLTRSG VVVGTPAYMA PEQIRGGTVD ARSDQFSYCV ALYEALYGHR PFRGEDFTSL RASVLSGEVL APSASAKVPS WVWRVLLRGL ATEPEQRYPD MHALLSALSA DPARKRRQRG TAVALLFGGT AAGVLATMLA SEHRDERCTA AAESALEEVW TEATRNQVRA AFAASTLPYA ATAWQSVDRG IQDYIRRWQH SSIELCESDA EDGLDGTADG HAPTTLCLDT QAERLQLLVT ELVHADAHLV ENASSALAFL GQPEACGTQF SSLPPAPDAQ TRDKLSQIRT AMSSARLQAI AGSFDPALRQ LDDQIEAAES LSYEPVLAEA LYHAGDARLE RGREDEIETG TELLKRALDI AESSGNDALA TDIWNALARR REATLPPEQI AFWSRRALAL TKRMSSYDQR RAQALRNLGT ALYRSQQYTE AEVYQRRAID LARSNNASPL LSADMLHALA NTLHALAKYD DARVHYEEAL ALADGELGRG HPKVQALRFD FADFLIETAE LSGDDGKGAL ALDQARTFLD TARTIRAQVY GPQSPQVAAV HVALAKLETK SGALDEAAYH AGRAIDFYRA HYGEHAPQLA EALSQLGFVQ FRSRRYKEAL VAWQQESALR EQSGALPIVR GLNQSNIAEA LVHLRRYDEA FAAFERAQEY YGAEDDISPI YSGLVEKGRG QALLGQGRAA AAVPHLEAAL AVFHEHPFDV LEDADTAWSL ARALRISAPT RLEEARSLAT KAEDIYRANE ATRDIADEIR TWLDSL
|
| |