Gene Information |
Locus tag | Hoch_0068 |
Symbol | |
ID | 8542438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 100792 |
End bp | 103797 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646384855 |
Product | cysteine-rich repeat protein |
Protein accession | YP_003264602 |
Protein GI | 262193393 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02232] Myxococcus cysteine-rich repeat |
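
The coordinate and length fields above are related by simple arithmetic, and a quick consistency check can be sketched as follows. This is only an illustrative sketch: it assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 100792
end_bp = 103797
gene_length_bp = 3006
protein_length_aa = 1001

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(computed_gene_length, 3)
computed_protein_length = codons - 1

# True only if the length is a whole number of codons and both
# computed values match the reported fields.
lengths_consistent = (
    remainder == 0
    and computed_gene_length == gene_length_bp
    and computed_protein_length == protein_length_aa
)
print(lengths_consistent)
```

Under these assumptions the reported gene length (3006 bp) and protein length (1001 aa) agree with the listed coordinates.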
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCATA ATCGAGCTCT CGCCCGAGGT CTGTCCGTCG CCGCGCTGCT GGCATTTGCC GCCGCCGATG ACGCGCCCAC CGCCCACGCC TTCACCGGCG CCGACACACC CGGTATCGCC GGAGTCGAAC ACGCCGCCGA ACCCGGCCTC GAAGTCGCCG AGGGGCCGCG CGCGCGCGCG CAACCGGCGG TCACCTGGAA CCGGCCGCCG GCGCAGCGCG CGGCCGCCTG GGAGCGCTTC GTCGCGGACA CCGGGACGGC GTGGACGCCG ATGTGGGACG CCGACACCGC GATGCCGCTG CGCATCGCGG GCGCCGGCAT GGCGATGCCG GGCAGCGTGG CCTCGGCCGA TAAAGCCGCC GACTACGCGC GCGCGTTCCT GGCCGAGTAC ATCGACCTGC TGGCGCCCGG CAGCCGCCCG GAGTCGTTCC ACGTGGTCGG CAACGACCTC AGCGACGGCG TGCGCGCGGT GGGCCTGTAT CAATATCACG AGGGCATGCG CGTGCTCGGC GGCCAGCTCA GCTTCCGCTT CAAGAACGAC CGCCTGATCC TGGTCGCCTC CGAGGCGCTG CCCGATATCG CGCTGCCGGC CGTGAGCTAC ACCACCGCCG AGGCCGTGGT CCGGGATGCC GCGCTGTCGT GGGTGGCGGG CGAGGTCGGC AAAGCCTGGG TGACCGAGAC CAGCGGCCCC TACGTGCTTC CCGTCATCTC TACCGGACGT GTGTCCTATC ACACCGTCAT GCAGGCGACC GTGGAAGGGC GCGCGCCGAC CTCGCGCTAC CGCGTGTACA TCGACGCCTC CAGCGGCGAG CCGGTGGCGC GCGAGCAGAT GCTGCTCTAC GCCGACGCCC AGCTCCTGTA CAACGTGCCC GCGCGCTATC CCGAGGGCGA TCGCGCCGAC CTGCCCGCGA GCTTCACCGA GGTGGTCTAC GAGGACGAGA GCTACTTCAC CGACGGCGCT GGCGTGTTCT CGTGGGACGG CGAGGGCGCG GCGTCGGTGT CGGCCTCGGT CAGCGGCGAG CTGGTGACCG TGAGCAACCA GCGCGCGCCC GACGAGAGCA CGGTGTTCGA GGTCGCGCTC ACGCCCGCGG GCACCGGCGT GTGGGACGCG CGCGACGACG AGTTCGTCGA CGCCCAGCTC ACCACCTTCG TGCACTCCAA CATCGTCAAA GAGTACGTGC GCGTGTTCGC GCCCGGTCTC AAGTATCTCG ACGAGCAGCT CCTGGCGCGC GTCAACATCG ACGACACCTG CAACGCGTTT TCGGACGGCA CGACCATCAA CTTTTTCCGC GCCAGCGGGC AGTGCGCCAA CACCGGGCGG CTGCCCGACG TGGTCTATCA CGAGTTCGGA CACTCGATGC ACTGGCAGTC GCTGGTGCCG GGCGTGGGCG CCTTTGACGG CGCCTTCAGC GAGGGTCTGT CCGACTATCT GGCCGCGACC ATCACCGGCG ACCCGGCTAT GGCCCGCGGC TTCTTCTACG GCGACGAGCC GTTGCGGCAC CTCGATCCCG AGGACTTCGA GCACTCCTGG CCGCGCGACA TCGCCGGCGT GCACTACACC GGGCTCATCT TCGGGGGCGC GATGTGGGAT CTGCGCAAAG AGCTCGTCGC CCTGTACGGC GAAGAGGAGG GCGTGGCCGT GGCCAACCGG CTGTACTACG CCGCCGTGCT GCGCGCCAGC TCCATCCCGG CGACCTATTT CGAACTCCTG GCCGCCGACG ACGACGACGG CAACCTGGCC AACGGCACGC CGCACGAGTG TCTGATCAAC GACGCCTTCG GCGCGCTGCA CGGCCTGCGC 
GAGATCGGCA ACGAGCACAT CCCGCTGGGC ATTCAGCCGC CCGAGCGCGA GGGCTACTCT CTGAGCGCGC GCCTGCAGGG GACCAACGCG CGCTGTGCCG GTGACGAGGT GCTGTCGGTG ATCGTGCGCT GGCAGCGCCG CGGCAGCGAG GTCGGCGAGG ATCTCGAGGC CACGCTGCAG GACGGCGGCG ACGGCGTCTA CGAGGCCACG ATTCCGGCGC AGCCGGCGGG CAGCACGGTC CGCTATCAGG TGGTGGTCGA GTTCGCCAAC GGCGGCGTGA TCACCTTCCC CGACAACCCG GCCTGGGAAT ATTACGAGTT CTACGTCGGC GAGCTGATCG AGCTGTACTG CACCGACTTC GAGAGCGATC CCTACGCCGA GGGCTGGAGC CGCGGACAGA CCCGCGGCGT GGCCACCGGC GGCGCCAACG ACTGGCAGTG GGGCCGTCCG CTGGGCAAGG CCGGCGATCC CGCGGCGCCG TATTCGGGAG CCGCCAGCAT GGGCAACGAC CTCGGCGATG AAGGCTTCGA CGGCTTCTAC CAGCCGCAGA AGGGCAACTA CTTCGAGAGC CCGGTCATCG ATGTCGGCGA CTACAGCGAT GTGCGCCTGC AGTATCGCCG CTGGCTCACG GTCGAGGATT CGCGCTGGGA CGACGCCATC ATCTACGTCA ACGGCCGGCC GGCGTGGCGC AACCGCCAGA CCCCGTCGGG CAAGGTGCAC CATATCGACA AGCAGTGGAT GTTTCACGAT GTCTCGCTCA GCGGCCAGAT CCTGGGCGAT ACCGCCCAGC TGCGGTTTGC GCTCGAGACC GACGGCGGCC TGCAGTTCGG CGGCTGGAAC ATCGACGACG TGTGCATCGT GGCCGCGCCC GACGCCATCT GCGGCAACGG CACGGTCGAG GGCACCGAGC GCTGCGACGC GGGCGACGCC AACAGCGACA CCGAGTCCGA CGCCTGCCGC ACCAACTGCC GCACGGCCTT CTGCGGCGAC GGCGTGCGCG ATCGCTACGA GCAGTGCGAC GACGGCAACG ACGACCCCGA CGACGGCTGC ACGCCGGCCT GCTTCTTGCC CTTGCCCGAG CGCGGCTGTA GCGTGCGCCC GGGTGGCGCT GGCGATGGCG GCGCTGCCGG GTTGGCCCTG CTCGCGCTGC TCGGGCTTGT CGGACGCGCG TACCACACGC GGCGCCGCGG CCGCGCGCGC GCCTGA
|
Protein sequence | MTHNRALARG LSVAALLAFA AADDAPTAHA FTGADTPGIA GVEHAAEPGL EVAEGPRARA QPAVTWNRPP AQRAAAWERF VADTGTAWTP MWDADTAMPL RIAGAGMAMP GSVASADKAA DYARAFLAEY IDLLAPGSRP ESFHVVGNDL SDGVRAVGLY QYHEGMRVLG GQLSFRFKND RLILVASEAL PDIALPAVSY TTAEAVVRDA ALSWVAGEVG KAWVTETSGP YVLPVISTGR VSYHTVMQAT VEGRAPTSRY RVYIDASSGE PVAREQMLLY ADAQLLYNVP ARYPEGDRAD LPASFTEVVY EDESYFTDGA GVFSWDGEGA ASVSASVSGE LVTVSNQRAP DESTVFEVAL TPAGTGVWDA RDDEFVDAQL TTFVHSNIVK EYVRVFAPGL KYLDEQLLAR VNIDDTCNAF SDGTTINFFR ASGQCANTGR LPDVVYHEFG HSMHWQSLVP GVGAFDGAFS EGLSDYLAAT ITGDPAMARG FFYGDEPLRH LDPEDFEHSW PRDIAGVHYT GLIFGGAMWD LRKELVALYG EEEGVAVANR LYYAAVLRAS SIPATYFELL AADDDDGNLA NGTPHECLIN DAFGALHGLR EIGNEHIPLG IQPPEREGYS LSARLQGTNA RCAGDEVLSV IVRWQRRGSE VGEDLEATLQ DGGDGVYEAT IPAQPAGSTV RYQVVVEFAN GGVITFPDNP AWEYYEFYVG ELIELYCTDF ESDPYAEGWS RGQTRGVATG GANDWQWGRP LGKAGDPAAP YSGAASMGND LGDEGFDGFY QPQKGNYFES PVIDVGDYSD VRLQYRRWLT VEDSRWDDAI IYVNGRPAWR NRQTPSGKVH HIDKQWMFHD VSLSGQILGD TAQLRFALET DGGLQFGGWN IDDVCIVAAP DAICGNGTVE GTERCDAGDA NSDTESDACR TNCRTAFCGD GVRDRYEQCD DGNDDPDDGC TPACFLPLPE RGCSVRPGGA GDGGAAGLAL LALLGLVGRA YHTRRRGRAR A