Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3059 |
Symbol | |
ID | 8545447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 4219011 |
End bp | 4222256 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646387730 |
Product | hypothetical protein |
Protein accession | YP_003267458 |
Protein GI | 262196249 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.893943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGG GTGAGCGGCA GACTGGTGGT TCTGAACAGC GCGCCGGCGA ACAGGGCACG GGTACGCGCG CGCCCGAGCG CCAGCGCGCG CGCCCGGCCC CGGGCAAGGT CACGCGCACC GGCAAGCTCG CGAGCGGCGG CAACGGGGCC GTGCAGCGCC GGGCGGCCGA AGCCGACGCG CCCACCGATC CCGAGGGCGG CGCGCCGCTC GACGGCGGCC TGCGCCAGCA GCTCGAGGAC GCCACCGGCT TCTCGGGCCT GGCCAGCGTG CGCGTGCACA CCGGTTCGGC CTCGCAGCGC GCGGCCGGCG GTCTCGGCGC CGCCGCGTTC ACCTCGGGGC AGCACATCCA CTTCGGCGCC GGCCAGTACA ACCCCAACTC GGCCGCGGGC AAGCAGCTCA TCGCCCACGA GGTCGCGCAC GTGGTCCAGC AGAGCGGCGC CCCGGCCAGC GCCACCGGCC AGGTCGGCAG CGCGGGCGAC GCCCACGAGG CCGCGGCCGA TCGCTTCGCC GGCGGCTTCG CCATCGGCAT GCCCGTGGTC CAGCGGCTGG CTCGCGGCAG CGCGCCGAGC GGCATGATCC AGCGCCGCAC GGGCACGCCG CAGCCGCTCA CCGTGGAGGA GCTGGCCGCG CCGGTGCGCG AGCGCCTGGT GCAGAAACGC GCCGAGGGCA CGGCCGCCCT GGTCGCCGAG GTGCGCTCGC TGCGCGCGCA GGCCGAGGGC CAGGTGCTCG AGGCCGTCAA CCTGGCGGTA TCGCAGACGC TCACGCAGGA GGAGCAGGAC GCGCTCGCCG GCAACACCTC GACAAGCTCG GAGGAAGGCG CCGAGGCGAC GCCGACACCC GAGGGCGGCG AGCAGACCCC GAGGCCGGCC GCGAACGAGA ACGCCCAGGG CCAGGACGAG GAGCCCTCGG CCACCCCCGA GGGCCCGCAG CCCGCCCCGA CCCCGAGCCC AACCGACGCG GCCCCGCAGA ACGCCGAGAG CGAGGACGCC AGCGCCCAGG AGCCGGGCAC CGAGGGCCAG GACGCCGAGG GCCAAGATCC GGCCGCCCAG GGCGACCAGA ACGCCCCCGG CCAGGGCGGC GACCAGGCCC AGAACCAGGG CGGCGGCGGC GAGCAGGCTG GCGGCCAGGC CGCGACCCCG GCGCCCATGA CCACCGAGCA GGCGCTGCAG CCGGTGGCGG CGCCGCCCGC GGACGAGCGC GCGTTCATCC AGGGCGAGCT CGGCTTCCAC GAGAGCTGGA CCGCCATGCG CGGCTCCACG GCCGACCGCG CCGCGCGCCT GTTCTCGGCC AACGACCTGG GCCAGGGTCT GCTCGGCGGC GGCGTCCAGG TGCTCGCCGG CGTGGGCATC GAACAGCTCG CCAAGCGCGT CCCGGTGCCC GGCCTGGGCA ACATGATCGG CGGCGGGCTC AGCGCCTACG CCCTGTTCTC CAACGGCGGC GCCGGCATCC GCAACATCGC CGGCACCATC GGCGAGGGCT TCAACTGGGA CGGCAAGGGC CCGTGGCAGA TCGCGGCCGA CGTGGTCGCC ATGATCAAGG GCGTCCTCGA CCTCATCGGC AACATCTGCA ACATCCTCTC GGGCCTGGCC TACGCCTTCG CCGCCATCGC CGCGGTCGGC GGCCTGCTCT CGGTGCTGTT CCCGCCCCTG GCCTTCCTGG TGCCCTACAT CCCCACGGCC ATCAATTTCG GACGCATGTG CGGCGGCGTG GCCACCGTGT GCATGGGCAT CTCCGACCTG ATATCGCCCA TCCCGCCGGT GCTGCGCGCG ATCCACATCC TGGTGTCGGA CCAGGACCCG CTGCAGCTCG CGCAGCAGGA GCAGACCTAT CACGGCGAGC TCCAGGGCGC GATCGCCAGC TACAGCGCGG CCGGCGCCAT GTCGGCGATC GAGGGCAAGG GCTTCAACCC GGTCGGCAAC ATGGTCGGCG GCGTGGCCGA CGGCGCCGGG GTCGCGCGCA GCGCATACGG CGACGCGCGC GCCGGCAACA CCGCCGTCAA CCGCGCCTTC GGCTTCGAGG GCGGCGACAC CCAGGGCATG CGCGGCGCGG GCCAGAACAA CGGCGAGCGC TACTTCGACG TCAGCGGCCA GCAGCGCACG CGGCTGCAGG AGAACACCGA GTCCGAGCAG CGCCTGCTCG GCAGCCGCCA GGACAACGCG CAAGAGCACC GCCGCGCCGC CGACGAGGCC CATCAGCGGG CCCGCGACAA CGCCGACGGC GGTCGCCGCG TGCGCCGCGA GGCCGAACTC GCCGAGCGCC GCGCGGTCAA ACGCGAGCAG CGCGTCGAGC AGTCCGAGGG GCGCGTCGCC GACGCGAAGA ACCAGCAGGA GCTGCAGAGC GGCATGCAGG CCGGCGGCCC CGGCGGCGAG GTCGGCAACA CCACCGAGAA CGCCCGCCAG GCCTTCCAGA GCAACGGCGA GAGCGAGCGG CAAGAGCCGG GCGGCGGCGC CATCGAGCGC GACCAGCGCG GCCACGTGGT CCTGCCGCCC CCGCCCGGCT CGCTGCAGGA GGTCGACCAG CAGGATCAGA CCATCCAGCA GCTCCAGCAG CGCCTGGCCG CGCAGCAGCA GCACACCCAG GGCGCCCAGG GCGTGCGCAC AGAGGCCGGC CAGCGCTCGG CCCAGCTCGG CGCCGTGCAG CAGACCGTGG ACGGGCGCGT CCAGGAGCAC ACGGCGCTCG AGGCCGAGCA CGCCAACGTG GCCCAGCAGA ACGCCGACGT CAGCAGCCGA ACCCAGGAGC AGAGCAGCAG CTCGGACAGC GGTCTGGGCC GCGCCGCCGA GGTGCTGTCG CCGATGGTGG GCCCGGCCCA GACCGTCAAC GATCTGGTGC AGCGGGTGCC CTCGAACCGC TTCTTCGACG TCTCCGGCAC CCAGCAGAGC CTGAGCCAGT TCGTGCAGGG CATGGAGCAG ATCACGGGCG GGCGCGACGA CGCGCAGGAG CAAAGCAGCC AGACCCAGCA GGTGCTGCAG TCGCGCGAGC AGCAGACGGC TGAGGCCGAC CAGCTCCATC AGCAGACCAG CAGCGAGGGC CAGGAGCTGC TGATGTGCGT GCAGAACGAC CAGGGTCAGG CCGACAGCGT GGCCCAGGAG GCCGCGTCCG AGCAGGCCAA CAGCCAGCAG CAGGAGCAGA CCCTGGAGCA GCAGATCCAG CAGGCCATCC AGGAGCGCGA GCAGAAGTGG AGCAGCCTGG TGGGCTGGGC CCAGCAGCAC TACTCCATCC GCCAACGCGC CAGCAGCGGA AGCTGA
|
Protein sequence | MKQGERQTGG SEQRAGEQGT GTRAPERQRA RPAPGKVTRT GKLASGGNGA VQRRAAEADA PTDPEGGAPL DGGLRQQLED ATGFSGLASV RVHTGSASQR AAGGLGAAAF TSGQHIHFGA GQYNPNSAAG KQLIAHEVAH VVQQSGAPAS ATGQVGSAGD AHEAAADRFA GGFAIGMPVV QRLARGSAPS GMIQRRTGTP QPLTVEELAA PVRERLVQKR AEGTAALVAE VRSLRAQAEG QVLEAVNLAV SQTLTQEEQD ALAGNTSTSS EEGAEATPTP EGGEQTPRPA ANENAQGQDE EPSATPEGPQ PAPTPSPTDA APQNAESEDA SAQEPGTEGQ DAEGQDPAAQ GDQNAPGQGG DQAQNQGGGG EQAGGQAATP APMTTEQALQ PVAAPPADER AFIQGELGFH ESWTAMRGST ADRAARLFSA NDLGQGLLGG GVQVLAGVGI EQLAKRVPVP GLGNMIGGGL SAYALFSNGG AGIRNIAGTI GEGFNWDGKG PWQIAADVVA MIKGVLDLIG NICNILSGLA YAFAAIAAVG GLLSVLFPPL AFLVPYIPTA INFGRMCGGV ATVCMGISDL ISPIPPVLRA IHILVSDQDP LQLAQQEQTY HGELQGAIAS YSAAGAMSAI EGKGFNPVGN MVGGVADGAG VARSAYGDAR AGNTAVNRAF GFEGGDTQGM RGAGQNNGER YFDVSGQQRT RLQENTESEQ RLLGSRQDNA QEHRRAADEA HQRARDNADG GRRVRREAEL AERRAVKREQ RVEQSEGRVA DAKNQQELQS GMQAGGPGGE VGNTTENARQ AFQSNGESER QEPGGGAIER DQRGHVVLPP PPGSLQEVDQ QDQTIQQLQQ RLAAQQQHTQ GAQGVRTEAG QRSAQLGAVQ QTVDGRVQEH TALEAEHANV AQQNADVSSR TQEQSSSSDS GLGRAAEVLS PMVGPAQTVN DLVQRVPSNR FFDVSGTQQS LSQFVQGMEQ ITGGRDDAQE QSSQTQQVLQ SREQQTAEAD QLHQQTSSEG QELLMCVQND QGQADSVAQE AASEQANSQQ QEQTLEQQIQ QAIQEREQKW SSLVGWAQQH YSIRQRASSG S
|
| |