Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1936 |
Symbol | |
ID | 8544318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2660793 |
End bp | 2664017 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646386640 |
Product | APHP domain protein |
Protein accession | YP_003266375 |
Protein GI | 262195166 |
COG category | [S] Function unknown |
COG ID | [COG1572] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.620471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.229187 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCCG CGAGTCTGGG AATCGCGGCC GCTACCAGTC TGAGTGGGTG TGCGGCCGAC GACGCCCCAT CGGAGCGGGA CGGGGTGTTG CAGCGAGCTA GCGTCGGCTT TGATCTCGCG GTGACCGCGG TCGAGGGCCC GGCGAGCGTG TTGCCGGGCG GCGAGGTCGA GGTGCGCGTC GAGGTCTGTA ATCAGGGCAC CGAGTACGCC GGCGGAGAGA ACGTGTCCGT CTACCTGTCC GAGGATGCTG TGATCGAGGC CAGCGATCAC CTACTCGGCG GCGAGCCGCT TGCGTCGCTC GTGCCCGGCG CCTGCACCGC GCTGAACGCG CGCGGCCCCG AGCCGGGTCT GGCGTCGTCG TACACCGTCG GCGCGATCGT CGAGAGCATG TACTCGTCCG ACGTCGTTCC TGCGAACAAC ACGCTCGTGG GCGGCACCCT CGTGGTCGGC CACGAGGCCG ATCTGGTCGT CAAATCGGTG ACCGGGCCGG CCAGCGTAGA GCCCGGTATG GGCTTCGAGA TCGCGGTGCT GGTGTGCAAC CAGGGCCAGA GCCCGGCCAA CGCCGAGGTC GAGGCCGTGC TCTCGAGCGA CGGCATCATC GACGCAGGCG ACACCGTGGT CGGTTACGGC TTCGCCATGA ACCTTCTCGA GCCCGGCACC TGCGATACCG TGCTGGTGCC GGCGGTGGCC TCAGAGCCCG ACGGCGTGTA CACGCTGGGT GCGCGGGTCG ATTTCAATCA GTTCGAGCCC GAGCTGGACG AGAACAACAA CACCGCCGCG GGTTCGAGCC TGAGCATCGG CTACGGTCCG GATCTCATCG TCAAGTCGGT GAGCGGCCCG AACAGCGCCA TGTCCGGCGG CAGCATCGCG CTCAGCGCTG AGGTGTGCAA CCAGGGCACC GACTTCGCGT CCTTCACCGA TGTCGAGTTC TTCCTGTCGA GCGACGCGAG CATCGACAAC ACCGACTACC CGGCGGGCTC TGCGCCGGTG TCGGAGCTCA CGCCCGGAAG CTGCACCACG GTGGTCGCGG ACGGCTACGT CGGCGTTCCG CAGGACGGCG TGTACATGGT CGGCGCCATC GTCGACGGCT ACGACGGCGT GTTCGAGCTG CGCGACGACA ACAACGCCAC CGCGGGCGCG CGCGTGGGTG TGGGCTACGA GCCCGACCTG GTGATCGCCA GCATCGAGGT GCCGGCGAGC GCGATCTCCG GGATGGATAT CGACGTCTCG GTCGAAGTCT GCAACTGGGG GCAGTCGGAG GCCTGGGGCG TGGATGTCGA GGTGGTGCTC AGCGGCGGTG GCGGCGCGGC GGAGCCGCTC TATTATCCGC TCGAGCCGGA TGACTGCCAC ACCCTGTCGA TAAGCGTCCC GGGCGCGCCC GATGGCGTTC ACACGCTCAC GGCTACGGTC GACATCAGCA ACTCGGTGTC CGAAATCTTC GAAGACAACA ACACCGCCAC CAGCGACCTC GTCGCCGTGG GCTACGAGCC CGACCTGGTC GTGAGCGTGG ACGCGCCGGC CACCGCCCCG CTCAGCGGTG AGGCCCTCGT CGAGGTCGAG GTGTGCAACC GCGGTCAGGC GCCGGTGCAT GGCGTGGACA TCGAGCTGTA CGGATCGACC GACGCGACGA TCACGCGTTC GGACATGCTG GCCGGGTTCG CCCACGTGTC CTCGCTTGTG CCGGGTCGCT GCACCAAGCG GGTTGCCGAC GCGTCTTTCT ACGCGCAGCC GGGCGGCACC TTGTTCGCGG GCGCCATCGT CGATCCGAAT GAGTCCGTGC CCGAGCTGCG CGAGGACAAC AACGCCAGCG CTGGCGACGC CATCGCTTTC GGCGATGAGG CCGACCTGAT CGTGAGCGCC ATCGACGCCC CGGCGACGGC CTCGTCCGGC GCTTCGCTGA CCGCGGAGGT GACCGTGTGC AACCAGGGCT ACAGCCCTGC GCCGGCCGAG GTCGAAGTGT GGACGCGCGG CCCCTACGGC GACATGTTCT TCGGCTCCAG CGCCGGAGCG AGCCCGTACC TCGAGCCCGG TGATTGCGAG CGCGTGTTCG TGTCCGGTTC CGCGCCCTAC GACGACGGCG TGCACGAATT CGTCGCCACG GTCGATTACC ATAACTACGA GCCCGAGATC CTGGAGAACA ACAACACCAC AATCGGCGGC CGCTTTGGCG TCGGCTACGA TGCCGACCTG TTCGTGGCCG AGGTGTCTGC GCCGCTCGCC GGCCTGCCGG GCGACGAGTT CGGCGTGAGC GCCCGCATTT GCAACCAGGG CCAGATGCCC AGCAGCCCCA GCTACGCGAG CTTCGTGTTG TCGCAGGATG GCCAGGCCGA TCCCGGGGAC TTCCTCGCCG GTGACGTGTT CGTGGACATG CTCGAGCCGG GGGCGTGCGT GGACGTGAAC GCCCAGCTCT TCGACCCGGG CTTCGGCGAC CGCTTCGAGG TGTTCGTGGT CGCCGATTGG CAGGAGATGG TCCCCGAGAT CTTCGAGGAC AACAACGCCA CGTCGGGCGG CGTCCGCGTG CTCGGTTACC TGCCCGAGCT GGTGATCGAG GCCGTGAGCG GTCCCGACAG CGTCATGCCC GGCGATACCT TCGAGGTGAC GGTGCGGGTG TGCAATATCG GTACGTACGG CTCCTACGGC ACCGATGTCG AGCTGTACAG CTCGTCGCCG AGCCCGGCCG GTCCCGGCGG ACCTGTGGGG ATGGCGCAGG TGGCCCCGCT GCCCGCGGGC GTGTGCCAGA ACCTGCGCGT CGAGGTCTGG GTCGATACCT ACGCCGAGGG CGCGATGACC ATCCTCGCCG AGCTCGACCC GTACGACACG GTGACCGAGC TCATCGAGGA CAACAACAGT GGCGAGAGCG CGCCCATCGG CGTGGGCTAC GACGCCGACT TCACCATCGC CTCGGTGTGG ACGCCGAGCA CGCTGATGGC CGGCGAGCGC TTCACGGCCG AGGTCGAGGT GTGCAATGTC GGACAGTCGG GCGCGTCGAG CGACGTCAAA CTCTGGTTCT CGCCGAGCAG CAGCCCCTCG AACTACGACA CCCGGGTGCC CACCGCCTGG CTCGAGCCCG ATATGTGCCA GGTCCTGCGC GTGGTGCTCG ATGCGCCGTA TGAGCCCGGG CAGCAGGTCA CGCTGGTGGC CGAGGTCGGC CCGGACAACT GGCAGCCCGA GCTGCGCCGG GACAATAACC TCGGCGAGAG CGAGCCCTTC ACCGTGAGCT ACTGA
|
Protein sequence | MAAASLGIAA ATSLSGCAAD DAPSERDGVL QRASVGFDLA VTAVEGPASV LPGGEVEVRV EVCNQGTEYA GGENVSVYLS EDAVIEASDH LLGGEPLASL VPGACTALNA RGPEPGLASS YTVGAIVESM YSSDVVPANN TLVGGTLVVG HEADLVVKSV TGPASVEPGM GFEIAVLVCN QGQSPANAEV EAVLSSDGII DAGDTVVGYG FAMNLLEPGT CDTVLVPAVA SEPDGVYTLG ARVDFNQFEP ELDENNNTAA GSSLSIGYGP DLIVKSVSGP NSAMSGGSIA LSAEVCNQGT DFASFTDVEF FLSSDASIDN TDYPAGSAPV SELTPGSCTT VVADGYVGVP QDGVYMVGAI VDGYDGVFEL RDDNNATAGA RVGVGYEPDL VIASIEVPAS AISGMDIDVS VEVCNWGQSE AWGVDVEVVL SGGGGAAEPL YYPLEPDDCH TLSISVPGAP DGVHTLTATV DISNSVSEIF EDNNTATSDL VAVGYEPDLV VSVDAPATAP LSGEALVEVE VCNRGQAPVH GVDIELYGST DATITRSDML AGFAHVSSLV PGRCTKRVAD ASFYAQPGGT LFAGAIVDPN ESVPELREDN NASAGDAIAF GDEADLIVSA IDAPATASSG ASLTAEVTVC NQGYSPAPAE VEVWTRGPYG DMFFGSSAGA SPYLEPGDCE RVFVSGSAPY DDGVHEFVAT VDYHNYEPEI LENNNTTIGG RFGVGYDADL FVAEVSAPLA GLPGDEFGVS ARICNQGQMP SSPSYASFVL SQDGQADPGD FLAGDVFVDM LEPGACVDVN AQLFDPGFGD RFEVFVVADW QEMVPEIFED NNATSGGVRV LGYLPELVIE AVSGPDSVMP GDTFEVTVRV CNIGTYGSYG TDVELYSSSP SPAGPGGPVG MAQVAPLPAG VCQNLRVEVW VDTYAEGAMT ILAELDPYDT VTELIEDNNS GESAPIGVGY DADFTIASVW TPSTLMAGER FTAEVEVCNV GQSGASSDVK LWFSPSSSPS NYDTRVPTAW LEPDMCQVLR VVLDAPYEPG QQVTLVAEVG PDNWQPELRR DNNLGESEPF TVSY
|
| |