Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_2017 |
Symbol | |
ID | 8544399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2781854 |
End bp | 2786113 |
Gene Length | 4260 bp |
Protein Length | 1419 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646386720 |
Product | amino acid adenylation domain protein |
Protein accession | YP_003266455 |
Protein GI | 262195246 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0369158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.503018 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTCAT CTCTGCAAGC GCACGCGTCT GCCGCGAGCG CACAGCCTCA TGGCGCGGAC GCGCCGCGCG TCGAGTTGCC GCTGAGCGCG GCACAGCACG GCATCTGGCT GGGGCAGCAG CTCGACCCCG ACAGTCCGGC GTACAACACG GCCGAGTGCA TCGAGATTCA CGGCCCCATC GATCCCGTGC ATTTCGAGGC CGCGCTGCGC CAGGTGATGC ACGAGGCGCA GACCCTGCGC ATGGCCGTGG TGGCGGCCGA CGGCTCGCGC CAGCGAGAGC GGGCGATGAG CGACTGGCGC TTCGTGTACC GCGCCTACGA CGCTGGCGCG GACTGGCTGG CGACGCGTGC GGGCGACGCG CTCGCGAGCG CTGGGGCCGC GACCGTCGAG ACCGCTGCTG CCGATGCGCC CGACGCCGCG GCCCGCGCCT GGATGGACGC CGACATGCGC ACGCCGGTCG ATCTGGCGCG CGGTCCGCTG TTTGCGCACG CGCTGTTCCA GTGCGCGGCG GATCGCTACC TGTGGTACTT CCGCGCCCAT CACATCGCGC TCGACGGCCT CGGCTTCTCG CTGGTCGCGC GCCGCGTAGC CGCGGTGTAC AGCGCCCGCA TGCAGGGCAC GGCAGCGCGC TCGCCCGCGG CCTTTGGCCC GCTCGAGCAG GTGGTGCGCG AGGATCTAGC CTATCTCGGC TCGGACGATT ACCAGCGCGA TCGCGGGTTC TGGATCGACC AGATGTCCGA TATCGACACG CCGGCGACCC TGGCGCGGCG GGCGGCGCCG GTGTCGCGCA CGGTGGTGCG CGATGCTGCG CGGCTCGACC CGGCGTTGCA CGCGTCCCTG CAGGCGGCCG CGCGCGGCGC GCGCACGACC TGGCCGGCGC TGATGCTGCT GGCCACCAGC GTGTATCTCG GCCGCAGCAC GGGCGCCTCC GAGCTGGTGC TGGGGCTGCC GGTGATGGCT CGTCTGGGCT CGGCCGCGAT GCGCGTGCCG TGCATGGCCA TGAACATCGT GCCGCTGCGA CTGCCGGTGC CGGCCGAGGG CGAGTTCGCC GCGGGCCTGG CCCAGGTGAC CCAGCTCCTC GCCCGCGTGC GCCCGCACCA GCGTTACCGC TACGAGCATC TGCGCCGCGA GCTCGGTCGT GTCGGCGGAG ACCGGCGCTT GTTCGGACCG GTGGTCAATC TGATGCCTTT CGATCACCCG CTGAGCTTCG CCGGTCATCC GGCGACCACG CACAACCTCT CGGCGGGTCC GGTGGAGGAT CTGTCGATCG GCGTACGCGC ATGCGCGGCC GGAGGTCCTG GCGCAGATGG CGCCGAGCGC GCGCCCACGC TGAAGCTCGA ACTCGACGGC AACCCGGCCT GCTACCGCGC CGACGAGCTG GCCGCGCACC GGCGCGGCCT GTGCGACACG CTGGCCGAGA TCGCAGACAA CCTGAACGAT CTCGACGGCC CGACGGCGCC GCGAGCAAGC GCGCGCCCTG TTCGCGCGGC GCACGCGGGC GCGTTGATCG CGGGCGAGTG CGCGTTCGGC CCGGCGCGCT CGGTGTGTGC GCTGCTGCTC GAGCGCGCCG CCCGCCACGG CGAAGACACG GCCGTGGTGT GCGGCGACAC GCAGCTCAGC TACGCGCAGC TCGTCGCCGC GGCCAGCGCG CTGGCGCTCG AGCTGCGGCG CCGCGGCGCG GGCCCCGAGA CCCTGGTCGC GGTGCTGCTG CCGCGCAGCG CTGAGGCCAT CGTCGCCATC GTCGCCGTGC TGCTCGCGGG CGGCGCCTAT CTGGCGCTCG ATCCCGACGC GCCGCGCTCG CGCAATCAGG CCATCGACGA ACACGCCGCG CCCGCGCTGG TGGTGACCGA TGAGACCGCC GGCAACGCGC TCGGCGACGC CATCGGCACG CGCGCCGTCC GCGTGGACCA AATCCTGGGC CGCCGCGATC GCGCGCTCGG CGTGACGCCC GCGCTGCCCA CCGAGTTCGC ACCCGATTCG CTGGCCTATG TCATCTACAC CTCGGGCTCC ACCGGCGCCC CCAAAGGCGT GCAGGTCGAG CACGACGCGC TCGCGCATTT CGTGGCCGGC GCCATGCAGC GCTACCGCGT GCGCCACCGC GACCGGGTGT TGCAGTTCGC GCCCCTGCAC TTCGACGCCA GCATCGAGGA GATCTTCGTG ACGCTGTGCG CGGGCGCTAC GCTGGTGCTG CGCGCCAGCG ATATGCTCGA CTCGGTGCCG CGCTTCCTCG AGGCCTGCGC GGCGCAGGCG ATCACGGTGC TCGACTTGCC CACGGCCTAC TGGCACGAGC TGGCGTACAG CATCTCGACC GGGGCGGCGA CGCTGCCGCC CTGCGTGCAC ACGGTGATCA TCGGCGGCGA GGCCGCGCTG CCCGAACGCG TGGCCCGCTG GCGCAGTTCG GTGGCCAGCC TGGTGGCGCT GCTCAACACC TACGGTCCCA GCGAGGCGAC CATCGTGGCC ACGGTGGCGA TGCTCGCAGG CCCCGATCCG ATCGCGGTGG ACGGCGACGA GGTGCCCATC GGTCTGCCGC TCGGCAACAC CGGCGCGGCC GTGCTCGACC GACGGGCGCA GCCGGTGCCG CGCGGCGCGA TCGGCGAGCT GTACCTCACC GGGCCGAGTC TGGCGCGCGG CTATCTGGGA CGCGACGACC TGAGCGAGAG CCGCTTCGTC ACCCTGCAGC ACCTGCCCGG GACGCCGCGC GCCTACCGCA CCGGCGACCT GGTGCGCCAG CGCGCCGACG GCCAGCTGGT GTTTATCGGA CGCGTGGACG ATGAGTGCAA GATCAGCGGC CACCGGGTGT CGCCAGCCGA GATCGAGACG GTACTGGCGG CCGCGCCCGG CGTTCGCGAG GCGGCCGTGA TCGCGCGCGA CGAAGCCGGT AGCAAGTACC TGGCCGCGCA CGTGGCCGCG GATGCGCCAG CGCCGACGCC GGCCGAGCTG CGCCAACATC TGCGCGCCGC CCTGCCCCCG GCGCTGGTGC CAAGCGCCAT CCATTTGCAC GAGCGGCTGC CGCGCAATCC CGCCGGCAAG ATCGACCGCG CGGCGCTCGC CCGCGCCCAG GTCCAGGCAG CGGTGAGCGA TGCGGCTCCA GCGGCGCCGA TGTCACCCGG CGAGCGGCAG GTGCTCGAGG TCTGGCACCA GGTGCTGGGC CTGAACGCGC TGCGACCCGA GGATGACTTC TTTGCCCTCG GCGGCCAGTC GCTGCAGTCG ATTCAGGTCG CCAATCGCCT CGGCGTCGCC CTGGGCCGCG ACGTGCCGGT GGCGCTGCTG TTTCGCTACC CCACGGCCGC GGGTCTGGCC TGGGCGCTCG AGCACGAGCT CGCGCCCGCC GCGAGCGCGG GTCCGCGCGA TGGTCGCGCG GCCAACCCCG GGCTGTCGCC GCTGCTGTCG ATTGCCGGAC AGTCGGACGA ATTCTCAGGG CACGGCCAGG CGGACGAGAC CACGACGCGC ACGCCGCTGT TCTGCGTGCA CCCGGCGGCC GGTCTGAGCT GGAGCTACCT GGCGCTCACC CGCCATCTGG ACAGCAGACG CGCGGTCTAC GGGATACAGT CGCCCGCGCT GAGCGGCGAC GCGGCGACGC CGGAGGGATA CGCCTCGCTC GCCGACCTGG CGTCCGATTA CGTGGCCCGC ATCCGGGCGG TGCAACCGAG CGGCCCCTAC GCGCTGCTCG GCTGGTCCAT GGGCGGCGTG ATCGCGCACG CCATGGCCGC TCGGCTCCAA GCGCAGGGCC ACGAGGTGGC GCTGCTGGCC CTGCTCGACG CCTACCCGCG CGAATCCTGG CCCGAGCCGC GCGACAGCGC CGAGCGCGAG GCGATGCGCG CGCTGCTGCA CATCGCCGGG CCGCGCGTGA CCGGCGACGA CGACGCGCCC GATGACACGA TCGGCGACGC GTCCGATGAC GCGCCCGGCA CGCGCGAGCA GCTACGCGCC CGTCTCGCTG GCGCCGGCAG CGCCTTGCGC GGACTGGGCG ACGACGCGCT CGACGCGCTC GTCGCGGTCG CCACCCGCAA CGTCGAGCTG CTGCGCACGG CCCAGCATCG CCCCTTCCGC GGCGACGCGG TGCTCTTCAC CGCCCGCCAG ACGCGCACCC AGGAAGGCTT CTCGGGCGCC GCCTGGAAGC CGCACATCGA CGGCTGCATC GAGTCCATCG AGTTCGAGTG CAGCCACGCA ACGATTCTGC AGGATGACCG CGCCGAGTTT ATCGGCCAAG CGGTCGAACG ACGACTCAAC GAACGCGATC ACAGGAGAGA CGCGAGATGA
|
Protein sequence | MHSSLQAHAS AASAQPHGAD APRVELPLSA AQHGIWLGQQ LDPDSPAYNT AECIEIHGPI DPVHFEAALR QVMHEAQTLR MAVVAADGSR QRERAMSDWR FVYRAYDAGA DWLATRAGDA LASAGAATVE TAAADAPDAA ARAWMDADMR TPVDLARGPL FAHALFQCAA DRYLWYFRAH HIALDGLGFS LVARRVAAVY SARMQGTAAR SPAAFGPLEQ VVREDLAYLG SDDYQRDRGF WIDQMSDIDT PATLARRAAP VSRTVVRDAA RLDPALHASL QAAARGARTT WPALMLLATS VYLGRSTGAS ELVLGLPVMA RLGSAAMRVP CMAMNIVPLR LPVPAEGEFA AGLAQVTQLL ARVRPHQRYR YEHLRRELGR VGGDRRLFGP VVNLMPFDHP LSFAGHPATT HNLSAGPVED LSIGVRACAA GGPGADGAER APTLKLELDG NPACYRADEL AAHRRGLCDT LAEIADNLND LDGPTAPRAS ARPVRAAHAG ALIAGECAFG PARSVCALLL ERAARHGEDT AVVCGDTQLS YAQLVAAASA LALELRRRGA GPETLVAVLL PRSAEAIVAI VAVLLAGGAY LALDPDAPRS RNQAIDEHAA PALVVTDETA GNALGDAIGT RAVRVDQILG RRDRALGVTP ALPTEFAPDS LAYVIYTSGS TGAPKGVQVE HDALAHFVAG AMQRYRVRHR DRVLQFAPLH FDASIEEIFV TLCAGATLVL RASDMLDSVP RFLEACAAQA ITVLDLPTAY WHELAYSIST GAATLPPCVH TVIIGGEAAL PERVARWRSS VASLVALLNT YGPSEATIVA TVAMLAGPDP IAVDGDEVPI GLPLGNTGAA VLDRRAQPVP RGAIGELYLT GPSLARGYLG RDDLSESRFV TLQHLPGTPR AYRTGDLVRQ RADGQLVFIG RVDDECKISG HRVSPAEIET VLAAAPGVRE AAVIARDEAG SKYLAAHVAA DAPAPTPAEL RQHLRAALPP ALVPSAIHLH ERLPRNPAGK IDRAALARAQ VQAAVSDAAP AAPMSPGERQ VLEVWHQVLG LNALRPEDDF FALGGQSLQS IQVANRLGVA LGRDVPVALL FRYPTAAGLA WALEHELAPA ASAGPRDGRA ANPGLSPLLS IAGQSDEFSG HGQADETTTR TPLFCVHPAA GLSWSYLALT RHLDSRRAVY GIQSPALSGD AATPEGYASL ADLASDYVAR IRAVQPSGPY ALLGWSMGGV IAHAMAARLQ AQGHEVALLA LLDAYPRESW PEPRDSAERE AMRALLHIAG PRVTGDDDAP DDTIGDASDD APGTREQLRA RLAGAGSALR GLGDDALDAL VAVATRNVEL LRTAQHRPFR GDAVLFTARQ TRTQEGFSGA AWKPHIDGCI ESIEFECSHA TILQDDRAEF IGQAVERRLN ERDHRRDAR
|
| |