Gene Hoch_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2017 
Symbol 
ID8544399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2781854 
End bp2786113 
Gene Length4260 bp 
Protein Length1419 aa 
Translation table11 
GC content73% 
IMG OID646386720 
Productamino acid adenylation domain protein 
Protein accessionYP_003266455 
Protein GI262195246 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0369158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.503018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCAT CTCTGCAAGC GCACGCGTCT GCCGCGAGCG CACAGCCTCA TGGCGCGGAC 
GCGCCGCGCG TCGAGTTGCC GCTGAGCGCG GCACAGCACG GCATCTGGCT GGGGCAGCAG
CTCGACCCCG ACAGTCCGGC GTACAACACG GCCGAGTGCA TCGAGATTCA CGGCCCCATC
GATCCCGTGC ATTTCGAGGC CGCGCTGCGC CAGGTGATGC ACGAGGCGCA GACCCTGCGC
ATGGCCGTGG TGGCGGCCGA CGGCTCGCGC CAGCGAGAGC GGGCGATGAG CGACTGGCGC
TTCGTGTACC GCGCCTACGA CGCTGGCGCG GACTGGCTGG CGACGCGTGC GGGCGACGCG
CTCGCGAGCG CTGGGGCCGC GACCGTCGAG ACCGCTGCTG CCGATGCGCC CGACGCCGCG
GCCCGCGCCT GGATGGACGC CGACATGCGC ACGCCGGTCG ATCTGGCGCG CGGTCCGCTG
TTTGCGCACG CGCTGTTCCA GTGCGCGGCG GATCGCTACC TGTGGTACTT CCGCGCCCAT
CACATCGCGC TCGACGGCCT CGGCTTCTCG CTGGTCGCGC GCCGCGTAGC CGCGGTGTAC
AGCGCCCGCA TGCAGGGCAC GGCAGCGCGC TCGCCCGCGG CCTTTGGCCC GCTCGAGCAG
GTGGTGCGCG AGGATCTAGC CTATCTCGGC TCGGACGATT ACCAGCGCGA TCGCGGGTTC
TGGATCGACC AGATGTCCGA TATCGACACG CCGGCGACCC TGGCGCGGCG GGCGGCGCCG
GTGTCGCGCA CGGTGGTGCG CGATGCTGCG CGGCTCGACC CGGCGTTGCA CGCGTCCCTG
CAGGCGGCCG CGCGCGGCGC GCGCACGACC TGGCCGGCGC TGATGCTGCT GGCCACCAGC
GTGTATCTCG GCCGCAGCAC GGGCGCCTCC GAGCTGGTGC TGGGGCTGCC GGTGATGGCT
CGTCTGGGCT CGGCCGCGAT GCGCGTGCCG TGCATGGCCA TGAACATCGT GCCGCTGCGA
CTGCCGGTGC CGGCCGAGGG CGAGTTCGCC GCGGGCCTGG CCCAGGTGAC CCAGCTCCTC
GCCCGCGTGC GCCCGCACCA GCGTTACCGC TACGAGCATC TGCGCCGCGA GCTCGGTCGT
GTCGGCGGAG ACCGGCGCTT GTTCGGACCG GTGGTCAATC TGATGCCTTT CGATCACCCG
CTGAGCTTCG CCGGTCATCC GGCGACCACG CACAACCTCT CGGCGGGTCC GGTGGAGGAT
CTGTCGATCG GCGTACGCGC ATGCGCGGCC GGAGGTCCTG GCGCAGATGG CGCCGAGCGC
GCGCCCACGC TGAAGCTCGA ACTCGACGGC AACCCGGCCT GCTACCGCGC CGACGAGCTG
GCCGCGCACC GGCGCGGCCT GTGCGACACG CTGGCCGAGA TCGCAGACAA CCTGAACGAT
CTCGACGGCC CGACGGCGCC GCGAGCAAGC GCGCGCCCTG TTCGCGCGGC GCACGCGGGC
GCGTTGATCG CGGGCGAGTG CGCGTTCGGC CCGGCGCGCT CGGTGTGTGC GCTGCTGCTC
GAGCGCGCCG CCCGCCACGG CGAAGACACG GCCGTGGTGT GCGGCGACAC GCAGCTCAGC
TACGCGCAGC TCGTCGCCGC GGCCAGCGCG CTGGCGCTCG AGCTGCGGCG CCGCGGCGCG
GGCCCCGAGA CCCTGGTCGC GGTGCTGCTG CCGCGCAGCG CTGAGGCCAT CGTCGCCATC
GTCGCCGTGC TGCTCGCGGG CGGCGCCTAT CTGGCGCTCG ATCCCGACGC GCCGCGCTCG
CGCAATCAGG CCATCGACGA ACACGCCGCG CCCGCGCTGG TGGTGACCGA TGAGACCGCC
GGCAACGCGC TCGGCGACGC CATCGGCACG CGCGCCGTCC GCGTGGACCA AATCCTGGGC
CGCCGCGATC GCGCGCTCGG CGTGACGCCC GCGCTGCCCA CCGAGTTCGC ACCCGATTCG
CTGGCCTATG TCATCTACAC CTCGGGCTCC ACCGGCGCCC CCAAAGGCGT GCAGGTCGAG
CACGACGCGC TCGCGCATTT CGTGGCCGGC GCCATGCAGC GCTACCGCGT GCGCCACCGC
GACCGGGTGT TGCAGTTCGC GCCCCTGCAC TTCGACGCCA GCATCGAGGA GATCTTCGTG
ACGCTGTGCG CGGGCGCTAC GCTGGTGCTG CGCGCCAGCG ATATGCTCGA CTCGGTGCCG
CGCTTCCTCG AGGCCTGCGC GGCGCAGGCG ATCACGGTGC TCGACTTGCC CACGGCCTAC
TGGCACGAGC TGGCGTACAG CATCTCGACC GGGGCGGCGA CGCTGCCGCC CTGCGTGCAC
ACGGTGATCA TCGGCGGCGA GGCCGCGCTG CCCGAACGCG TGGCCCGCTG GCGCAGTTCG
GTGGCCAGCC TGGTGGCGCT GCTCAACACC TACGGTCCCA GCGAGGCGAC CATCGTGGCC
ACGGTGGCGA TGCTCGCAGG CCCCGATCCG ATCGCGGTGG ACGGCGACGA GGTGCCCATC
GGTCTGCCGC TCGGCAACAC CGGCGCGGCC GTGCTCGACC GACGGGCGCA GCCGGTGCCG
CGCGGCGCGA TCGGCGAGCT GTACCTCACC GGGCCGAGTC TGGCGCGCGG CTATCTGGGA
CGCGACGACC TGAGCGAGAG CCGCTTCGTC ACCCTGCAGC ACCTGCCCGG GACGCCGCGC
GCCTACCGCA CCGGCGACCT GGTGCGCCAG CGCGCCGACG GCCAGCTGGT GTTTATCGGA
CGCGTGGACG ATGAGTGCAA GATCAGCGGC CACCGGGTGT CGCCAGCCGA GATCGAGACG
GTACTGGCGG CCGCGCCCGG CGTTCGCGAG GCGGCCGTGA TCGCGCGCGA CGAAGCCGGT
AGCAAGTACC TGGCCGCGCA CGTGGCCGCG GATGCGCCAG CGCCGACGCC GGCCGAGCTG
CGCCAACATC TGCGCGCCGC CCTGCCCCCG GCGCTGGTGC CAAGCGCCAT CCATTTGCAC
GAGCGGCTGC CGCGCAATCC CGCCGGCAAG ATCGACCGCG CGGCGCTCGC CCGCGCCCAG
GTCCAGGCAG CGGTGAGCGA TGCGGCTCCA GCGGCGCCGA TGTCACCCGG CGAGCGGCAG
GTGCTCGAGG TCTGGCACCA GGTGCTGGGC CTGAACGCGC TGCGACCCGA GGATGACTTC
TTTGCCCTCG GCGGCCAGTC GCTGCAGTCG ATTCAGGTCG CCAATCGCCT CGGCGTCGCC
CTGGGCCGCG ACGTGCCGGT GGCGCTGCTG TTTCGCTACC CCACGGCCGC GGGTCTGGCC
TGGGCGCTCG AGCACGAGCT CGCGCCCGCC GCGAGCGCGG GTCCGCGCGA TGGTCGCGCG
GCCAACCCCG GGCTGTCGCC GCTGCTGTCG ATTGCCGGAC AGTCGGACGA ATTCTCAGGG
CACGGCCAGG CGGACGAGAC CACGACGCGC ACGCCGCTGT TCTGCGTGCA CCCGGCGGCC
GGTCTGAGCT GGAGCTACCT GGCGCTCACC CGCCATCTGG ACAGCAGACG CGCGGTCTAC
GGGATACAGT CGCCCGCGCT GAGCGGCGAC GCGGCGACGC CGGAGGGATA CGCCTCGCTC
GCCGACCTGG CGTCCGATTA CGTGGCCCGC ATCCGGGCGG TGCAACCGAG CGGCCCCTAC
GCGCTGCTCG GCTGGTCCAT GGGCGGCGTG ATCGCGCACG CCATGGCCGC TCGGCTCCAA
GCGCAGGGCC ACGAGGTGGC GCTGCTGGCC CTGCTCGACG CCTACCCGCG CGAATCCTGG
CCCGAGCCGC GCGACAGCGC CGAGCGCGAG GCGATGCGCG CGCTGCTGCA CATCGCCGGG
CCGCGCGTGA CCGGCGACGA CGACGCGCCC GATGACACGA TCGGCGACGC GTCCGATGAC
GCGCCCGGCA CGCGCGAGCA GCTACGCGCC CGTCTCGCTG GCGCCGGCAG CGCCTTGCGC
GGACTGGGCG ACGACGCGCT CGACGCGCTC GTCGCGGTCG CCACCCGCAA CGTCGAGCTG
CTGCGCACGG CCCAGCATCG CCCCTTCCGC GGCGACGCGG TGCTCTTCAC CGCCCGCCAG
ACGCGCACCC AGGAAGGCTT CTCGGGCGCC GCCTGGAAGC CGCACATCGA CGGCTGCATC
GAGTCCATCG AGTTCGAGTG CAGCCACGCA ACGATTCTGC AGGATGACCG CGCCGAGTTT
ATCGGCCAAG CGGTCGAACG ACGACTCAAC GAACGCGATC ACAGGAGAGA CGCGAGATGA
 
Protein sequence
MHSSLQAHAS AASAQPHGAD APRVELPLSA AQHGIWLGQQ LDPDSPAYNT AECIEIHGPI 
DPVHFEAALR QVMHEAQTLR MAVVAADGSR QRERAMSDWR FVYRAYDAGA DWLATRAGDA
LASAGAATVE TAAADAPDAA ARAWMDADMR TPVDLARGPL FAHALFQCAA DRYLWYFRAH
HIALDGLGFS LVARRVAAVY SARMQGTAAR SPAAFGPLEQ VVREDLAYLG SDDYQRDRGF
WIDQMSDIDT PATLARRAAP VSRTVVRDAA RLDPALHASL QAAARGARTT WPALMLLATS
VYLGRSTGAS ELVLGLPVMA RLGSAAMRVP CMAMNIVPLR LPVPAEGEFA AGLAQVTQLL
ARVRPHQRYR YEHLRRELGR VGGDRRLFGP VVNLMPFDHP LSFAGHPATT HNLSAGPVED
LSIGVRACAA GGPGADGAER APTLKLELDG NPACYRADEL AAHRRGLCDT LAEIADNLND
LDGPTAPRAS ARPVRAAHAG ALIAGECAFG PARSVCALLL ERAARHGEDT AVVCGDTQLS
YAQLVAAASA LALELRRRGA GPETLVAVLL PRSAEAIVAI VAVLLAGGAY LALDPDAPRS
RNQAIDEHAA PALVVTDETA GNALGDAIGT RAVRVDQILG RRDRALGVTP ALPTEFAPDS
LAYVIYTSGS TGAPKGVQVE HDALAHFVAG AMQRYRVRHR DRVLQFAPLH FDASIEEIFV
TLCAGATLVL RASDMLDSVP RFLEACAAQA ITVLDLPTAY WHELAYSIST GAATLPPCVH
TVIIGGEAAL PERVARWRSS VASLVALLNT YGPSEATIVA TVAMLAGPDP IAVDGDEVPI
GLPLGNTGAA VLDRRAQPVP RGAIGELYLT GPSLARGYLG RDDLSESRFV TLQHLPGTPR
AYRTGDLVRQ RADGQLVFIG RVDDECKISG HRVSPAEIET VLAAAPGVRE AAVIARDEAG
SKYLAAHVAA DAPAPTPAEL RQHLRAALPP ALVPSAIHLH ERLPRNPAGK IDRAALARAQ
VQAAVSDAAP AAPMSPGERQ VLEVWHQVLG LNALRPEDDF FALGGQSLQS IQVANRLGVA
LGRDVPVALL FRYPTAAGLA WALEHELAPA ASAGPRDGRA ANPGLSPLLS IAGQSDEFSG
HGQADETTTR TPLFCVHPAA GLSWSYLALT RHLDSRRAVY GIQSPALSGD AATPEGYASL
ADLASDYVAR IRAVQPSGPY ALLGWSMGGV IAHAMAARLQ AQGHEVALLA LLDAYPRESW
PEPRDSAERE AMRALLHIAG PRVTGDDDAP DDTIGDASDD APGTREQLRA RLAGAGSALR
GLGDDALDAL VAVATRNVEL LRTAQHRPFR GDAVLFTARQ TRTQEGFSGA AWKPHIDGCI
ESIEFECSHA TILQDDRAEF IGQAVERRLN ERDHRRDAR