Gene Information |
Locus tag | Haur_2106 |
Symbol | |
ID | 5733994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2633481 |
End bp | 2639264 |
Gene Length | 5784 bp |
Protein Length | 1927 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279247 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544874 |
Protein GI | 159898627 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
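The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are inclusive coordinates and that the CDS is unspliced with a single terminal stop codon (the record lists "Is gene spliced: No"). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2633481, 2639264)
print(length)                     # 5784
print(protein_length_aa(length))  # 1927
```

Both values agree with the Gene Length (5784 bp) and Protein Length (1927 aa) fields. The strand does not affect this calculation, since it only uses the span between the two coordinates.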
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCAAC ATGAAGTTGA GCATTTTCGG CTATCGCCGC AACAAACCCA TACATGGTTG GTTCAACCAC AAAGCCAGCA ACCCTTGGGC ACATGGCTGC TGGTTGAGTT AACCACTCCG CTTCGTTATG AACGTTGGCA AGCTGGCTTG AATGTTGTGA TCGAGCGCCA TGAGGCGTTG CGTACCCGTT TCGAGCAGAT CGCTGGGCTA AAGCTACCAG CCCAAGTGCT GCACAACCAA ACAGTGGTCT TGCATCAGCA GCAAATAGCC GAATCGCATC AGATTGCTGA GCTTGCAGCG CCAGCTGATG CTGGTTTGAT GCAGATTACG TTATTTGAGC ATGGCCAACA GCAATGGCTT GGGTTGTGGC TAGCGGCCTT GGTTGGCGAT GCCACTAGTG CCCGATTGCT GCTCGAAGAA TTGACCCAAG CCGCGTTGGC TCCGCATGAA TTGAGCGCAA GCGATGAGCT GATGCAGTAT ATTGACGCTG CTGAATGGCA AAATGGCTTG CTCGAAGCAG CCGAAAGCGC CGCTGAACGG GCATTTTGGC AAACGCAGGC GATTAAGCAA GCACCGCATG ATCTGCGGGG CTTTGCACGC TTGACCCAAA CCCAGCCCAC TCGGATCAAG CTTAACCTTC CTGCAAGCAG CAGTATCGCG ATTAATGCTT GGTTTACTCA ACATAATGTT GATTTAGCGA GTACGGTGCT CAGCCTTTGG CGTTGGTTGC TCAGCCGCAG CAATTATGGG CAAACGCCAG CGTTGGCGCT GGCCTGCGAT GGCCGTTCGT ATGCAGAGTT GGCCAATGCC CAAGGCTTGT TTGAGCGCTA TCTGCCATTG TTACCAAACG AACTTGCTGC CGATCAGCCA ATCGCCGAGG CTATAACCAC GCTAGCCCAA CAGCTAGCCG ATTTAGCGCA GTTCCAAGAG TATTTCAGTT GGCAGCAACT TGCGTTGGAT CAGCCGTTAG CGTTGGCATT TGCCCATTAT CGTTGGGAAA CAGCGGCGCA CTATCAGCTT GAACACCTGA CCAGCCATAC TGATCTGTTT CGCTGTAAAT TAAGCCTGAT CGAGCAAGCT ACAAGCTGGC AATTGACCTT AGATTACGAT GCAACTAGCA TGCGCTCTGA GGTTGCCGAG GCCTTGGCTG AGAGCTTAAT CACAATGCTG GTTTGGCTTG GGCAACAATC CAACCCGACC TTTGGGCAAC TGCCAATCAT TGGGAGCAAT ACCCAAACAT TATTGACTAA GCAGGTCAAT GCAACCGATC GGCCATTTGC TGCAACGCCA ATTCACGATC TGATTGATCA GCAGGCACTA CACAATCCGC AAGCAATTGC TGTGCAATTT GGTGCAGAGC AACTGAGTTA TGCCGAGTTG GCTCAGCAAG CCAACCAACT GGCCCAACAA TTAATCCAAC ACGGTATTCA ACCCGAGCAG CGGGTTGGCT TGTATCTTGA GCGCTCGCCG CTGATGGTCG TGGCCTTGTT GGCGTGTCTC AAGGCTGGCG CGGCCTATGT GCCCTTAGAG CCAGAGTATC CCGCCGAGCG GATTCAGTAT ATTCTTGCTG ATGCGGCGAT TCAGTTGGTG TTGAGCCAAA CCAGCCTCAT GCCTAGTTTG CCGTGTAGCG TTGCCCAATT GGCGGTCGAT CAGTTGCAAT TTGATCAAGC GAGTGCCGCG CCACGTTTGA ACTATCAGCC TGCGCAATTG GCCTATCTGC TGTATACCTC TGGCTCGACC GGCCAGCCCA AGGGTGTGAT GGTCAGCCAC GCTGGTTTGA GCAACTATGT GCAATGGGCG 
ATCACGGCCT ACGATTTGGC GGCTGGTACA GGTTCGTTGG TGCATTCGCC ATTAGCCTTC GATTTGACCG TAACCAGTTT GCTTGTGCCC TTGTGTGCTG GCCAAACCGT GCGTTTATTG CCAAGCAATG CTGGGGTTGA AACGCTAGCC CAAGCACTGC GAGCCAGCAC TGATCTGAGT TTGCTCAAAC TGACACCAGC GCATTTGGCG GTGCTGAATC AATTGATCAC TAGTGCTGAT TTGGCTCAAC GCAGTAGGGC CTTGGTGATT GGCGGTGAGG CGCTTGATGC AACTACGTTG GCTCCATGGC GCACCCACGC TCCTGAAACC CGCTTGTTCA ACGAATATGG CCCAACTGAA ACAGTGGTTG GCTGTTCGAT CTACCAAACC CAAACCACTG ATTCGGCTGC TGGCGCGGTT TCGATTGGTT TGCCAATTGC CAATATGCGT TTGTATGTGC TTGATGAGCG CTTGCAACCT GTGCCATTTG GGGTTGTTGG TGAGCTGTAT ATTGGTGGAG TTGGGGTTGC CCGCGGTTAT AATCAGCGCC CTGATCTGAC CGCTGCCCAG TTTGTACCTG ATAACCTGAG TGGAATCGCT GGCGCACGGC TGTATCGCAC TGGCGATTTG GCGTGTTGGG CCTGGGATGG AACGCTGGAA TATCTTGGGC GGCGTGATAC GCAAATCAAA TTGCGTGGCT ATCGAATTGA GCTGGGCGAG ATTGAGGCAG TGCTGCAACG CTTGCCAATG GTCGCTTCAG CACTGGTCTT GCTGCGTGGC ACAGGCGACG ATCAACGCTT GGTCGCCTAT CTCCAAGCCA CACCCGATGC CGACTCCACG CAATTGAGTG AACAAGTGGT GTTGAAATAT GCCCAACAAT TCCTGCCACA GTACATGTTA CCAAGCAACG TTGTGTTGGT TGAGCAATGG CCGTTGACCG CGAATGGCAA AATTGATCGG GCGGCCTTGC CCGAACCAAC CGCTATAAAC AATTATGTTG CCCCAACGAC CCCTGAAGAA GAAATTTTGG CAGCCATTTG GGAACAGGTG CTTGAGCACC CAATGATTGG GATTGATGAT AATTTTTTTG CATTAGGCGG CAATTCAATT CGCAGCATTC AGGTGGTGGC CCAAGCCAAA CAGCGCGGCT TAAATCTGAG TGTTGAAATG CTGTTCAATC AGCCGACGAT TCGTAGTTTG GTTCAAACTA TGGTCTGCTC TACAGAAAAT CAAATAATCG AATACACACC CTTCAGTTTG ATTAGCCCTG CTGATCATGC CTTGCTCCCA AATACTATTG TTGATGCCTT CCCGATTGCC AAGTTGCAGG GTGGCATGAT TTTCCACAAC CAATTCAACC CTGAACAAGC GCTGTACCAC GATATTTTTA GCTATCGGAT GCGGGTCGTG CTCGATTTGG CGTTGTTGCA ACTGATCGTC GATGATTTAG TGGTGCGGCA TCCAGCGCTA CGCACTAGTT TTGATCTGAC CAGCGCCAGC GAGCCGTTGC AAGTGGTGCA TGCCCAAGGC GCAAACCTGT TGAATATTAT CGATCTGCGC AACCAGCCTG TTGAACAGCA CGATCAATTA ATTGAAGCTT GGATCGCCGC CGAAAAGCAG CGCGGTTTTG AGCCAAGTAG CCTGCCGTTA TTGCGGTTCC AAGTGCATGT GCGGGCTGAT GATGAATTGC AATTTTCGCT GAGCTTTCAC CATGCGGTGA TCGATGGCTG GAGCGATGCG ATAATGCTGA CTGAGCTGTT TAGCGATTAT GCGCGGCGCT TGCAAGGCCA AACCAGTAGC CTTGTTGCGC 
CTCAAATTGG CTATCACGAA TTTGTACGGC TAGAACAAGC AGCAATTCAG AATCCTGCGA CCCAGCAATT TTGGGCTGAC CATTTGGCCC AAGCCAGCCC GATGCGCTTG CCGCGCTGGC CGAATGTGCC GCGTTCAAAC ACCAGCCAAT CACAACCAGT TGCAATTAGT GCTGAGCTTT CGCAAGCACT TAAAGCCTTG GCTCGCCAGC TTGCCGTGCC AATTAAAGAT GTGCTGTTGG CAGCGCATTT ACGGGTGATT AGCATCCTGA CTGGTCAGTT CGATGTAGTG ACCAGCATGG TTTCGAGTGG GCGGCCTGAA ACCCTTGATG GTGAACGGGT TTTGGGCTTG TTTATCAATA GTATTCCTCT ACGAATGCAG CTGAATCAGC CAACGTGGCG TGAACTTATT ATGCAGACCT TTGCTGCTGA ACGTGCCAGT CTGGAGCATC GGCGCTACCC AACTGCCGAG TTGCAACGCC ACAACGGCGG TTTGGCTTGG TCGGAGAGTT TGTTCTACTT CACCCACTAC CATATCTTCC AAGCCTTGCA AAACATCAGT GAGCTGGAGT TGCTTGATGT GCTGCCCTAC GAAGTTTCGA GTTTTCCATT AGTTGCCAAC TTCCGCATCG ATCCCTTTAC GAATGACATT AACTTGAGTT TGACCTGTGA TGGGCGAATT TTGACCAATG CCCAAATCGA AGCGATTGCA GGCTATTATC AAGTCTGTCT GACCGCGATG GTTGCCGACC CTGCGGCAGA TTATCGCGCT ATGCCATTGT TGAGTGATAC TGAGCAACAC CTATTGCTTG GATTTAATCG CACCGAAGTT GCACAATCGT CGCCTGATCT TGTTGGTTGG CTGGCCGAAG TGGCTCAACA GCAGCCAACT GCCCAAGCCA TCCAAGCCTA TGATGGGGCG TTGAGCTATG CTGAGCTTGA GCAACGCGCA ACGGCTTTGG CGGGCTATTT ACAAACGCAG GGGATTGGTG CAGAAACCCG GGTTGGTATC AGCCTTGAGC ATTCAACCAG CTTGATTGTG GCGATTTTGG CGGTGCTCAA AACAGGGGCT GCTTATGTGC CACTTGACCC CAACTACCCA CGTGAGCGGC TTGAATTGAT GGCGAGCGAT GCTGAATTGA AGCTCTTGAT TTGCCAACAG CCAGACATCT GGCAAAACCT ACCTGCAAAC TCTGCCTGTT TAGGCCTTGC TGATTTAGAT TCTGCCCAAG CGCCATTTGT GCCAGTCACG ATTCATCCGG CGCAGGCCGC CTATCTGATC TATACCTCTG GTTCGACAGG CCGCCCCAAG GGTGTGGTGG TCAGTCATGC CAATCTGCAT AGCTCCACGT TTGCCCGAAC GCTTGCCTAT CGCGAGCCGC TGACGAGCTT TTTATTGCTT TCATCGTATG CCTTCGATAG CTCGATCGCT GGAATTTTCT GGACACTGAG CCAAGCTGGC TGTTTGGTAC TGCCCGATCA AGCGCAACGC CACGATGTTC TAGCGCTAGC CAGCATGGTC GAACATCATC AGATTAGCCA TACCTTGGCA ATTCCGTCGT TGTACGCGGT ATTGTTGGAA CAAGCCGAAT TAAGCCAATT AGCTAGTTTG CGCGTGGTCG TGGTCGCGGG CGAGGCCTGT ACCACCAGCT TGGTCAATCG CCATTATCAA CAACTGTCAA CGTGTGCCCT ATACAACGAA TATGGCCCAA CCGAGGCGAC GGTTTGGGCA AGCGTTGCCA AACTAGTACC GCAACAACCG ATCTCAATTG GCGGCCCGAT TGCCACGATC CAAGCCTATG TAGTTGATCC 
AAGCTTGCAG CCTGTGCCAA TTGGAGTTGC TGGCGAATTG TTGATTGCTG GTGCGGGTAT TAGTCGCGGC TATTGGCAAC AACCAGCGCT GACCGCCGAG CGGTTTATGC CCGACCCATG GGCCGAACAG CCAGGCCAGC GCTTGTATCG CACTGGCGAT TTAGCCCGTT GGTTGCCCGA TGGTCAGCTT GAATTCTTAG GTCGCATCGA TCAACAGGTC AAAATTCGCG GTTTTCGGAT TGAGCTTGAA GAAATTGCCC AACTGCTGCG CCAACACCCC GCCTTACGCG AGGCTGTGGT TACCGCTCAG CCCGATCAGC ATGGTCAATT ACGCTTGGTG GCCTATATCG AGCCACGCAA TTAA
|
Protein sequence | MQQHEVEHFR LSPQQTHTWL VQPQSQQPLG TWLLVELTTP LRYERWQAGL NVVIERHEAL RTRFEQIAGL KLPAQVLHNQ TVVLHQQQIA ESHQIAELAA PADAGLMQIT LFEHGQQQWL GLWLAALVGD ATSARLLLEE LTQAALAPHE LSASDELMQY IDAAEWQNGL LEAAESAAER AFWQTQAIKQ APHDLRGFAR LTQTQPTRIK LNLPASSSIA INAWFTQHNV DLASTVLSLW RWLLSRSNYG QTPALALACD GRSYAELANA QGLFERYLPL LPNELAADQP IAEAITTLAQ QLADLAQFQE YFSWQQLALD QPLALAFAHY RWETAAHYQL EHLTSHTDLF RCKLSLIEQA TSWQLTLDYD ATSMRSEVAE ALAESLITML VWLGQQSNPT FGQLPIIGSN TQTLLTKQVN ATDRPFAATP IHDLIDQQAL HNPQAIAVQF GAEQLSYAEL AQQANQLAQQ LIQHGIQPEQ RVGLYLERSP LMVVALLACL KAGAAYVPLE PEYPAERIQY ILADAAIQLV LSQTSLMPSL PCSVAQLAVD QLQFDQASAA PRLNYQPAQL AYLLYTSGST GQPKGVMVSH AGLSNYVQWA ITAYDLAAGT GSLVHSPLAF DLTVTSLLVP LCAGQTVRLL PSNAGVETLA QALRASTDLS LLKLTPAHLA VLNQLITSAD LAQRSRALVI GGEALDATTL APWRTHAPET RLFNEYGPTE TVVGCSIYQT QTTDSAAGAV SIGLPIANMR LYVLDERLQP VPFGVVGELY IGGVGVARGY NQRPDLTAAQ FVPDNLSGIA GARLYRTGDL ACWAWDGTLE YLGRRDTQIK LRGYRIELGE IEAVLQRLPM VASALVLLRG TGDDQRLVAY LQATPDADST QLSEQVVLKY AQQFLPQYML PSNVVLVEQW PLTANGKIDR AALPEPTAIN NYVAPTTPEE EILAAIWEQV LEHPMIGIDD NFFALGGNSI RSIQVVAQAK QRGLNLSVEM LFNQPTIRSL VQTMVCSTEN QIIEYTPFSL ISPADHALLP NTIVDAFPIA KLQGGMIFHN QFNPEQALYH DIFSYRMRVV LDLALLQLIV DDLVVRHPAL RTSFDLTSAS EPLQVVHAQG ANLLNIIDLR NQPVEQHDQL IEAWIAAEKQ RGFEPSSLPL LRFQVHVRAD DELQFSLSFH HAVIDGWSDA IMLTELFSDY ARRLQGQTSS LVAPQIGYHE FVRLEQAAIQ NPATQQFWAD HLAQASPMRL PRWPNVPRSN TSQSQPVAIS AELSQALKAL ARQLAVPIKD VLLAAHLRVI SILTGQFDVV TSMVSSGRPE TLDGERVLGL FINSIPLRMQ LNQPTWRELI MQTFAAERAS LEHRRYPTAE LQRHNGGLAW SESLFYFTHY HIFQALQNIS ELELLDVLPY EVSSFPLVAN FRIDPFTNDI NLSLTCDGRI LTNAQIEAIA GYYQVCLTAM VADPAADYRA MPLLSDTEQH LLLGFNRTEV AQSSPDLVGW LAEVAQQQPT AQAIQAYDGA LSYAELEQRA TALAGYLQTQ GIGAETRVGI SLEHSTSLIV AILAVLKTGA AYVPLDPNYP RERLELMASD AELKLLICQQ PDIWQNLPAN SACLGLADLD SAQAPFVPVT IHPAQAAYLI YTSGSTGRPK GVVVSHANLH SSTFARTLAY REPLTSFLLL SSYAFDSSIA GIFWTLSQAG CLVLPDQAQR HDVLALASMV EHHQISHTLA IPSLYAVLLE QAELSQLASL RVVVVAGEAC TTSLVNRHYQ QLSTCALYNE YGPTEATVWA SVAKLVPQQP ISIGGPIATI 
QAYVVDPSLQ PVPIGVAGEL LIAGAGISRG YWQQPALTAE RFMPDPWAEQ PGQRLYRTGD LARWLPDGQL EFLGRIDQQV KIRGFRIELE EIAQLLRQHP ALREAVVTAQ PDQHGQLRLV AYIEPRN
|
| |
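The gene sequence above is displayed in 10-base blocks and already reads in the coding direction (it begins with ATG and ends with TAA), so it can be translated directly once the spaces are removed. The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of translation table 11, which match those of the standard genetic code (the two tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the NCBI ordering (bases T, C, A, G at each position).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing) is ignored. Any trailing
    bases that do not form a complete codon are skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGCAGCAAC ATGAAGTTGA GCATTTTCGG"))  # MQQHEVEHFR
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function would be expected to reproduce the complete 1927-residue protein, provided the displayed blocks are copied without alteration.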