Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3129 |
Symbol | |
ID | 5735001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3950482 |
End bp | 3955932 |
Gene Length | 5451 bp |
Protein Length | 1816 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280272 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001545894 |
Protein GI | 159899647 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3208] Predicted thioesterase involved in non-ribosomal peptide biosynthesis |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.975665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAG TCGCACGCCA GCTTGAAGAT CTTTCGCCCG AACGCCGCGC CTTGTTGGCC CAACGTCTGC GCCAACGCCA AGCCGCCAAG CCAGTGCCCA GCATTCCAGC GCTGGCTCGG ATCGGCGAAC ATCCAGCCTT TGAACTTTCG TTTGCCCAAC AACGGCTGTG GTTTTTAAGC CAGTGGCAAC CAGAAAGCGC CGCCTATCAC ATTCCAGCCA CATTTAGCAT TGCTGGTGAA ATTAATGTTT CTGTATTGCA AACTTGTCTT GATAAAATTA TTCAGCGCCA TGAAGTGCTG CGTAGCACGA TCGAAGTGCT CAACGACCAG CCAATGCAGG TTATTCAGCC ATTCCGCAGC CTTGATTTGG AGCTTGTTGA TCTCCGTGGG CTGAGCAATG AGCAACAAGC CAGTCAACGC CAGCAGCAGA TCGAACGCCA CAGTCAGCAG CCGTTTGATC TCAGCAAAGA TTTAATGCTA CGCGGATTGT TGCTGCACAC GGCTGCTGAT CACGCTGAAT TGGTATTGAC GATTCACCAT ATTGCCTGCG ATGGTTGGTC GATTGGGGTC TTGTTGCGTG AGCTAGGCCA ACTTTACGAA GCTGGGCTAC GCGGCGAGCA ACTTGAGCTA CCAGCGCTGC CAATTCAATA TGCCGATTTT GCGGTTTGGC ATAAACGTTA TGTGCTTGAG CAAGTGTATC AGCAACATTT AAATTTTTGG CAAGAACAAT TAAGCGGCAC GCTACCCTTG CTCAATTTGC CAACCGACCA TGCGCGACCA GCGGTCAAAC GCGACCTTGG GGCAACCGTC GAATATCATC TACCTTGGAG TTTGGTCGAG GCGGTTGAGC GCTTGAGCCG CCAAGAGCGA GCCACGCCGT TTATGCTGTT TATGGCGGCT TTTCAGGTTT TGCTCTATCG CTATTCAGGC CAAAGCGATC TGATTATTGG TACGCCGATT GCCAATCGCA CCCGCCGCGA AATCGAAAAT CTGATTGGAT TTTTCGTCAA TACCTTGCCA ATTCGGGTCA ATTTGGCTGG CTCGCCTAGC TTTCGTAGCC TAATTGCCCA AGTGCGCCAA ACCTCGTTGG CCGCTTTCGA GCATCAAGAT ATGCCGCTCG AACACTTGAT TGATGTGCTC AAAGTCGAAC GCAGTTTGAG CCATAACCCA CTGTTTCAAG CCCTGTTTGT GCATCAAACC ACCAGCATCC AAACTGTCGA TTCAGGTGAA TTTGGCTTGC AATTTGGCGG GGCAATTGAA ACTGGCAGCG CTAAATTTGA TATTAATCTG AATTTGGCCG CCAATCGTGC CACTTTTGAA TACAATACCG ATCTGTTTGA GCGCAGCACG ATCGAACGCA TGGCCAGCCA TTTCCATAGC TTGTTGGAAT ATGCGGTGAC CAATCCCGAT GCCAGCATCG AGCATTTGCC TTTGCTGAGC AGCAGCGAAC GCCAACAATT GCTGCAAACC TGGAACAGCA CCAGCGCCAA TTATCCTGCG GTTGATTCGA TTGTGCGTTT GTTCGAAGCT CAAGCGGCGC GAGTCCCAGA GCGCACGGCC TTGCATTTTG AAGGCCAAAC CCTGAGCTAC GCCGAATTGA ATCAACGCGC CAACCAACTA GCACATAGCT TGCGCCAACG CGGCATCGGC TGCGATATGC GAGTAGGCTT GTTTATCGAT CGCTCGTTGG ATTTATTGGT TGGCGCGTTG GGAATTCTCA AAGCGGGCGC GGCCTATGTG CCAATCGACC CAATTTACCC CCAAGATCGC ATCAGCGCTA TGCTCGAAGA TGGTGCAGTG AGTTTGCTGC TTACCCACGC TGAGCTAGCC GCCGAATTGC CAAAACTTGA TCTTGAGGTG CTGTGCCTCG ACCAAGCATG GCCGACGATT GCCCAAGCGC CAACGCACAA CTTGAATTTG GCGCTTGAGC CGCGCAGTTT GATGTATGTG CTGTTTACCT CTGGCTCGAC TGGCCGCCCC AAAGGTGTGG CAATCGAACA CCATAATTAT GTCAACTATA TTCAAGGCTT ATTGCAACGA ATTGAAGCCG AAGATGGCTG GAGCTATGCC TTGGTTTCGA CCTTCGCGGC AGATCTTGGC ACAACCAATG TGTATGGAGC CTTGTGCAGC GGCGGCGAAT TACATATCGT GGCCTATGAA CGCGCCACTG ATCCCGAAGC GTTTGCAGCC TACTTTCGCC AGCATCGCAT CGATGTGATG AAACTTGTAC CCAGCCATTT CGAGGCCATG CGCGGCTTGA ACAACTTGGC CGATGTCATT CCCAAGCAGC GCTTGATTTT GGCGGGCGAA GCCAGCCTTT GGGAGCAGCT TAGCGATATT CGCCAACTTC AGCCTAGCGT GCAACTGCAA AACCACTATG GACCAACCGA AACCACGGTT TCCATGCTGA CCTATCCAAT TCCCAGCCAA CCACACTACC CCAGCAGCAC CGTGCCACTT GGCAGGCCAT TGGGCAATGT GCAAATTTAT GTGCTCGATC GCCGAATGCA GCCAACGCCG CAAGGTGTAC CAGGCGAACT CTACGTTGGC GGCGCTGGTG TGGGGCGTGG CTACATCGGG CGGCCCGATC TGACCGCCGA GCGTTTTGTG CCCAATCCAT TTAGTACCGA AGCTGGAGCG CGGCTCTATC GCAGCGGCGA TTTGGTGCGC TATCAGCCTG ATGGCGCGAT TGAATTTTTG GGTCGGATCG ATCTACAAGT CAAAATTCGT GGCTATCGGG TTGAGTTAAG CGAGATCGAA ACCGCGATTC AAGCTCAGGC ACAGGTTGCC AATAGTGTAG TGATTTTGCG CGAAGACACG CCAGGTGATA AGCGCTTGGT CGCCTATATC GTGCCAGAAG CAGGCCAAAG CCTGAATATT GGCAGCATTC GCGAGGCCTT GCGCAATAGT TTGCCCGATT ATATGGTGCC AACCGCCTTT GTTGAATTAG ATGGATTGCC GTTGAACCCC AATGGCAAAA TCGAACGCCG CGCCTTGCCC GCCCCCAGCA ACGAGCGCAA TCTCGATAGT TACGTTGCGC CACAAACCGC CACTGAACAT GAATTGGCAG GAATTTGGGC CGAGGTTTTG GGGCTTGATC AGGTTGGCAT CGACGATAAT TTCTTCGATT TAGGCGGCGA ATCATTTAAA GCGATTCGGG TGGTGCGCAA AATTGGCAGC CATATCAGCG TTATGACGCT GTTCAAATAT CCGACAGTGC GTGAATTGGC CGCTCATCTC TCGGGTGCAA GCAGCGCTGA GAGCGGCGGC ATGCTCTATG AATTGAGCAA AGCACAGGCT AAACAGCACA CCACGATTGT GGCAATTCCC TATGGTGGCG GCAGCGCGAT CACCTATCAA CCATTGGCCC AAGCCATGCC CAAGGGCTAT CGGTTATTGG CCGCCGAGCT ACCTGGCCAC GATTTCAGCC GCCCCGACGA GCCATTGCAA GCCTTAGAAG TCGTGGCGAG TCAGCTTGCC AGCGAAATTC AAACCAAAAC CCAAGGGCCA ATTGTGTTGT ATGGCCATTG TGTGGGCAGC GCCATGACCG TGGAAATTGG CCGTTTGTTG GAGCAAGCAG GCCGCGACGT TCAGGGGATT GTGCTCGGTG GCAATTTTCC GGCGGCTCGC GTGCCAGGCC GCTTCTTTGA ATGGCTCAAC AAACTTATGC CCGCCGATCG TTGGATGTCG GATCGGACAT ATCGCGATTT TCTCCGCGCG TTGGGTGGCT TCACCGAAAT TGTCGATCAA GCTGAACAAA CCTTTGTGAT GCGCAGTTTG CGCCATGATG CGCGTGAAGT TGAGCGCTAT TTCACCCAAG CCTTTGCCCA AAAACAGTCG CAACAACTCA AAGCTCCAAT CGCCTGTATC ATCGGCGAGA TGGATCGAGC GACCGAATAT TACCAAGAGC GCTACCGCGA ATGGGAATAT TTCAGCAACA ATGTGACGCT GCATGTAATT CCCCACGCTG GCCATTATTT TCTCAAACAT CAGGCCAGCG AATTGGGCCA AATTATCGAG CAACAAACCG AGCAATGGCA ACAACCACGG CCAATTCAGC CAACTGCCGC CAAATCAAAA TCGCATAAAA CCAGCATGCC CAGCCTACGA ATTTTCTTTA TGGTTGCGTT AGGCCAATTG GTTTCGATGC TTGGCTCAAG TTTATCAAGT TTCGCCTTGG GCATTTGGAT CTATCAACGA ACTGGCACGG TCAGCGATTT TGCCTTTACC GCGATTGCCT CGATGCTGCC AAGTTTGCTG GTTTCGCCAT TGGCTGGAGC AATTGCCGAC CGCTGGGATC GGCGCTGGAT TATGATTATT GCTGATACAA TTTCGGCGCT TTCGACAATT GTGATTGCAA TGTTGCTGTG GGCCAATAAA CTTGAGGTTT GGCATATTTA CCTCACGGCG GCAATTAGCT CGATTGCCGG AACCTTCCAG CGCCCAGCGT ATGCCGCAGC CATGACCCAG CTTGTGCCCA AACAATATCT GGGCCATGCC AACGGCGTGA TTCAGCTTGG CTCGGCTACG GGCGGACTGA TTGCACCGTT TATTGCTGGC GGCATGGTGG CCTTCTTTGG CTTGGGCGGG GTCTTTTTGC TCGACTTCAT CTCGTTCAGC TTGGGAATTG GAGTATTGTT CTTGGTGCGC TTCCCCAACA CGCTCTATCA CAAGCGCGAG GAGCCATTGC TGCGCGAAAT TGTGCGCGGC TGGGAATATA TCATCAAGCG CCCAAGTTTG GTGGCGATGG TGCTGTTTTT CGCCTTGGGC AACATCTGGT TTGGCATCGC CAGTATCAGC ATGAGTCCGT TGGTGCTTTC GTTTGGCGGG CCTGCCGAAT TAGGCATTGT CAGCGCAGCT TGTGCTTTGG GCGGCTTCCT CGGCGGCTTA TTTATGAGCT TATGGGGCGG TTTGCAGCGT CGTGCCGAAG GCATGGTTGG CTTCGTCATT CTCGAAGGCT TTTTCATTGC CTTGGCCGGG TTGCGGCCTT CGGTATGGTT GGTAGCCTTA GCGATGTTTG GAATGTGGTT TGCGATTTCG CTGGTCAACG CCCACTGGCA AGTGCTCATT CAAACCAAAG TTGGCCTGGA GCTACAAGGG CGGGTGCAAG CAACCAACCA AATGCTGGCC ATGCTCAGCA TCCCGCTGGG CTATTGGCTG GCTGGGCCAT TGGCTGATAA CTTGTTTGGC CCTTTGCTCG AACCAAACGG CGCACTCAGC AGCAGTTTGG GTTGGCTCTT CGGGGTCGGG CCTGATCGCG GGATTGGCCT GTTGATGGTG GTGGTTGGCT TAGGTGCAGC AATTTGGGCC TTGATTGGCT TCAACTATCG CCCCTTGCGC TACATGGAAG ATGCCCTGCC CGATGCCATC CCTGATGCTG AAATCGCCAG CGACCGCGAT ACGATTCAAG CCCAAGCCGA TGGTATAATC GCTGTGACAG CAAAAGGCTA A
|
Protein sequence | MTEVARQLED LSPERRALLA QRLRQRQAAK PVPSIPALAR IGEHPAFELS FAQQRLWFLS QWQPESAAYH IPATFSIAGE INVSVLQTCL DKIIQRHEVL RSTIEVLNDQ PMQVIQPFRS LDLELVDLRG LSNEQQASQR QQQIERHSQQ PFDLSKDLML RGLLLHTAAD HAELVLTIHH IACDGWSIGV LLRELGQLYE AGLRGEQLEL PALPIQYADF AVWHKRYVLE QVYQQHLNFW QEQLSGTLPL LNLPTDHARP AVKRDLGATV EYHLPWSLVE AVERLSRQER ATPFMLFMAA FQVLLYRYSG QSDLIIGTPI ANRTRREIEN LIGFFVNTLP IRVNLAGSPS FRSLIAQVRQ TSLAAFEHQD MPLEHLIDVL KVERSLSHNP LFQALFVHQT TSIQTVDSGE FGLQFGGAIE TGSAKFDINL NLAANRATFE YNTDLFERST IERMASHFHS LLEYAVTNPD ASIEHLPLLS SSERQQLLQT WNSTSANYPA VDSIVRLFEA QAARVPERTA LHFEGQTLSY AELNQRANQL AHSLRQRGIG CDMRVGLFID RSLDLLVGAL GILKAGAAYV PIDPIYPQDR ISAMLEDGAV SLLLTHAELA AELPKLDLEV LCLDQAWPTI AQAPTHNLNL ALEPRSLMYV LFTSGSTGRP KGVAIEHHNY VNYIQGLLQR IEAEDGWSYA LVSTFAADLG TTNVYGALCS GGELHIVAYE RATDPEAFAA YFRQHRIDVM KLVPSHFEAM RGLNNLADVI PKQRLILAGE ASLWEQLSDI RQLQPSVQLQ NHYGPTETTV SMLTYPIPSQ PHYPSSTVPL GRPLGNVQIY VLDRRMQPTP QGVPGELYVG GAGVGRGYIG RPDLTAERFV PNPFSTEAGA RLYRSGDLVR YQPDGAIEFL GRIDLQVKIR GYRVELSEIE TAIQAQAQVA NSVVILREDT PGDKRLVAYI VPEAGQSLNI GSIREALRNS LPDYMVPTAF VELDGLPLNP NGKIERRALP APSNERNLDS YVAPQTATEH ELAGIWAEVL GLDQVGIDDN FFDLGGESFK AIRVVRKIGS HISVMTLFKY PTVRELAAHL SGASSAESGG MLYELSKAQA KQHTTIVAIP YGGGSAITYQ PLAQAMPKGY RLLAAELPGH DFSRPDEPLQ ALEVVASQLA SEIQTKTQGP IVLYGHCVGS AMTVEIGRLL EQAGRDVQGI VLGGNFPAAR VPGRFFEWLN KLMPADRWMS DRTYRDFLRA LGGFTEIVDQ AEQTFVMRSL RHDAREVERY FTQAFAQKQS QQLKAPIACI IGEMDRATEY YQERYREWEY FSNNVTLHVI PHAGHYFLKH QASELGQIIE QQTEQWQQPR PIQPTAAKSK SHKTSMPSLR IFFMVALGQL VSMLGSSLSS FALGIWIYQR TGTVSDFAFT AIASMLPSLL VSPLAGAIAD RWDRRWIMII ADTISALSTI VIAMLLWANK LEVWHIYLTA AISSIAGTFQ RPAYAAAMTQ LVPKQYLGHA NGVIQLGSAT GGLIAPFIAG GMVAFFGLGG VFLLDFISFS LGIGVLFLVR FPNTLYHKRE EPLLREIVRG WEYIIKRPSL VAMVLFFALG NIWFGIASIS MSPLVLSFGG PAELGIVSAA CALGGFLGGL FMSLWGGLQR RAEGMVGFVI LEGFFIALAG LRPSVWLVAL AMFGMWFAIS LVNAHWQVLI QTKVGLELQG RVQATNQMLA MLSIPLGYWL AGPLADNLFG PLLEPNGALS SSLGWLFGVG PDRGIGLLMV VVGLGAAIWA LIGFNYRPLR YMEDALPDAI PDAEIASDRD TIQAQADGII AVTAKG
|
| |