Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1858 |
Symbol | |
ID | 5733747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2170286 |
End bp | 2174872 |
Gene Length | 4587 bp |
Protein Length | 1528 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279002 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544629 |
Protein GI | 159898382 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720 [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTATGA ATGAGCGCGA AGTGTTTGCC TTCCCGATGT CGTTTGCCCA GCAACGGCTC TGGTTTTTGG AGCAACTTCA GCCGAATAGT GCCTTGTATC ATATCGCTAG TTTGCTGGAA ATTCAGGGGT CGCTTGATCT GGCGGCGTTG CAGCAGAGCA TTAACCAGAT TGTTGTGCGT CACGAGACGT TGCGTACCAC CTTTGGCATG GTCGATCAAA CGCCGATGCA ATTGATCAGC AGTGAGTTGA CGCTCACCCC GGTTGTCCAT AATTTGCAGG CGCTTGAGCC TGAGCAACGC TGGTTTGTGG CGCTTGAACA GGCCAAAGCC AGTTGTTTGG TGCCATTTGA TTTGAGCCAA GGCCCGTTGG TGCGGCTTGA GTTGTTTCAA CTTGGGCCTG AGCATTGCTT GATGACGGTC GTGCTGCACC ACATTATTGC TGATGGCTGG TCGATGGAAC TTTTCATTCA GGAACTAGTC AGCAGCTATG AGGCCTATCG CGCGGGTTTT CATCCGCAAC TTGCGCCCTT GGCCTTGCAA TATGCCGATT ATAGCGAGTG GCAGCGTGAA TGGCTGGCTA GCCCACGCCA GCAGCAACAA TTGGATTTTT GGCAACAGCA ATTGGCCGAT GCCCCCAAGC GTTTGGAGTT AGCAACTGAT CACCCTCGGC CTGCTCAGCA GAGTTTTGCG GGGAGCACGT TGAGTTTTGG CATCCCAAGC CAACAAACTA GTCAACTGCG CAGTTTGGCC CAGCAAACCC AAACTACGCC GTTTATGCTA GCTTTGGCGG TGTTTGCCAG CTTGTTGAGC CGTTATAGCC GCCAAGATCA GGTGTTGATT GGCTCGCCGA TTGCCAACCG TACTACCGCC GAAGCCCAAC CGCTGATTGG CTTTTTCGTC AATACAGTGG TTTTCAAGGT ACAACTGAGT CCAAACCTCG ATTTGCTGAG CCTAGTTGAG CAGGTACGCG AGCAAAGTTT GGCGGTCTAT GCCAACCAAG ATGTGCCGTT TGAGCAGGTG GTGCAGCAGT TACAGCTTGA GCGCAATCTG AGCCATACGC CCATTTTTCA GGTGATGCTA GCTTACCAAA ATGTGCCGAG CCAGCAACTG AGCATCGCCA ACTTGACAAT CAAACAGCTG CCGCTGGATT TGGGTTATGC CAAATTTGAC CTAACTTTAT TTATTGAGGA AACTCCAGCA GGGCTAGTTG GGCGCTTGGA ATATAACCGT GATTTGTTTG AGCCTGCCAC GATTGCGCGC TTGCGCGACC ATTTCTTGCG TTTGCTAGGC CACGCCTTGG CTCAGCCAAC CCAACCACTG GCCCAAATCA GTATCTTGAG CGCTGCCGAA TGCCAACAGC TTTTGGTCGA TTGGAACCAA ACGCAGCAGC CGTTCCCCGA CCAACTTGGT TTGCAGCATT TGGTCGCGCA GCAAGTACAA CGTACTCCCA ATGCTCCAGC GATGCGTTGG AATAACCAAA TAATCTGCTA TACAGAGCTT GAGCAACGTG CCAACCAATT AGCCCATTTG CTGCTGCAAC GTGGCGTTAC CCAAGGCTCA ATCGTTGGAG TCTATGCGAC GCGCTGCCCA GAAATGATCA TCAGCTTGCT GGCGATTTTG AAGGCTGGTG CTGCCTACTT GCCGCTTGAT CCGGCCTATC CTGCTGAACG CTTGCACTAT TTGGTGGCCG ATTCGGCGGC GAGTTTGATT GTGCAAGCCA GCCATCAGGC GCTGCCAACC CTCGTTAGTA CAGCTGAAAC GCTTGATGTT GTAGCCGAAG CTGAAACGCT GGCTAGCTTG CCAACCACTG CTCCGATGGT TGATTTCGAC CCGCAGCAAT TGGCCTATGT GATTTATACC TCTGGCTCGA CTGGCAAGCC CAAAGGTGTG CTGATTCAGC ATCAAGGGGT GGTGAATTAT CTGCACTGGG CGATTCATTA TTATCCATTT GAGCAGGGTG CTGGTGCACC GCTGGCCTCG TCGTTGGCCT TCGATGCCAC AATTACGGCA TTGTGGGGGC CACTCTGTAC GGGCAAAACC ATCGATTTGC TGCCTGAGCA GGATGAGCTA GAAGTCTTGG CGCAACGCCT GAGCAGCGAA GATTATAGCG TGCTCAAAAT CACCCCAGCG CATATGGAAG CGCTTAGTCA GCTGGTTGCG CCCGACCAAA TTGGCTCAAG CAAGGCCTTT GTGATTGGCG GCGAGGCCTT GTTGCAGCAA CATGTGGCCT TTTGGCAAAC CAACGCTCCC AACCTGCGCT TGATCAATGA ATATGGCCCA ACCGAGACGG TAGTTGGCTG TGTGATCTAT CAAGCCCAAG CTGCGCCAAG CGAATGGGCT GCCGTGCCGA TTGGCCGCCC GATTGCCAAT ACCCAGTTGT ATGTGCTTGA TCCGGCAGGT TTGCCAGTGC CGATTGGCGT GCCTGGCGAG TTGTATATCG CTGGCTTGGG TGTTGGGCGC GGCTACCATG GGCGGCCTGA ATTGACCGCC GAGCGCTTTG TGCGGCTGGA ACAATTGGCT GGGGTGCAGG CAGAACTTGC CCGTTGCCAA CAGCCTCAGC CAGCGTTTGA ACGCTTGTAT CGTTCAGGCG ATTTGGTGCG CTATCTGCCC GATGGTAATC TCGAATATCT TGGGCGGATC GATCAGCAAG TCAAACTCCA TGGCTTTCGA ATTGAGCTTG GCGAGATCGA AGCCACGCTG GCGAGCCATC CGACGGTGCA CGCGGCGGTG GCCATGATTC GCGAAGATCG GCCTGGACAT AAGCGACTGG TTGCCTATGT GGTCGCTGAG CCAACTGCCA ATCAGGATAC TTCGATTGTT TTGACCCATG TTGCCCAACA GTTGCCCCAC TATATGCTGC CAAGCGTGGT GATTTGGCTC GATAGCTTGC CATTAACCCC CAATGGCAAA GTTGATCGTC AAGCGCTGCC CGCGCCTGAG ATCAACCAAA CTGCGCTTGA TTCGGCCCAA ACCACCCCAC TCGATCAGTA TGAAGCTCAA TTGATGGCTA TTTGGCAGCG AGTGTTAGGA CTGAAGGCCG TTGATCGCCA TGCCAATTTC TTTAGCCTTG GTGGCGATTC GATTTTGGTG ATGCAGGTAG TCGGCATTGC ACGGCAGCAT GGCCTGATTC TAACCCCACG CTTGTTGTTC CAAAACCAAA CGATTGCCAG CTTGGCCCAA GCGATTCGCC AGCAAACCCA AGCTAAGCCC GCGCTTGATC CCTTGAGTTT GCAGGGCATT GTGCCGCTTA GCCCAATGCA ACATTGGCTA TTTGAGCGCC AACTGGCCCA GCCCGCCCAT GTCAACCAAA GTATTGTGCT CAAGTTACAA ACGGGGTTAG CGACCGAACA AATACAGGCA GCGCTTGATC AATTAGTGCG TTTGCACCCA AGTTTGCGTT TGATCTTTAC CCAAACTGCG GCTTGGCAAC AACGCTATGA GCCAGCCGCC AGCGTGCCCT TGCGCGAATT ACAGCAACCA ACATTAAGCC AGCAACAAGT CTGTGATGCC GAATTGCAAG CTTCATTCGA TTTAGCTCAA GTGCCGTTGT TACGAGCCTC GTTGTGGCGT GGTATCGACC ACGATCAATT GCTGTTGGTG GCGCACCATA GCATTATCGA CGGGGTTTCG TGGCGAATTG TGCTCGAAGA TTTGGCTTTG TTGCTCAACC AACAAGCTGT GCCAGCGGCA ACCACGCCAT TTAGTGAGTG GGCCGAATAT CAGGTGCAGC AAGCCCAAAC CCCGCAATTG CTGAGCCAAC TCGCCTATTG GCGCTCGACG ATTGAAGCCA TCACGCCGAT TCCTCAGCTA GCTCAAGCGG GGTTGGTTGG CGAAGCACAG CGCTTTCAAA CCAAGCTTAA TCCTGAATTG ACCGAGCAAC TGCTGCATCA CGCACCTGAG CGCAGCCGTA CCAGCGTGGC CGAGTTGCTG ATCACTGGCT TGGCGATAGC TTTCCAGCGT TGGTCAAATC TACAACAATT AGTGCTTGAT ATTGAAAGCC ATGGTCGCGA ATCGCTTGAC CCTGAGCATG ATTTCAGCCG GAGTTTGGGC TGGTTCACCA GTTTGTACCC AGTGCGCTTG GATTTCCCCA CTACCAATGA GCCAAACCAG TGGATTAAGC AGATCAAAGA AAGCTTGCGG GCAGTGCCGC AAGCCGGAGC GGGCTATGGC ATGTTGCGCT ATTTGCACGC TGATCCAGCG ATTCGCGCGA GCCTTGTGCC AACTCACGCC CCAGCAATTG CCTTCAACTA CCTTGGTCAG CTCGATAACC AACAAACTTT AGCACCATTC CAAGGGCTAA ATTTGGAGTT TGCCAGCCAA ACCTTGGCTC CCACCAACCA ACGCAGTCAT GCCTTAGAGC TTAATTGTTA TAGCACTGAT GGCTGTTTGG TGTTCGATTG GGAATGTCAT CAGACAGCGC GGGCAGCCGT TGAACACTTG GCCGAGCAGT ATCAAATAGC CTTAGCCGAG TTGTTGCAAG TACCAACCAC AACTGCTAGC TTGGCTCCCT CGGATTTTCC AGCAGCTCGC GTCAAGGCCA ACGATCTCGA TCGATTGTTG GCTCGTTTGA AAGCAAAGGG GCAATAG
|
Protein sequence | MTMNEREVFA FPMSFAQQRL WFLEQLQPNS ALYHIASLLE IQGSLDLAAL QQSINQIVVR HETLRTTFGM VDQTPMQLIS SELTLTPVVH NLQALEPEQR WFVALEQAKA SCLVPFDLSQ GPLVRLELFQ LGPEHCLMTV VLHHIIADGW SMELFIQELV SSYEAYRAGF HPQLAPLALQ YADYSEWQRE WLASPRQQQQ LDFWQQQLAD APKRLELATD HPRPAQQSFA GSTLSFGIPS QQTSQLRSLA QQTQTTPFML ALAVFASLLS RYSRQDQVLI GSPIANRTTA EAQPLIGFFV NTVVFKVQLS PNLDLLSLVE QVREQSLAVY ANQDVPFEQV VQQLQLERNL SHTPIFQVML AYQNVPSQQL SIANLTIKQL PLDLGYAKFD LTLFIEETPA GLVGRLEYNR DLFEPATIAR LRDHFLRLLG HALAQPTQPL AQISILSAAE CQQLLVDWNQ TQQPFPDQLG LQHLVAQQVQ RTPNAPAMRW NNQIICYTEL EQRANQLAHL LLQRGVTQGS IVGVYATRCP EMIISLLAIL KAGAAYLPLD PAYPAERLHY LVADSAASLI VQASHQALPT LVSTAETLDV VAEAETLASL PTTAPMVDFD PQQLAYVIYT SGSTGKPKGV LIQHQGVVNY LHWAIHYYPF EQGAGAPLAS SLAFDATITA LWGPLCTGKT IDLLPEQDEL EVLAQRLSSE DYSVLKITPA HMEALSQLVA PDQIGSSKAF VIGGEALLQQ HVAFWQTNAP NLRLINEYGP TETVVGCVIY QAQAAPSEWA AVPIGRPIAN TQLYVLDPAG LPVPIGVPGE LYIAGLGVGR GYHGRPELTA ERFVRLEQLA GVQAELARCQ QPQPAFERLY RSGDLVRYLP DGNLEYLGRI DQQVKLHGFR IELGEIEATL ASHPTVHAAV AMIREDRPGH KRLVAYVVAE PTANQDTSIV LTHVAQQLPH YMLPSVVIWL DSLPLTPNGK VDRQALPAPE INQTALDSAQ TTPLDQYEAQ LMAIWQRVLG LKAVDRHANF FSLGGDSILV MQVVGIARQH GLILTPRLLF QNQTIASLAQ AIRQQTQAKP ALDPLSLQGI VPLSPMQHWL FERQLAQPAH VNQSIVLKLQ TGLATEQIQA ALDQLVRLHP SLRLIFTQTA AWQQRYEPAA SVPLRELQQP TLSQQQVCDA ELQASFDLAQ VPLLRASLWR GIDHDQLLLV AHHSIIDGVS WRIVLEDLAL LLNQQAVPAA TTPFSEWAEY QVQQAQTPQL LSQLAYWRST IEAITPIPQL AQAGLVGEAQ RFQTKLNPEL TEQLLHHAPE RSRTSVAELL ITGLAIAFQR WSNLQQLVLD IESHGRESLD PEHDFSRSLG WFTSLYPVRL DFPTTNEPNQ WIKQIKESLR AVPQAGAGYG MLRYLHADPA IRASLVPTHA PAIAFNYLGQ LDNQQTLAPF QGLNLEFASQ TLAPTNQRSH ALELNCYSTD GCLVFDWECH QTARAAVEHL AEQYQIALAE LLQVPTTTAS LAPSDFPAAR VKANDLDRLL ARLKAKGQ
|
| |