Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2092 |
Symbol | |
ID | 5733980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2613140 |
End bp | 2618125 |
Gene Length | 4986 bp |
Protein Length | 1661 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279233 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544860 |
Protein GI | 159898613 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720 [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAACTG TGTTAGAGCC AATAACGCCT GATGCGCTCA GCGAAACGTT GCGCCAGTAT TTGCAAGCGT ATCTGCCGGA TTATATGCTG CCAGCGGCGT TTGTGCCGCT TGAACAGATT CCGCGCTTGC CGAATGGCAA AATTGACCGT GCGGCCTTGC CAATGGTCGA TTTTGCAGCC CAGCATGAGC AGCAAACCCA AACGGCCCCA CGTAACCCGC TTGAGCAGCA GCTCGCGGCA ATTTGGCAGC AAACCTTGCA AGTACCAAGC GTGGGCATTC ACGATAACTT TTTTCAATTG GGCGGCGATT CAATTTTGAG CATTCAGGTG ATTGCTCGTG CTAATCGAGC TGGCATACGG CTGACTACGC GCCAATTGTT TGAGCAGCCA ACGATTGCCC AACTAGCAAG CTTGGCGCAA ACCACAACAA TAGATGTTGC TCATAGCGAA TTGTCTGCTG GACAGATCGT TCCCCTAACA CCAATTCAGC GCTGGTTGCT CGCTGATCCT ACTGATCCCA GCCAGTTTAA TCAGGCACTC TTTTTGCAAT TTACTCAGGC TATTGATTCG AATTTGGTGG CATCGGCGGT TGAGCACGTT GCACAGCTAC ATGTCAGTTT GCGTTTGCGC TATCGCCGTA CTGCTGATGG TTGGCAACAA TTTGTGGCTG CTGCTGATGC ACCACTGGTT GAATTTGAGC AAATTAATGC TCAAAACCTC AATCCAACGG AGCTGGCTGA GCTTTTCGAG GCTACAACTG AACGACTGCA ACGGCCCTTT GATTTGGCGA AGGCTGCACT GTGGCGAATT GCCTACATCG CCATGCCTGA TGCTCAGCCC GCCCGCTTGC TGTTGGTCAT GCATCACTTG GTGGTTGATG GGGTTTCGTG GCGAATTATC ATTCAAGATC TCGCCCATGC CTTGCAAAAC CAGCCATTAA CCAAACCAGC TGTCGGTTTT GCCCAATGGG CTATTGCTTT AGAACGCTAT GCCCACCGCA CCGAATTGCA GCAGCAACGT GCATATTGGC TAGCCCAAAC CAGCACTGAT CCCTTGCCAG TCGATGATAT AACTGGACAT AATGATTATG CGAGTGTTGC GACAATCACT AAGCAATTGA GCCAGGCCCA TACCACAGCC TTAATTCATC AAGCATCCCA GGCCTATCAA ACCCAGATCA ACGAACTATT GCTAGCAGCC TTGACCCAAA CCATCACCGC TTGGAGCGGC CACGCCGATG TGGTGCTGCA ACTCGAAGGC CATGGTCGCG AGGAGCTGGA TCAGCCGCTC GATCTTTCGC AAACTGTTGG GTGGTTTACT ACGCTGTTTC CAATAAAGTT AAGTCTGCCA CAAAAACTAG GCTCCAAAAA CCTGATCAAA CAGATTAAAG AACAGATACG TGCAGTTCCC CAACGCGGGT TTGGCTATGG CTTGTTACGC TCAGCTGATG CGGCGTTGCA AGCTATGCCA ACCCCAGCGA TCAGCTTTAA CTATTTTGGT CAACTTGATC AAACGCTCCA GTCAAGCAAA TTATTCAGTG CTGCGCCCGA ATCGACCGGA AGCGCCGTGT TGCCGCAGCG GCGGCGTGAA CAACTGCTAG CGATCAACTG CCAAGTTTTG GCGGGCCAAT TACAGATCGA ATGGTCGTAT AGCCAACATC TGCATTCAGC AGCAACAATC GAGCGCTTGG CTGAAGCATA TTGTGCTGAT TTAGTTGGGT TGATTGAGCA TTGTTGTCAA CAAACCCAAC CAAGCTTTAC GCCTGCCGAT TTTCCTTTGG CGCAAATTAC CCAAGCCCAA CTTGACCAGA TTGAGCAAAT ATATCCGCCG TTTGAGCAAT TGTACCCTCT TTCATCGCTG CAACAGGGCA TTTTGTTCCA TCGCTTGTAT GCGCCGGACG CTGGTGATTA TATTACCCAA ATGCAGTTTG AAATCACGGG TCAGCTCAAT CATGCGGCAT TTAGTGCTGC TTGGAATCGA ACAATTGGCC ATTATAGAAT GCTGCGAACC GCGTTTGTTT GGCAAGATTT AGCTGAGCCA CTGCAATTGG TGCTGCGCCA AGCTGTAATC ACGATCGATT TTCAACAGTT ACCCATGAAT AGCCTTGAAC AAGAACAGGT GCTTGAAGCC TATTTGCAGG CTGATCGCAC ACGCGGCTTT GAGCCAACCC AAGCGCCATT GATGCGTGTT GCTTTGTTCG AGCGTGCACC GCAACGCTAT TGCTGTATCT GGACGAATCA TCATTTGATC ATCGATGGCT GGAGCTTACC GCTGATTCTT GATAGCTTGT TTCGCTATTA TCAGGCTGAA ATCAATCAGC AACCACTGGA GCTTGCCCCA GAAATTCCTT ATCAACGCTA TATTCAATGG CTGGCTCAAC ATAATGATCA GCAAGCTACA GCATTTTGGC GTGAATTATT GCGCGGTTTT ACTGCTCCAA CAAGCTTGGC GCTTGAACGG TTTGGCTCGA CCCACGCTGA ACGACACTAT AGTGCGAGTT GGCTCCAGCT TGATTCTGCT ATAACTCAGC AGCTTCAACA GTTTGCCCGC GATCATGGCC TAACCGTGAA TAGCCTGTTG CAGGCGGCGT GGGCCTTGGT TTTATCACGC TACAGTCATC AAACTGATAT CGTGTTTGGT ACAACGACGG CTGGCCGCCC AACCGATTTG GCCGGAGTTG AGCAGATTGT CGGGATGTTC GTGAATACGC TGCCCACGAG AGTTAAGTGG GATTTGCAGC AGCCAGTGTT GGATTGGCTA CAAGCCTTGC AGGCTCAAGA GAGTGCTGTG CGCAGTTACG AGGCCAGTTC GCTCATTGAA ATTCAGGCAT GCAGCGAGTT GCCACGCAAT AGCCCGCTGT TTGAAAGTAT CTTGGTCTTT GAAAACTATC CGGTGAGCAG TAGCGATTTA ACTGGTTTGG GCGATTTAGA GTTACGTTTG GTTCCTTCGC GCGAGCAAAC CAACTATCCT TTGACCTTAG TTGCTGTGCC AGGTGATGGG TTGGCCTTCA AGTTGATGTA TCAACAAGGC TACATCGACC AACTTACTAG CCAGCGCATG CTCGATTATC TCCAACAAGG CTTAGCCGCA ATGCTAGCGC AGCCCAAGGC AAGGCTTGGC CAGCTCAATA TTGGGCATCC CAGCGAAATC CAAGCCTTGG CTGATTGGAA CGCAACCGCA GCACCGCGCC AAACCAGCTC GTTGCTTGAA TGTTTTTACC AGCAGGTCGC AGCTCAGCCA ACAAGCATCG CCGTCGCATG GCGTGAACAA CGCTGGAGTT ACTTCGATTT AGCACAGGCG AGCCAAGCAA TTGCCGGCTA TTTGCGCGAT CAAGGGGTGC AACGCCAGCA AATTATCGGC CTACGAGCTG AGCGCAACCC GCAGTTTGTC GCAGCGTTGT TGGCGATCTT GCAATTGGGC GCGGTGTATT TGCCGATTGA TCCTCAGCAT CCAGTGCAGC GCCAACAGCA ACTTGCTCAG CATGTCGATT GGTTATTGAC TGATGCCTTG GCTGAAGCGC AGCCTCAGCA ACTCGATTTG GCTCAGGCGT TGGGCTACGA TCAACCTGCA TCCGACTTTG TGCAACTCCA TGATCGAGAT TTAGCCTATG TGCTGTTTAC CTCTGGCTCG ACCGGCACAC CCAAGGGCGT GATGATCGAC CATGCAGGGA TGCTGAATCA TATTGACGTA ATGATCGAGC GTTTGGCGCT AACCCAAACC GATTGTATTG CCCAAAGCGC TGCCCAATCG TTTGATATTT CGGTTTGGCA GTTGCTGACA GCGCTCGTGG TTGGCGCTCG GATGCAGATC ATTGATGATC AAACGATGCG CGATCCGCAG GCCTTGTTAG CTAAATTGGC AGCGGCTAAC GTTTCAATCT TCGAGCCAGT GCCCAGCCTG ATTCAAGCCC TACTCGAAAC GATTGCAAGC CTTGAGCAAA CCCCAAGTTT GGCTGCTTTG CGTTGGGTGC TGCCAACTGG CGAACATTTG CCGCGTGAGC TAGCCCAACA ATGGTTTGCC CACTATCCTT ATATTCCCTT GCTGAATGCC TATGGTCCGG CTGAATGCGC CGATGATGTG ACGCTTTGGC CGATTGCCAG TGCTGTTGAG CTACCTCAAA ACGCCATTCC AATTGGCCGA CCAGTAGCCA ATGTGCGGGC TTATGTACTT GATGCCAGTT TGCGGCCAGT ACCGATCGGC GTGGCAGGCG AGTTGTATAT CGCTGGAATT GCGGTTGGTT GGGGTTATTT GGCCGATCCC CAACGCACCG CTAGCCTGTT TTTGCCCGAT CCTTGGGGCG AACCAGGGGC GCGAATGTAT CGCACTGGCG ATTTAGCGCG TTACAACCAA GCAGGTGTGT TGAGCTTCTT GGGGCGTAGC GATCAGCAAG TCAAAATTCG CGGCTTCCGA ATTGAGCTAG GCGAGATCGA AGCCTGTTTA TTGCAGCATC CGGCGCTGCA TTCGGTCGCA GTTGCTGTGG TTGGCGTAGC TGAGCAAGCA CGTTTGATCG CCTATCTGGT GGCGAAAGCT AAACCAGTCT CCGATCAATT ACTACGTGAT TTTGTCCAAG CGCGGTTGCC GCATTATCTG CAACCAAGTG GCTATTGTTG GTTGAGCCAA TTGCCGCTCA ATGCCAATGG TAAATTAGAC CGTCAGCGCT TGCCAATTCC CCAGCTGCAA ACCGCTGAAC GACTGATTAT CGCTCCCCAA AACGCTGATC AAGCCAAATT GGCCGAGCTT TGGGCGGCAA TCTTGCAACG TGAGCAAGTT GGAATTAATC AGAATTTCTT TGAACTTGGC GGTCATTCAT TATTGGCAAC CCGCTTGGTC AGCCAAATTC GCCAGTATTG GCAACTCGAT TTGCCAATTC GGAGCGTATT TGAGGCTCCG ACAATCGAAC AACTGGCTGA TGTGCTTGAT CTCCTACGCT GGGCGCAACA GGCTAATCAA GCTCCAGCTC AAGCCCGCGA ACAAGGAGCA ATTTAA
|
Protein sequence | MPTVLEPITP DALSETLRQY LQAYLPDYML PAAFVPLEQI PRLPNGKIDR AALPMVDFAA QHEQQTQTAP RNPLEQQLAA IWQQTLQVPS VGIHDNFFQL GGDSILSIQV IARANRAGIR LTTRQLFEQP TIAQLASLAQ TTTIDVAHSE LSAGQIVPLT PIQRWLLADP TDPSQFNQAL FLQFTQAIDS NLVASAVEHV AQLHVSLRLR YRRTADGWQQ FVAAADAPLV EFEQINAQNL NPTELAELFE ATTERLQRPF DLAKAALWRI AYIAMPDAQP ARLLLVMHHL VVDGVSWRII IQDLAHALQN QPLTKPAVGF AQWAIALERY AHRTELQQQR AYWLAQTSTD PLPVDDITGH NDYASVATIT KQLSQAHTTA LIHQASQAYQ TQINELLLAA LTQTITAWSG HADVVLQLEG HGREELDQPL DLSQTVGWFT TLFPIKLSLP QKLGSKNLIK QIKEQIRAVP QRGFGYGLLR SADAALQAMP TPAISFNYFG QLDQTLQSSK LFSAAPESTG SAVLPQRRRE QLLAINCQVL AGQLQIEWSY SQHLHSAATI ERLAEAYCAD LVGLIEHCCQ QTQPSFTPAD FPLAQITQAQ LDQIEQIYPP FEQLYPLSSL QQGILFHRLY APDAGDYITQ MQFEITGQLN HAAFSAAWNR TIGHYRMLRT AFVWQDLAEP LQLVLRQAVI TIDFQQLPMN SLEQEQVLEA YLQADRTRGF EPTQAPLMRV ALFERAPQRY CCIWTNHHLI IDGWSLPLIL DSLFRYYQAE INQQPLELAP EIPYQRYIQW LAQHNDQQAT AFWRELLRGF TAPTSLALER FGSTHAERHY SASWLQLDSA ITQQLQQFAR DHGLTVNSLL QAAWALVLSR YSHQTDIVFG TTTAGRPTDL AGVEQIVGMF VNTLPTRVKW DLQQPVLDWL QALQAQESAV RSYEASSLIE IQACSELPRN SPLFESILVF ENYPVSSSDL TGLGDLELRL VPSREQTNYP LTLVAVPGDG LAFKLMYQQG YIDQLTSQRM LDYLQQGLAA MLAQPKARLG QLNIGHPSEI QALADWNATA APRQTSSLLE CFYQQVAAQP TSIAVAWREQ RWSYFDLAQA SQAIAGYLRD QGVQRQQIIG LRAERNPQFV AALLAILQLG AVYLPIDPQH PVQRQQQLAQ HVDWLLTDAL AEAQPQQLDL AQALGYDQPA SDFVQLHDRD LAYVLFTSGS TGTPKGVMID HAGMLNHIDV MIERLALTQT DCIAQSAAQS FDISVWQLLT ALVVGARMQI IDDQTMRDPQ ALLAKLAAAN VSIFEPVPSL IQALLETIAS LEQTPSLAAL RWVLPTGEHL PRELAQQWFA HYPYIPLLNA YGPAECADDV TLWPIASAVE LPQNAIPIGR PVANVRAYVL DASLRPVPIG VAGELYIAGI AVGWGYLADP QRTASLFLPD PWGEPGARMY RTGDLARYNQ AGVLSFLGRS DQQVKIRGFR IELGEIEACL LQHPALHSVA VAVVGVAEQA RLIAYLVAKA KPVSDQLLRD FVQARLPHYL QPSGYCWLSQ LPLNANGKLD RQRLPIPQLQ TAERLIIAPQ NADQAKLAEL WAAILQREQV GINQNFFELG GHSLLATRLV SQIRQYWQLD LPIRSVFEAP TIEQLADVLD LLRWAQQANQ APAQAREQGA I
|
| |