Gene Information |
Locus tag | Haur_1875 |
Symbol | |
ID | 5733764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2226333 |
End bp | 2232305 |
Gene Length | 5973 bp |
Protein Length | 1990 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279019 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544646 |
Protein GI | 159898399 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720; [TIGR01733] amino acid adenylation domain |
|
|
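The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the function name `check_cds` and its dictionary keys are hypothetical and not part of the database record.

```python
def check_cds(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and a single terminal
    stop codon that is excluded from the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span, 3)   # whole codons and leftover bases
    return {
        "span_bp": span,
        "length_matches": span == gene_length_bp,
        "in_frame": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information table for Haur_1875.
report = check_cds(2226333, 2232305, 5973, 1990)
print(report)
```

Under these assumptions the span works out to 5973 bp, which is 1991 codons, or 1990 amino acids plus one stop codon.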
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAG ATATTGAGGA CATCTACCCA CTTTCGCCGT TACAACAAGG GTTACTCTTT CATAGCCTGT ATGACCCAGA TTCGGGAGCC TATTTTGAGC AATTTACCTG TCAATTAAGG GGAATCCTTC AGCTTGACGC GTTTCGACGG GCTTGGCAAC ACGTGCTTGA GCGCCATGCT GCTTTGCGCA CAGTCTTCGT ATGGGAAGAT TTGGCAGAAC CGTTACAAGT GGTCTATCGG GCGGTGCAGT TGCCCTTGGA TTACCACGAT TGGCGCGAAT TAGATCCGCA GACCCATACG GCCCAGCTTG AAGCCTATTT CCAAACTGAG CGCCAACGTG GGTTTGATCT TAGCCAAGCG CCGTTATTAC GGGCTAGTTT AATTCAGTTG AGCGATGATT GCTATCAATT TGTTTGGTGT AATCATCACT TATTGCTCGA TGGCTGGAGC ATGGCGCTGT TATTAAAGGA AGCATTTAGT TACTACAGCG CTTTCTGCGA GGGTCGGGCA TTGCGTTTAG CCCAAACGCG ACCGTATCGC GATTATATTG CATGGCTACA AAAGCAAGAT CAAGCCAAGG CCGAGCAATT TTGGCGAGCC AACTTAAGCC CTATTTCAGC CCCAACGCCG TTGGTGATCG AGCGCCCGAA TTATGCCTTG CTTGGGCCAG AACAACCATG CGAACAGCGG ATTGTGCTCG ATCTTGCGGC AACGGAAACG CTACAGATCA TGGCCCGCCA GCATAAACTC ACGATCAATA CCATCTTGCA AGGAGCTTGG GCCTTATTGC TTGGGCGGTA TAGTGGCGAA CGCACGATCG TTTTTGGGAG TCCAGTTTCC AGTCGTCCTG CCCAACTGGC TGGTAGCAAT GCGATGGTTG GTTTATTTAT CAATACCATC CCTGTGTGCA TAACCATCAA ACCTCAGGCA GCGGTCAGTG AATGGCTCCA AGCATTACAA CAACAGCAGG TCGAGGCCCA ACAATACCAC TATACCGGCT TGAATCAGCT TCAAACGTGG AGCGCAGTGC CACGTGGTAT ACCACTGTTT GAGAGTATTT TGGTGTTTGA AAATTACCCC TTGGCCGCAT GGCAACAATC GGGCAATGCA ACCTTGCAAC TGCAAGATGT GCGCTTTATT GAGCAAACCA ATTATCCACT TTCAATTGAG GCAGTGCTCG CACCAAATCT GGTGATTCAA GTGTTTTATG ATCAACGGCG CTTTGAGCCA GCGACGATCA CACGCTTGTT GGAGCATCTT CAAAATCTAT TGCTAAGCTT TGCCACCGCG CCACAAGCTC GTTTAGCCTC AATTGATCTA TTGACCGCTG CTGAACGACA GGTGATGCTG CGGGACTGGA ATACGACGAA TGTGCCGTTA CCTAGCTCAA TCTATTTACA TAAGATTGTT GCGGCCCATG CCCAAGCTAC TCCAGATGCA GTTGCATTGC GATTTGGTCA ACAACACCTG AGTTATGGCG AGTTGAATCG GCGGGCCAAT CAATTGGCGG CCTATCTTCG GGCGCAGGGA GTTCCGCCTG GTAGCTTGGT TGGGCTGTGT GTTGAGCGTT CGCTTGAGTT GGTGCTTGGA ATTTTGGCAA TTCTTAAAGC CGGGGCCGCC TATGTACCGC TTGATCCGCG CTATCCGCTT GAGCGATTGC ACTATATGCT CAACGATAGC CAAGCTCAGG TTTTGCTGAC TCAGCATTCG CTTAGCCAAC AAATTCGCAC TGAGCAACAA CGGGTAATCT ATCTTGATCA CGATTGGCCA ACGATTGCTC AATATCCCTC GTTTGAGCTA 
GCCGTACCAC TTTGGCCTGA GAGTTTGGTC TATCTGATTT ACACTTCTGG CTCAACTGGG CGACCTAAGG CAGTGCCAAT TACCCATCGG GGTTTGGCTA ACTTGGCCTA TGCCCAAATT CAAGCCTTTG AACTTGATGC ACAGCAGCGG ATTTTGCAAT TTGCCTCGTT GAGTTTCGAT GCCTCGATTT TTGAAATCGT TATGGCACTC TGGTCGGGGG CAACCTTAGT GCTGGCTGAT CAAGAGACTT TGTTGCCTGG CCCAAGTTTG ATTGAATTAT TGCAACAGCA AGCGATTACT CATATTACTG TGCCGCCGTC GGCATTAAAA GTGCTGCCCG AGGCAGAATT ACCAGCATTA TCTACGGTCA TCGTGGCGGG CGAGGCTTGT CCGGCTGAGT TGGTGGCGCG TTGGGGTTTG GATCGACGCT TTTTCAATGC CTATGGCCCA ACCGAAGCGA CAGTTTGGTC GAGCCTCGCC TTGTGCGACG ATCCAAACCA AAAACCCTCA ATTGGCCGAC CAATTGCCAA TACTCAACTA TATATTCTTG ATCAATACCT GCAACCTGTG CCAGTTGGGA TTGCTGGCGA GTTGTATATT GCTGGGCCTG GTTTGGCATG GGGCTATCTC AATCGGCCTG AATTAACTGC CCAGATGTTT GTGCCAAATC CCTTTAGTGC TGAGCCTGGC CAACGGCTGT ATCGTTCGGG TGATTTGGCT TGTTTCTTAC CCGATGGCTC GATTAACCAC CTTGGGCGGG TTGATCATCA GGTTAAAATT CGGGGCTTTC GGATTGAAAC AGGCGAGATT GAGCAATGCT TGTGTGAGCA TCCTTTGGTT CATGAAGCGG TGGCGATTGC CCGCGATGAG CCAAATGGCC AGAAACGACT GGTGGCCTAT GTGGTTGCCA CGCCTGATAA TCAACCAAGC AGCGCCGAAT TGCGCACGTT TTTGCAAACG CGCTTACCAG AACATATGCT ACCAGCGGTA TTTGTGCTGC TGGCTAGCTT ACCGCTAACT CCCAATGGCA AACTTGATCG CCATGCCTTG CCTGCACCGA AAACGACGCG CCATGCTGAA CAAGCCTTGT TTGATGCACC CCAAACCGCC AATCAACAAA TCTTGGCTGA GATTTGGGCC GATGTTTTGG GGCTAGCACA GGTTGGGATT CACGATAATT TCTTCGAGTT GGGCGGCGAT TCAATTATTT GTATTCAAAT TGTGGCCCGC GCCAACCAAG CTGGTTTGCG GCTAACCCCC AAGCAGGTTT TTGAACAACG CACGATTGCC AATTTGGCGA CCGTGGTTGG CACTGGCCCC CAAATTCAGG CTGAACAAGG TTTAGTTAGC GGAGCCGTAC CATTAACCCC GATTCAACAG TGGTTTTTTG CGCAAAACTT GCCAAATTTT CACCATTGGA ATCAATCGGC CTTGCTCGAA GTCCGCCAGC CGCTTGATCT AACCTTGCTT AGCCAAGTGT TGTATCAATT GCACATTCAG CACGATGCAC TGCGCTTACG CTTTCAGTTC GGTACAGATG GGTGGCAGCA AATAAACCTC GACCATGCTG CCACGCCCAG CATTAGCTTG ATTGATTTAG CTGATTTGCC GCTTGAACAA CAAAGCGTTG CAATTACTGA GCATGCTAAT CAGCTGCAAG CGTGTTTGAA TCTTAGCACT GGACCAGTGT TACAGGTTGC TTTATTCAAT TTGGGAGCCG ATCGGTCTGG ACGCTTGTTG GTGGTGGCTC ATCACTTAAT TTTCGATGGG GTTTCGTGGC GGATCTTTTT TGAAGATTTA GCGACGGCCT 
ACCAACAAAT TGCTCAGGCC AAGCCGATTC AGCTGCCTGC GAAAACCAGC TCATACAAGG CTTGGGCCGA GCGATTGGTT GAGTATAGTC AATCAACAAC CCTACAAGCC GAATTAACCT ATTGGAATCA GCAAATTGGC GAGTTGCCAA GCTTGCCGAT TGATTTCCCC GAGGCATTGG CTGACAATAG CGAAGCCTCG CAGGCCTTGG TGACGGTCGC CCTTGATGCG CCAACGACTG CCTTATTGCT CCACGAGGTG CCCAAAGCCT ATCATACCCA GATCAACGAT ATATTGTTAA CAGCCTTAGC CCGCTGTTTG AGTCAATGGA GCGGCCAAGC TGCCCTGCTG ATCGATTTGG AAAGCCATGG CCGCGAAGAT CTGTTTGACG ATCTTGATCT ATCGCGGACG ATTGGCTGGT TTACAGCAAT TGCGCCCTTG CGCTTAACCC TCGCAGAAAG CGGTGAACTT GGTGCTGATC TTCAATCAAT CAAAGAGCAG CTTCGTCAAG TTCCACAGCA TGGTGTTGGT TATGGTATTT TGCGCTATCT TGGGCAACAA CCGATTCAGG CTCAGCCACA GGTTGGCTTT AATTATCTTG GTCAATTTGG CTATGGCTTG AGTGCTGATT CGCCGTTGGC ATGGGCCTAC GAATCGAGCG GAGCCGACCA CGACCCAGCT GGGCTGCGAC CACACCTGCT CGAAGTGGGC GGCAGTATTG TTGATGCCCA ATTAACGATC CAATGGATGT ATAGTACCAA TCTGTATCGC TCCACGACGA TTGAGCAATT GGCGCATAGC TTGATGCACG AACTACGGGC AATCATTGCG CATTGTTTGC AACCAGATGT TGGTGGCTAT ACGCCTTCAG ATTTTCCTTT GGCCACATTG CCAGCAGCAG ATTTGGCTCA GTTGAATGCG CAATATCGCC AAATCGACGA TCTGTATCCG TTGACTCCAA CCCAACAAGG TATGCTCTTT CATGCCTTAT ATGAGCCTGA ATCGACTGTC TACTTTATGC AGATTAGTTG GCTCTTTGAG GGCAAACTTG ATCTTGCGGC GTTTCAAGCT GCTTGGAATC ACACCCTCAA TCAGCATACA ATTTTGCGCA GTTGCTTTGT CTGGCAAGGC TTAAGCCAAG CCTATCAGTT GGTGCATCCA ACCGTGGAGA TGCCGTGGGA GTATCTGGAT TGGCGCGAGC TTGAACCTGA GCAACAAGCG ATTAATCTGG CAGGGTTACT TGAGGCCGAT AAAACTAAGG TTTTTGATCT CTCCCAAGCG CCGTTGATGC GGGTTACGTT GGTGCACTTA GCTGAGCATA GCTACCATTT TATTTGGAGC CAACACCATA TTTTGCTTGA TGGCTGGTGT ACCAACATTC TGCTCAAAGA GGTGTTTCGC GCCTACGAGG CCTTGGTGCA GGGTTTGCCA ATTCCGCTCA GTCAGCCAGC AATTCGGCCT TATCGTGAGT ATATTGCTTG GCTGCAACGC CAAGATTTAG CCCAAGCCGA GGCCTATTGG CGTAAACGAT TGCAGGGCTT TGCTAAAACT ACACCACTGC CACCAGCCAG CGGAGCCCAA CAAGCTGGCG TTGATTACGC TGTCCAGAAG TTGCCGCTTG ATCCAGCGCT CACAACCGCG ATCTATACGC TGCTGCGTCA ACATCAACTG ACGATGAACA CGCTGTTGCA AGGGCTTTGG GCCTGTGTTT TGGCGCATTA CAGTGGCCAG CATGATCTTG TTTTTGGCAG CACGGTTTCT GGCCGTCCAG TCGATTTGGC TGGAGCTGAG AATATGTTGG GATTATTTAT 
CAACACCCTG CCGGTACGAG TTCGCATCCA ACCAACCTTG TCAATCATTG AATGGTTACA GGATGTGCAA GCTCAGCAGG TTGAAATGCG TCAATATGAA TATACACCAG TGGCGCAGGT TCAGCGCTGG AGCGAATTGC CGCCCCGTCA ACCATTATTT GAAAGCGCTG TGGTGTTTGA AAATCTGCCG ATGGATAGTA GCAATCAGGG TCAGTTTAAT GACCTAACGA TGAGCAATAT TCAATCGTTT ATTCAAAATA ACTTCCCATT GACGATTCGC GGTGCGCCGA GTGCCACAAC CTTCGAGTTG CATGTGCTCT ACGATCGCCA GCGTTTTGCC ACAACCACCG TTTTAGCGTT GCTAGGCCAA CTTGAAGCCT TGTTCAAGGC CGTGCAACAC CAACCAAGCG CCTCATTGGC CGATTTGGCC CAGCGATTAG AGGATTTTGA TCACCATAAC CAAAAAGCGC AAGCTCAACA GAGCGAAACC AGCAGTCTGC AAAAACTAAA ACACGTCAAA CGTAAGGCTA TCCGTGGGCA ACAATCTGAA TAA
|
Protein sequence | MNADIEDIYP LSPLQQGLLF HSLYDPDSGA YFEQFTCQLR GILQLDAFRR AWQHVLERHA ALRTVFVWED LAEPLQVVYR AVQLPLDYHD WRELDPQTHT AQLEAYFQTE RQRGFDLSQA PLLRASLIQL SDDCYQFVWC NHHLLLDGWS MALLLKEAFS YYSAFCEGRA LRLAQTRPYR DYIAWLQKQD QAKAEQFWRA NLSPISAPTP LVIERPNYAL LGPEQPCEQR IVLDLAATET LQIMARQHKL TINTILQGAW ALLLGRYSGE RTIVFGSPVS SRPAQLAGSN AMVGLFINTI PVCITIKPQA AVSEWLQALQ QQQVEAQQYH YTGLNQLQTW SAVPRGIPLF ESILVFENYP LAAWQQSGNA TLQLQDVRFI EQTNYPLSIE AVLAPNLVIQ VFYDQRRFEP ATITRLLEHL QNLLLSFATA PQARLASIDL LTAAERQVML RDWNTTNVPL PSSIYLHKIV AAHAQATPDA VALRFGQQHL SYGELNRRAN QLAAYLRAQG VPPGSLVGLC VERSLELVLG ILAILKAGAA YVPLDPRYPL ERLHYMLNDS QAQVLLTQHS LSQQIRTEQQ RVIYLDHDWP TIAQYPSFEL AVPLWPESLV YLIYTSGSTG RPKAVPITHR GLANLAYAQI QAFELDAQQR ILQFASLSFD ASIFEIVMAL WSGATLVLAD QETLLPGPSL IELLQQQAIT HITVPPSALK VLPEAELPAL STVIVAGEAC PAELVARWGL DRRFFNAYGP TEATVWSSLA LCDDPNQKPS IGRPIANTQL YILDQYLQPV PVGIAGELYI AGPGLAWGYL NRPELTAQMF VPNPFSAEPG QRLYRSGDLA CFLPDGSINH LGRVDHQVKI RGFRIETGEI EQCLCEHPLV HEAVAIARDE PNGQKRLVAY VVATPDNQPS SAELRTFLQT RLPEHMLPAV FVLLASLPLT PNGKLDRHAL PAPKTTRHAE QALFDAPQTA NQQILAEIWA DVLGLAQVGI HDNFFELGGD SIICIQIVAR ANQAGLRLTP KQVFEQRTIA NLATVVGTGP QIQAEQGLVS GAVPLTPIQQ WFFAQNLPNF HHWNQSALLE VRQPLDLTLL SQVLYQLHIQ HDALRLRFQF GTDGWQQINL DHAATPSISL IDLADLPLEQ QSVAITEHAN QLQACLNLST GPVLQVALFN LGADRSGRLL VVAHHLIFDG VSWRIFFEDL ATAYQQIAQA KPIQLPAKTS SYKAWAERLV EYSQSTTLQA ELTYWNQQIG ELPSLPIDFP EALADNSEAS QALVTVALDA PTTALLLHEV PKAYHTQIND ILLTALARCL SQWSGQAALL IDLESHGRED LFDDLDLSRT IGWFTAIAPL RLTLAESGEL GADLQSIKEQ LRQVPQHGVG YGILRYLGQQ PIQAQPQVGF NYLGQFGYGL SADSPLAWAY ESSGADHDPA GLRPHLLEVG GSIVDAQLTI QWMYSTNLYR STTIEQLAHS LMHELRAIIA HCLQPDVGGY TPSDFPLATL PAADLAQLNA QYRQIDDLYP LTPTQQGMLF HALYEPESTV YFMQISWLFE GKLDLAAFQA AWNHTLNQHT ILRSCFVWQG LSQAYQLVHP TVEMPWEYLD WRELEPEQQA INLAGLLEAD KTKVFDLSQA PLMRVTLVHL AEHSYHFIWS QHHILLDGWC TNILLKEVFR AYEALVQGLP IPLSQPAIRP YREYIAWLQR QDLAQAEAYW RKRLQGFAKT TPLPPASGAQ QAGVDYAVQK LPLDPALTTA IYTLLRQHQL TMNTLLQGLW ACVLAHYSGQ HDLVFGSTVS GRPVDLAGAE 
NMLGLFINTL PVRVRIQPTL SIIEWLQDVQ AQQVEMRQYE YTPVAQVQRW SELPPRQPLF ESAVVFENLP MDSSNQGQFN DLTMSNIQSF IQNNFPLTIR GAPSATTFEL HVLYDRQRFA TTTVLALLGQ LEALFKAVQH QPSASLADLA QRLEDFDHHN QKAQAQQSET SSLQKLKHVK RKAIRGQQSE
|
| |
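The GC content field can in principle be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is supplied as text in the space-separated 10-base blocks used above; the helper name `gc_fraction` is hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full 5973 bp.

```python
def gc_fraction(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is
    removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, for illustration only.
example = "ATGAATGCAG ATATTGAGGA"
example_gc = gc_fraction(example)
print(f"{example_gc:.2f}")
```

Passing the full gene sequence to the same function would give the value to compare against the reported GC content.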