Gene Information |
Locus tag | Haur_2414 |
Symbol | |
ID | 5734295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3086167 |
End bp | 3091623 |
Gene Length | 5457 bp |
Protein Length | 1818 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279555 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001545182 |
Protein GI | 159898935 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II; [COG1020] Non-ribosomal peptide synthetase modules and related proteins; [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
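The coordinate and length fields above can be cross-checked against one another. The sketch below is only an illustration: the variable names are made up for this example, and it assumes the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single untranslated stop codon.

```python
# Values copied from the Gene Information table above.
start_bp = 3086167
end_bp = 3091623
protein_length_aa = 1818

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
expected_protein_aa = gene_length_bp // 3 - 1

print(gene_length_bp)
print(expected_protein_aa)
```

Under those assumptions the computed values agree with the Gene Length and Protein Length rows of the table.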
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATGG ATTTTCAGAG TATCCAGCGA CGTTTTGCAA CTGCTGTTGA ACAACGATCG AGCCAGGCGG CGCTGCGTTA TCACGACCAA GTAGTTTCGT ATCACGAGCT GGCTGAGCAT GCCCAACGCA TTGCCAGCGG CCTAGCCAAT CAGCAGGTTG GGGTCAATAC CAATGTCGCA ATTCAGCTAA CCAACCCAAT AGATGTTTGT AGCACCATTT TGGCCACATT GCTGTTGGGT GCACGCTATG CTTTGCTTAG CCCTAATTTA GCCAAGCTTC GGCTGCAACA GGTCTTGGCC CGCCAACAAT TTGTGCTGGT TGGCTCAGCC GCTAGCAACA ATTTAGCTGC TAACTATATC GAATTTGAGC AACTTGCTAA TAGCGAATTA GCGGAAATCA CTCCTCATTC AGCAACTGCC GAAAGCTTGA TCGGCTTGAG TTTGGCCTCG AATCCGAGTG GGTTGATTGA AGCAGGCCAA CTCAGCCAAA CCAATCTGCT GAGTTTTATC GATTTTAATC TGAGCAAAGC CAAAGTTAGT TTCCAACAAA GCCTGTGGCT CGGCGAGGAA TTCAACGATT TTAGCGCCTT TGCGAGTTTG GCAACCCTTG CTAGTGGCGG CACGCTGCGA TTTAGCACAC TTGAAACGCT CGATCACGAT CTTGATGAAC AAGCGCAAAC CTTGATGCTG ACGCTTGCAA CGTTGGGCCA ACTGTTCGCG CAACAGTCAG CATTGCCCAA AGTGCGCCAT ATTTTGAGTA GCGGCGAAGG CTTGCTCGAT GGCGAAGCGC TCAAACAACA ACTCAAACAG CAACAAACCG CTTGGCACAA CTACTATGGG TTCCCAGCCT TTCAATTGCT GACGGTGGTT GGAGCCAACA CTCAAACCCA AACAGCAAGC CAAATTCATA GCGGCAAGCC GGTACCACAT ACCCAAGCTT TGATTCTTGA CCAACACAAA CAACTCGCGC CAATCGGCTT AACTGGTGAG TTATATGTGG CTGGAGCAGG GGTTTTTGCA GGCTTTGAGC AGGCTCAATT GAATGCCGAG CGTTTTATCG CCAGCCCATT TGCTGCCGAC ACTCAACTCT ACCAAACGCG CTATCTGGCC CGTTGGCAAG ATGATGGGCG GCTGAGCATC AGCGGTAGCC TTGATTCAAC GATCGAACTG GCTGCCACCC CAATCTTGCT CCAAGAAGTA GAACGTCTGC TCGAACAGCA TCCGGCAATT GTTGAATGTT GTATCGTGCG GCGCATAACC CTGAGCAACA CCGAGCAATT AACGGGCTTT GTGGTCGCCA AACAGCGCGT TGAGCCAAGC GAGATCCTGA ATTACCTTGA AGATCAGCTT GACTGCCAAC TCGCAAACCT CGGCTTGATT CAGGTTGATC AGCTACCGCG CACCGCCGAT GGTCAGCTTG ATCGTCAACA ATTGGCCCAA ACTAACCTGC TCGATAACCG CCAAATTGCC GATTTGCAGG CTCAACTGCA ACAAGCAGCT AATGGCGCAG AGCTAGCGGT GGTCGCTCAG CCGATCTTGC CAGTTTCAAG CCCGCTGCAC ATCGACGATC TTGTGCCAAT GGTCGAAACT AGCAATTTTG GTACATCGCA ACGCACGATC AGCGAGCAAC CAGTTGAAAT GTTGTCAACT CAGGCCAAGC CAGCGTTGGC AGTTGGCCCA CCGTTGATCA AGGCGGAACA AGCGCCGTTG ACCTTAGCTG AGGCCTTAGT GCTCGCCGCC AAACACTACC CTGAACATGG CATTAGCTAT ATCGAAGCCG ATGGCAAAGC GTTGTTTCAA 
TCGTATGCCG CGCTATTGGC CGATGCTGAG GCGGTTTTGG CTGGTTTACG GGCGGCAGGT TTGAAGCATG GCCAACATGT TGTGCTGCAA TTTGCCCATA ACGAGCCATT TGTGGTGGCA TTTTGGGCTT GTATGCTTGG TGGTTTTACG GCGGTTCCCT TGGCCTTGCC CAACAGCAGC GATCCCAATA ACCCCGCGGT GAGCAAGCTC TATAACACGT GGCAAACCTT GGAGCAACCA CTGATCGTTA GTGAACAAGC CAGCTTCAGC CTGCTTCAGC GCATTTTTAA TGGCTTGGGC GTGGTAAAAC CAGCGATTCA GATCACCGAA CAGTTGCGCC AACATCAGCC TGATCAGCAG CATCAGCACC TCGCGCCGCA GGATTCGGCC TTGTTGTTGT TTACATCGGG CAGCACTGGC CTTCCCAAAG GGGTCGAATT AAGCCATCAC AATATTATTA GCCGCTCCAA GGCTAGCGCT CAACACAATC GTTTTGATCA TAATGATGTT TCATTAAATT GGATGCCGCT TGATCATGTT GGTGGGATTG TGATGTTCCA CGTTCACGAT GTGTGCTTGG GCTGCCGCCA AATTCAGGCC AAAACCGACT ATATTCTGGA AGATCCAGTG CGTTGGCTAG ATTTGCTTGA GCAGTATCGC GCCACAATTA CTTGGTCTCC TAACTTTGCC TATGCCTTAA TCAACGATCA GCACGAACGG GTCAATAGTC GCCGCCGCAA TCTTAGCTCG TTGCGTTTTA TTTTGAATGG TGGCGAGGGC ATCAATAAAC AAACTGCCTT GAATTTTCTT GGTTTGTTGC AAGCCCATGG CCTGCCCGCA ACGGCTATGC ACCCCGCTTA TGGCATGTCC GAAACCTCAT CGGGCATCAG CTCATCCGAT CAACTGGTGC TTGGGGCAAC CACAGGCTTT CACGAACTCG ACCAAGCCTC ATTAACCGGG GTGATTCAGC CAGCCAGCGC CGATAGCATT GGTGTAGCAT TCGTCGAAGT TGGCGCACCA TTGCCGGGGG TTTCACTACG AATTGTCAAC ACCAACAATC AATTGCTCAG CGAAGATCTG ATTGGCCGTT TGCAAATTCA AGGCCCGACG ATCACCGCTG GCTACTACCG CAACCCTGAG CTTAACCGCG AAGTCTTTAC CGACGATGGT TGGTTTACGA CTGGCGATTT GGCTTTTTTG CACCAAGGGC GTTTGACAAT TGCTGGCCGT GAAAAAGATG TAATCATCAT CAATGGCATC AATTATCACA ACCACGAAAT CGAAGCCTTG GTTGAAACAA TCGAAGGCGT GGAAGTTTCC TACACGGCGG CTTGTAGCGT GCCAAGCAAA CATACAGGCG GCACGGAGTC GCTGGTCATT TTTTACGTTT CAAAATCGGC TGAATTTGAT CAACAGCTAG CCCAAATTAA CCAGATTCGC GAAGTTGTCG TGCAAAAAAT TGGCATCAAC CCCAGCTATG TGCTGCCAGT TGCTAAGAGC GATATTCCCA AAACAGCAAT TGGCAAGATT CAGCGTTCGC AATTGAGCCA GCGCTTTATC AATGGCGAAT TTAGCAGCAT CACCAAGCCA ATCGATTTAG CCCTAGCCAA CCAGCAAACC TTGCCACGCT GGTTCTTTAG CAAGCAATGG CAACCTGTCA GCAAGCGCCA TAATTCGGCC TTGCTCAAGC CAAGCTATGC AATTTTCAGC GACGATACCA CGCTAGCGCG TGAGTTGATT GATGTGCTTG AACAGCACCA TCGCGATTGG GTCTTGATCA GCGCTGGCGA AACATTCAGC CAGCAGGGTC 
AACACTACAC GATTAATTTG CACGATCCTG AGCATTACCA TCAGATCGCT GCCACTCTTG CAGCCACCAA TATTCACGAT TATGTTCACT TGTATAGCTG TGATCTACCA AGCGAGATTG AGCACGTTGG CGATTTGGCC GCCGCCCAAT ATCGTGGAAC CTATAGTCTT TTGTTCCTGA CCCAAGCCTT GGCCAAGCAA AAGCTCAGTC AAGCCAGCTT AACCGTGGTT TCACAACGCA GTCATGCGAT TAACCAAAGC GATCAGGTGA TTTATGCGGC GGCTCCAATT CATGGTTTGC TCAAAACCAT GCCCTTAGAA ATTGATTGGC TCAGCTGCCA GCATGTTGAC CTTGATGCAG CAAGTGCAAC CACCAACAGC CAACAAATTT ACTATGAACT AGCCCAACCG AAGCCCAGCG CCGAGGTGGC CTATCGGGCA GGTCAACGAC TCGTTCCACA GTTAGTTGAA GCCGAAATGG CCCAATCAAG CCCAGTCGAA TCGCCGCTGG TCAAAGGTGG CCTGTATTTG GTGACTGGCG GCTTAGGCGG CATTGGCAGC CAATTTGCTC GCTGGTTGCT GCAAAACTAC AATGCTCGTT TGCTGATCAC AGGCTCAACT GAATTGCCAT TGGGCAGCGA TTGGGCCAAG CATTTGGGCA CCGATAGCAG CCTGAGCAAA CGCTTACGTG CTTATAAAGA TCTAATCGAT ATTAGCAACG ATGTGCATTA CCAAGCGGTG GATATCACCG ATTCAGCTCA ATTGGCTCGC TTAATTAATG ATGCAGAACA GCGCTGGAAT CAACCATTGG CCGGAGTTTT CCACTTTGCT GGGGCTGGCA ATTTGGCTTA TCACTGGACA GTGATGGATC ATCACTGGAT TACCAACGAA AGCCTCGCCA CCTTTGAAAT GATGTTCGCG CCCAAAGTCT ATGGCACGTG GGCCTTGCAA CGAGCCTTGA GCCAGCGCCC AGAACTGCCA ATTGTGGCAA TGTCGTCGAT TAATAGCTTT TTTGGCGGAG CAACATTTAG TGCCTACTCG GCAGCCAATA GTTTTCTCGA TAGCTTTATG CTGCATCAAC GCCAAACCTC GCATCCCAAA GCGCTCTGTT TAAATTGGAC GCAATGGGAC AATATTGGGA TGAGTCTCAA TAATCCGCAG CAAATTCGCA GCCTTTCTGC CGAGCGCGGC TACAACGTAA TTGGTTTGCA ACAGGGTTTG CAATCGCTCT TGGCGGGCAT CAGCCAAAAC CAATATCCAC TGTTAATGAT TGGCTTAAAT GCTGATAGCC CAGCGCTGCG CCAACATCTG GCAGTCAGCC AGCCCTTGCA ACAACGCATC AATCTTTATA CAACCCATCA GCATGGCCCG CTCAGCCACG ATCGCTATCG CCAACTTGCC AATAGCTATT TTGGCTCGGC CACGCTTGAA TGGTATCGCG TTGCAGAATT GCCACGCACC AGTAGCGGCG CAATCGATCT GGCTGCTTTA GGCCAACTCG ATGCCACCAA CCAACAAACC GCGCTCGATC AACCAACCAA TATCATCGAA GAACAGTTGG TCAGCATTTG GCAAGAGATT CTCGGCAAGC CCAAGATCGG CATTCACGAC AACTTTTTTG CCTTGGGCGG CCATTCGCTG CTGGCAACCC AGCTTGTTTC ACGTTTGCGC GACGGCTTTA ACCTTGAAGT ACGGTTGTAT CAACTCTTCG CCGCGCCAAC CATCGCCGAA CTTGCCAACT GCATCGCCGA GCTGCAACTT GAACAAATCG ATTCAGCCGA GATGGATGCA CTCTTAGCCG AGCTTGAGGG 
ATTATCAGAA GCCGAACTTG AAGCAGGATT AGGGTAA
|
Protein sequence | MEMDFQSIQR RFATAVEQRS SQAALRYHDQ VVSYHELAEH AQRIASGLAN QQVGVNTNVA IQLTNPIDVC STILATLLLG ARYALLSPNL AKLRLQQVLA RQQFVLVGSA ASNNLAANYI EFEQLANSEL AEITPHSATA ESLIGLSLAS NPSGLIEAGQ LSQTNLLSFI DFNLSKAKVS FQQSLWLGEE FNDFSAFASL ATLASGGTLR FSTLETLDHD LDEQAQTLML TLATLGQLFA QQSALPKVRH ILSSGEGLLD GEALKQQLKQ QQTAWHNYYG FPAFQLLTVV GANTQTQTAS QIHSGKPVPH TQALILDQHK QLAPIGLTGE LYVAGAGVFA GFEQAQLNAE RFIASPFAAD TQLYQTRYLA RWQDDGRLSI SGSLDSTIEL AATPILLQEV ERLLEQHPAI VECCIVRRIT LSNTEQLTGF VVAKQRVEPS EILNYLEDQL DCQLANLGLI QVDQLPRTAD GQLDRQQLAQ TNLLDNRQIA DLQAQLQQAA NGAELAVVAQ PILPVSSPLH IDDLVPMVET SNFGTSQRTI SEQPVEMLST QAKPALAVGP PLIKAEQAPL TLAEALVLAA KHYPEHGISY IEADGKALFQ SYAALLADAE AVLAGLRAAG LKHGQHVVLQ FAHNEPFVVA FWACMLGGFT AVPLALPNSS DPNNPAVSKL YNTWQTLEQP LIVSEQASFS LLQRIFNGLG VVKPAIQITE QLRQHQPDQQ HQHLAPQDSA LLLFTSGSTG LPKGVELSHH NIISRSKASA QHNRFDHNDV SLNWMPLDHV GGIVMFHVHD VCLGCRQIQA KTDYILEDPV RWLDLLEQYR ATITWSPNFA YALINDQHER VNSRRRNLSS LRFILNGGEG INKQTALNFL GLLQAHGLPA TAMHPAYGMS ETSSGISSSD QLVLGATTGF HELDQASLTG VIQPASADSI GVAFVEVGAP LPGVSLRIVN TNNQLLSEDL IGRLQIQGPT ITAGYYRNPE LNREVFTDDG WFTTGDLAFL HQGRLTIAGR EKDVIIINGI NYHNHEIEAL VETIEGVEVS YTAACSVPSK HTGGTESLVI FYVSKSAEFD QQLAQINQIR EVVVQKIGIN PSYVLPVAKS DIPKTAIGKI QRSQLSQRFI NGEFSSITKP IDLALANQQT LPRWFFSKQW QPVSKRHNSA LLKPSYAIFS DDTTLARELI DVLEQHHRDW VLISAGETFS QQGQHYTINL HDPEHYHQIA ATLAATNIHD YVHLYSCDLP SEIEHVGDLA AAQYRGTYSL LFLTQALAKQ KLSQASLTVV SQRSHAINQS DQVIYAAAPI HGLLKTMPLE IDWLSCQHVD LDAASATTNS QQIYYELAQP KPSAEVAYRA GQRLVPQLVE AEMAQSSPVE SPLVKGGLYL VTGGLGGIGS QFARWLLQNY NARLLITGST ELPLGSDWAK HLGTDSSLSK RLRAYKDLID ISNDVHYQAV DITDSAQLAR LINDAEQRWN QPLAGVFHFA GAGNLAYHWT VMDHHWITNE SLATFEMMFA PKVYGTWALQ RALSQRPELP IVAMSSINSF FGGATFSAYS AANSFLDSFM LHQRQTSHPK ALCLNWTQWD NIGMSLNNPQ QIRSLSAERG YNVIGLQQGL QSLLAGISQN QYPLLMIGLN ADSPALRQHL AVSQPLQQRI NLYTTHQHGP LSHDRYRQLA NSYFGSATLE WYRVAELPRT SSGAIDLAAL GQLDATNQQT ALDQPTNIIE EQLVSIWQEI LGKPKIGIHD NFFALGGHSL LATQLVSRLR DGFNLEVRLY QLFAAPTIAE LANCIAELQL EQIDSAEMDA 
LLAELEGLSE AELEAGLG
|
| |
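The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC fraction. The helper names `normalize` and `gc_fraction` are hypothetical, and only the opening and closing fragments of the gene sequence are used, so the GC value it produces describes that short fragment and not the whole gene.

```python
def normalize(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks and the final line of the gene sequence above.
first_blocks = normalize("ATGGAAATGG ATTTTCAGAG TATCCAGCGA")
last_blocks = normalize("ATTATCAGAA GCCGAACTTG AAGCAGGATT AGGGTAA")

print(first_blocks[:3])           # first codon of the CDS
print(last_blocks[-3:])           # last codon of the CDS
print(gc_fraction(first_blocks))  # GC fraction of the 30-base fragment only
```

Applying `normalize` to the full gene sequence and passing the result to `gc_fraction` would give a value comparable to the GC content row in the Gene Information table.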