Gene Haur_2414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2414 
Symbol 
ID5734295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3086167 
End bp3091623 
Gene Length5457 bp 
Protein Length1818 aa 
Translation table11 
GC content51% 
IMG OID641279555 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001545182 
Protein GI159898935 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATGG ATTTTCAGAG TATCCAGCGA CGTTTTGCAA CTGCTGTTGA ACAACGATCG 
AGCCAGGCGG CGCTGCGTTA TCACGACCAA GTAGTTTCGT ATCACGAGCT GGCTGAGCAT
GCCCAACGCA TTGCCAGCGG CCTAGCCAAT CAGCAGGTTG GGGTCAATAC CAATGTCGCA
ATTCAGCTAA CCAACCCAAT AGATGTTTGT AGCACCATTT TGGCCACATT GCTGTTGGGT
GCACGCTATG CTTTGCTTAG CCCTAATTTA GCCAAGCTTC GGCTGCAACA GGTCTTGGCC
CGCCAACAAT TTGTGCTGGT TGGCTCAGCC GCTAGCAACA ATTTAGCTGC TAACTATATC
GAATTTGAGC AACTTGCTAA TAGCGAATTA GCGGAAATCA CTCCTCATTC AGCAACTGCC
GAAAGCTTGA TCGGCTTGAG TTTGGCCTCG AATCCGAGTG GGTTGATTGA AGCAGGCCAA
CTCAGCCAAA CCAATCTGCT GAGTTTTATC GATTTTAATC TGAGCAAAGC CAAAGTTAGT
TTCCAACAAA GCCTGTGGCT CGGCGAGGAA TTCAACGATT TTAGCGCCTT TGCGAGTTTG
GCAACCCTTG CTAGTGGCGG CACGCTGCGA TTTAGCACAC TTGAAACGCT CGATCACGAT
CTTGATGAAC AAGCGCAAAC CTTGATGCTG ACGCTTGCAA CGTTGGGCCA ACTGTTCGCG
CAACAGTCAG CATTGCCCAA AGTGCGCCAT ATTTTGAGTA GCGGCGAAGG CTTGCTCGAT
GGCGAAGCGC TCAAACAACA ACTCAAACAG CAACAAACCG CTTGGCACAA CTACTATGGG
TTCCCAGCCT TTCAATTGCT GACGGTGGTT GGAGCCAACA CTCAAACCCA AACAGCAAGC
CAAATTCATA GCGGCAAGCC GGTACCACAT ACCCAAGCTT TGATTCTTGA CCAACACAAA
CAACTCGCGC CAATCGGCTT AACTGGTGAG TTATATGTGG CTGGAGCAGG GGTTTTTGCA
GGCTTTGAGC AGGCTCAATT GAATGCCGAG CGTTTTATCG CCAGCCCATT TGCTGCCGAC
ACTCAACTCT ACCAAACGCG CTATCTGGCC CGTTGGCAAG ATGATGGGCG GCTGAGCATC
AGCGGTAGCC TTGATTCAAC GATCGAACTG GCTGCCACCC CAATCTTGCT CCAAGAAGTA
GAACGTCTGC TCGAACAGCA TCCGGCAATT GTTGAATGTT GTATCGTGCG GCGCATAACC
CTGAGCAACA CCGAGCAATT AACGGGCTTT GTGGTCGCCA AACAGCGCGT TGAGCCAAGC
GAGATCCTGA ATTACCTTGA AGATCAGCTT GACTGCCAAC TCGCAAACCT CGGCTTGATT
CAGGTTGATC AGCTACCGCG CACCGCCGAT GGTCAGCTTG ATCGTCAACA ATTGGCCCAA
ACTAACCTGC TCGATAACCG CCAAATTGCC GATTTGCAGG CTCAACTGCA ACAAGCAGCT
AATGGCGCAG AGCTAGCGGT GGTCGCTCAG CCGATCTTGC CAGTTTCAAG CCCGCTGCAC
ATCGACGATC TTGTGCCAAT GGTCGAAACT AGCAATTTTG GTACATCGCA ACGCACGATC
AGCGAGCAAC CAGTTGAAAT GTTGTCAACT CAGGCCAAGC CAGCGTTGGC AGTTGGCCCA
CCGTTGATCA AGGCGGAACA AGCGCCGTTG ACCTTAGCTG AGGCCTTAGT GCTCGCCGCC
AAACACTACC CTGAACATGG CATTAGCTAT ATCGAAGCCG ATGGCAAAGC GTTGTTTCAA
TCGTATGCCG CGCTATTGGC CGATGCTGAG GCGGTTTTGG CTGGTTTACG GGCGGCAGGT
TTGAAGCATG GCCAACATGT TGTGCTGCAA TTTGCCCATA ACGAGCCATT TGTGGTGGCA
TTTTGGGCTT GTATGCTTGG TGGTTTTACG GCGGTTCCCT TGGCCTTGCC CAACAGCAGC
GATCCCAATA ACCCCGCGGT GAGCAAGCTC TATAACACGT GGCAAACCTT GGAGCAACCA
CTGATCGTTA GTGAACAAGC CAGCTTCAGC CTGCTTCAGC GCATTTTTAA TGGCTTGGGC
GTGGTAAAAC CAGCGATTCA GATCACCGAA CAGTTGCGCC AACATCAGCC TGATCAGCAG
CATCAGCACC TCGCGCCGCA GGATTCGGCC TTGTTGTTGT TTACATCGGG CAGCACTGGC
CTTCCCAAAG GGGTCGAATT AAGCCATCAC AATATTATTA GCCGCTCCAA GGCTAGCGCT
CAACACAATC GTTTTGATCA TAATGATGTT TCATTAAATT GGATGCCGCT TGATCATGTT
GGTGGGATTG TGATGTTCCA CGTTCACGAT GTGTGCTTGG GCTGCCGCCA AATTCAGGCC
AAAACCGACT ATATTCTGGA AGATCCAGTG CGTTGGCTAG ATTTGCTTGA GCAGTATCGC
GCCACAATTA CTTGGTCTCC TAACTTTGCC TATGCCTTAA TCAACGATCA GCACGAACGG
GTCAATAGTC GCCGCCGCAA TCTTAGCTCG TTGCGTTTTA TTTTGAATGG TGGCGAGGGC
ATCAATAAAC AAACTGCCTT GAATTTTCTT GGTTTGTTGC AAGCCCATGG CCTGCCCGCA
ACGGCTATGC ACCCCGCTTA TGGCATGTCC GAAACCTCAT CGGGCATCAG CTCATCCGAT
CAACTGGTGC TTGGGGCAAC CACAGGCTTT CACGAACTCG ACCAAGCCTC ATTAACCGGG
GTGATTCAGC CAGCCAGCGC CGATAGCATT GGTGTAGCAT TCGTCGAAGT TGGCGCACCA
TTGCCGGGGG TTTCACTACG AATTGTCAAC ACCAACAATC AATTGCTCAG CGAAGATCTG
ATTGGCCGTT TGCAAATTCA AGGCCCGACG ATCACCGCTG GCTACTACCG CAACCCTGAG
CTTAACCGCG AAGTCTTTAC CGACGATGGT TGGTTTACGA CTGGCGATTT GGCTTTTTTG
CACCAAGGGC GTTTGACAAT TGCTGGCCGT GAAAAAGATG TAATCATCAT CAATGGCATC
AATTATCACA ACCACGAAAT CGAAGCCTTG GTTGAAACAA TCGAAGGCGT GGAAGTTTCC
TACACGGCGG CTTGTAGCGT GCCAAGCAAA CATACAGGCG GCACGGAGTC GCTGGTCATT
TTTTACGTTT CAAAATCGGC TGAATTTGAT CAACAGCTAG CCCAAATTAA CCAGATTCGC
GAAGTTGTCG TGCAAAAAAT TGGCATCAAC CCCAGCTATG TGCTGCCAGT TGCTAAGAGC
GATATTCCCA AAACAGCAAT TGGCAAGATT CAGCGTTCGC AATTGAGCCA GCGCTTTATC
AATGGCGAAT TTAGCAGCAT CACCAAGCCA ATCGATTTAG CCCTAGCCAA CCAGCAAACC
TTGCCACGCT GGTTCTTTAG CAAGCAATGG CAACCTGTCA GCAAGCGCCA TAATTCGGCC
TTGCTCAAGC CAAGCTATGC AATTTTCAGC GACGATACCA CGCTAGCGCG TGAGTTGATT
GATGTGCTTG AACAGCACCA TCGCGATTGG GTCTTGATCA GCGCTGGCGA AACATTCAGC
CAGCAGGGTC AACACTACAC GATTAATTTG CACGATCCTG AGCATTACCA TCAGATCGCT
GCCACTCTTG CAGCCACCAA TATTCACGAT TATGTTCACT TGTATAGCTG TGATCTACCA
AGCGAGATTG AGCACGTTGG CGATTTGGCC GCCGCCCAAT ATCGTGGAAC CTATAGTCTT
TTGTTCCTGA CCCAAGCCTT GGCCAAGCAA AAGCTCAGTC AAGCCAGCTT AACCGTGGTT
TCACAACGCA GTCATGCGAT TAACCAAAGC GATCAGGTGA TTTATGCGGC GGCTCCAATT
CATGGTTTGC TCAAAACCAT GCCCTTAGAA ATTGATTGGC TCAGCTGCCA GCATGTTGAC
CTTGATGCAG CAAGTGCAAC CACCAACAGC CAACAAATTT ACTATGAACT AGCCCAACCG
AAGCCCAGCG CCGAGGTGGC CTATCGGGCA GGTCAACGAC TCGTTCCACA GTTAGTTGAA
GCCGAAATGG CCCAATCAAG CCCAGTCGAA TCGCCGCTGG TCAAAGGTGG CCTGTATTTG
GTGACTGGCG GCTTAGGCGG CATTGGCAGC CAATTTGCTC GCTGGTTGCT GCAAAACTAC
AATGCTCGTT TGCTGATCAC AGGCTCAACT GAATTGCCAT TGGGCAGCGA TTGGGCCAAG
CATTTGGGCA CCGATAGCAG CCTGAGCAAA CGCTTACGTG CTTATAAAGA TCTAATCGAT
ATTAGCAACG ATGTGCATTA CCAAGCGGTG GATATCACCG ATTCAGCTCA ATTGGCTCGC
TTAATTAATG ATGCAGAACA GCGCTGGAAT CAACCATTGG CCGGAGTTTT CCACTTTGCT
GGGGCTGGCA ATTTGGCTTA TCACTGGACA GTGATGGATC ATCACTGGAT TACCAACGAA
AGCCTCGCCA CCTTTGAAAT GATGTTCGCG CCCAAAGTCT ATGGCACGTG GGCCTTGCAA
CGAGCCTTGA GCCAGCGCCC AGAACTGCCA ATTGTGGCAA TGTCGTCGAT TAATAGCTTT
TTTGGCGGAG CAACATTTAG TGCCTACTCG GCAGCCAATA GTTTTCTCGA TAGCTTTATG
CTGCATCAAC GCCAAACCTC GCATCCCAAA GCGCTCTGTT TAAATTGGAC GCAATGGGAC
AATATTGGGA TGAGTCTCAA TAATCCGCAG CAAATTCGCA GCCTTTCTGC CGAGCGCGGC
TACAACGTAA TTGGTTTGCA ACAGGGTTTG CAATCGCTCT TGGCGGGCAT CAGCCAAAAC
CAATATCCAC TGTTAATGAT TGGCTTAAAT GCTGATAGCC CAGCGCTGCG CCAACATCTG
GCAGTCAGCC AGCCCTTGCA ACAACGCATC AATCTTTATA CAACCCATCA GCATGGCCCG
CTCAGCCACG ATCGCTATCG CCAACTTGCC AATAGCTATT TTGGCTCGGC CACGCTTGAA
TGGTATCGCG TTGCAGAATT GCCACGCACC AGTAGCGGCG CAATCGATCT GGCTGCTTTA
GGCCAACTCG ATGCCACCAA CCAACAAACC GCGCTCGATC AACCAACCAA TATCATCGAA
GAACAGTTGG TCAGCATTTG GCAAGAGATT CTCGGCAAGC CCAAGATCGG CATTCACGAC
AACTTTTTTG CCTTGGGCGG CCATTCGCTG CTGGCAACCC AGCTTGTTTC ACGTTTGCGC
GACGGCTTTA ACCTTGAAGT ACGGTTGTAT CAACTCTTCG CCGCGCCAAC CATCGCCGAA
CTTGCCAACT GCATCGCCGA GCTGCAACTT GAACAAATCG ATTCAGCCGA GATGGATGCA
CTCTTAGCCG AGCTTGAGGG ATTATCAGAA GCCGAACTTG AAGCAGGATT AGGGTAA
 
Protein sequence
MEMDFQSIQR RFATAVEQRS SQAALRYHDQ VVSYHELAEH AQRIASGLAN QQVGVNTNVA 
IQLTNPIDVC STILATLLLG ARYALLSPNL AKLRLQQVLA RQQFVLVGSA ASNNLAANYI
EFEQLANSEL AEITPHSATA ESLIGLSLAS NPSGLIEAGQ LSQTNLLSFI DFNLSKAKVS
FQQSLWLGEE FNDFSAFASL ATLASGGTLR FSTLETLDHD LDEQAQTLML TLATLGQLFA
QQSALPKVRH ILSSGEGLLD GEALKQQLKQ QQTAWHNYYG FPAFQLLTVV GANTQTQTAS
QIHSGKPVPH TQALILDQHK QLAPIGLTGE LYVAGAGVFA GFEQAQLNAE RFIASPFAAD
TQLYQTRYLA RWQDDGRLSI SGSLDSTIEL AATPILLQEV ERLLEQHPAI VECCIVRRIT
LSNTEQLTGF VVAKQRVEPS EILNYLEDQL DCQLANLGLI QVDQLPRTAD GQLDRQQLAQ
TNLLDNRQIA DLQAQLQQAA NGAELAVVAQ PILPVSSPLH IDDLVPMVET SNFGTSQRTI
SEQPVEMLST QAKPALAVGP PLIKAEQAPL TLAEALVLAA KHYPEHGISY IEADGKALFQ
SYAALLADAE AVLAGLRAAG LKHGQHVVLQ FAHNEPFVVA FWACMLGGFT AVPLALPNSS
DPNNPAVSKL YNTWQTLEQP LIVSEQASFS LLQRIFNGLG VVKPAIQITE QLRQHQPDQQ
HQHLAPQDSA LLLFTSGSTG LPKGVELSHH NIISRSKASA QHNRFDHNDV SLNWMPLDHV
GGIVMFHVHD VCLGCRQIQA KTDYILEDPV RWLDLLEQYR ATITWSPNFA YALINDQHER
VNSRRRNLSS LRFILNGGEG INKQTALNFL GLLQAHGLPA TAMHPAYGMS ETSSGISSSD
QLVLGATTGF HELDQASLTG VIQPASADSI GVAFVEVGAP LPGVSLRIVN TNNQLLSEDL
IGRLQIQGPT ITAGYYRNPE LNREVFTDDG WFTTGDLAFL HQGRLTIAGR EKDVIIINGI
NYHNHEIEAL VETIEGVEVS YTAACSVPSK HTGGTESLVI FYVSKSAEFD QQLAQINQIR
EVVVQKIGIN PSYVLPVAKS DIPKTAIGKI QRSQLSQRFI NGEFSSITKP IDLALANQQT
LPRWFFSKQW QPVSKRHNSA LLKPSYAIFS DDTTLARELI DVLEQHHRDW VLISAGETFS
QQGQHYTINL HDPEHYHQIA ATLAATNIHD YVHLYSCDLP SEIEHVGDLA AAQYRGTYSL
LFLTQALAKQ KLSQASLTVV SQRSHAINQS DQVIYAAAPI HGLLKTMPLE IDWLSCQHVD
LDAASATTNS QQIYYELAQP KPSAEVAYRA GQRLVPQLVE AEMAQSSPVE SPLVKGGLYL
VTGGLGGIGS QFARWLLQNY NARLLITGST ELPLGSDWAK HLGTDSSLSK RLRAYKDLID
ISNDVHYQAV DITDSAQLAR LINDAEQRWN QPLAGVFHFA GAGNLAYHWT VMDHHWITNE
SLATFEMMFA PKVYGTWALQ RALSQRPELP IVAMSSINSF FGGATFSAYS AANSFLDSFM
LHQRQTSHPK ALCLNWTQWD NIGMSLNNPQ QIRSLSAERG YNVIGLQQGL QSLLAGISQN
QYPLLMIGLN ADSPALRQHL AVSQPLQQRI NLYTTHQHGP LSHDRYRQLA NSYFGSATLE
WYRVAELPRT SSGAIDLAAL GQLDATNQQT ALDQPTNIIE EQLVSIWQEI LGKPKIGIHD
NFFALGGHSL LATQLVSRLR DGFNLEVRLY QLFAAPTIAE LANCIAELQL EQIDSAEMDA
LLAELEGLSE AELEAGLG