Gene Haur_3129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3129 
Symbol 
ID5735001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3950482 
End bp3955932 
Gene Length5451 bp 
Protein Length1816 aa 
Translation table11 
GC content53% 
IMG OID641280272 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001545894 
Protein GI159899647 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3208] Predicted thioesterase involved in non-ribosomal peptide biosynthesis 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.975665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAG TCGCACGCCA GCTTGAAGAT CTTTCGCCCG AACGCCGCGC CTTGTTGGCC 
CAACGTCTGC GCCAACGCCA AGCCGCCAAG CCAGTGCCCA GCATTCCAGC GCTGGCTCGG
ATCGGCGAAC ATCCAGCCTT TGAACTTTCG TTTGCCCAAC AACGGCTGTG GTTTTTAAGC
CAGTGGCAAC CAGAAAGCGC CGCCTATCAC ATTCCAGCCA CATTTAGCAT TGCTGGTGAA
ATTAATGTTT CTGTATTGCA AACTTGTCTT GATAAAATTA TTCAGCGCCA TGAAGTGCTG
CGTAGCACGA TCGAAGTGCT CAACGACCAG CCAATGCAGG TTATTCAGCC ATTCCGCAGC
CTTGATTTGG AGCTTGTTGA TCTCCGTGGG CTGAGCAATG AGCAACAAGC CAGTCAACGC
CAGCAGCAGA TCGAACGCCA CAGTCAGCAG CCGTTTGATC TCAGCAAAGA TTTAATGCTA
CGCGGATTGT TGCTGCACAC GGCTGCTGAT CACGCTGAAT TGGTATTGAC GATTCACCAT
ATTGCCTGCG ATGGTTGGTC GATTGGGGTC TTGTTGCGTG AGCTAGGCCA ACTTTACGAA
GCTGGGCTAC GCGGCGAGCA ACTTGAGCTA CCAGCGCTGC CAATTCAATA TGCCGATTTT
GCGGTTTGGC ATAAACGTTA TGTGCTTGAG CAAGTGTATC AGCAACATTT AAATTTTTGG
CAAGAACAAT TAAGCGGCAC GCTACCCTTG CTCAATTTGC CAACCGACCA TGCGCGACCA
GCGGTCAAAC GCGACCTTGG GGCAACCGTC GAATATCATC TACCTTGGAG TTTGGTCGAG
GCGGTTGAGC GCTTGAGCCG CCAAGAGCGA GCCACGCCGT TTATGCTGTT TATGGCGGCT
TTTCAGGTTT TGCTCTATCG CTATTCAGGC CAAAGCGATC TGATTATTGG TACGCCGATT
GCCAATCGCA CCCGCCGCGA AATCGAAAAT CTGATTGGAT TTTTCGTCAA TACCTTGCCA
ATTCGGGTCA ATTTGGCTGG CTCGCCTAGC TTTCGTAGCC TAATTGCCCA AGTGCGCCAA
ACCTCGTTGG CCGCTTTCGA GCATCAAGAT ATGCCGCTCG AACACTTGAT TGATGTGCTC
AAAGTCGAAC GCAGTTTGAG CCATAACCCA CTGTTTCAAG CCCTGTTTGT GCATCAAACC
ACCAGCATCC AAACTGTCGA TTCAGGTGAA TTTGGCTTGC AATTTGGCGG GGCAATTGAA
ACTGGCAGCG CTAAATTTGA TATTAATCTG AATTTGGCCG CCAATCGTGC CACTTTTGAA
TACAATACCG ATCTGTTTGA GCGCAGCACG ATCGAACGCA TGGCCAGCCA TTTCCATAGC
TTGTTGGAAT ATGCGGTGAC CAATCCCGAT GCCAGCATCG AGCATTTGCC TTTGCTGAGC
AGCAGCGAAC GCCAACAATT GCTGCAAACC TGGAACAGCA CCAGCGCCAA TTATCCTGCG
GTTGATTCGA TTGTGCGTTT GTTCGAAGCT CAAGCGGCGC GAGTCCCAGA GCGCACGGCC
TTGCATTTTG AAGGCCAAAC CCTGAGCTAC GCCGAATTGA ATCAACGCGC CAACCAACTA
GCACATAGCT TGCGCCAACG CGGCATCGGC TGCGATATGC GAGTAGGCTT GTTTATCGAT
CGCTCGTTGG ATTTATTGGT TGGCGCGTTG GGAATTCTCA AAGCGGGCGC GGCCTATGTG
CCAATCGACC CAATTTACCC CCAAGATCGC ATCAGCGCTA TGCTCGAAGA TGGTGCAGTG
AGTTTGCTGC TTACCCACGC TGAGCTAGCC GCCGAATTGC CAAAACTTGA TCTTGAGGTG
CTGTGCCTCG ACCAAGCATG GCCGACGATT GCCCAAGCGC CAACGCACAA CTTGAATTTG
GCGCTTGAGC CGCGCAGTTT GATGTATGTG CTGTTTACCT CTGGCTCGAC TGGCCGCCCC
AAAGGTGTGG CAATCGAACA CCATAATTAT GTCAACTATA TTCAAGGCTT ATTGCAACGA
ATTGAAGCCG AAGATGGCTG GAGCTATGCC TTGGTTTCGA CCTTCGCGGC AGATCTTGGC
ACAACCAATG TGTATGGAGC CTTGTGCAGC GGCGGCGAAT TACATATCGT GGCCTATGAA
CGCGCCACTG ATCCCGAAGC GTTTGCAGCC TACTTTCGCC AGCATCGCAT CGATGTGATG
AAACTTGTAC CCAGCCATTT CGAGGCCATG CGCGGCTTGA ACAACTTGGC CGATGTCATT
CCCAAGCAGC GCTTGATTTT GGCGGGCGAA GCCAGCCTTT GGGAGCAGCT TAGCGATATT
CGCCAACTTC AGCCTAGCGT GCAACTGCAA AACCACTATG GACCAACCGA AACCACGGTT
TCCATGCTGA CCTATCCAAT TCCCAGCCAA CCACACTACC CCAGCAGCAC CGTGCCACTT
GGCAGGCCAT TGGGCAATGT GCAAATTTAT GTGCTCGATC GCCGAATGCA GCCAACGCCG
CAAGGTGTAC CAGGCGAACT CTACGTTGGC GGCGCTGGTG TGGGGCGTGG CTACATCGGG
CGGCCCGATC TGACCGCCGA GCGTTTTGTG CCCAATCCAT TTAGTACCGA AGCTGGAGCG
CGGCTCTATC GCAGCGGCGA TTTGGTGCGC TATCAGCCTG ATGGCGCGAT TGAATTTTTG
GGTCGGATCG ATCTACAAGT CAAAATTCGT GGCTATCGGG TTGAGTTAAG CGAGATCGAA
ACCGCGATTC AAGCTCAGGC ACAGGTTGCC AATAGTGTAG TGATTTTGCG CGAAGACACG
CCAGGTGATA AGCGCTTGGT CGCCTATATC GTGCCAGAAG CAGGCCAAAG CCTGAATATT
GGCAGCATTC GCGAGGCCTT GCGCAATAGT TTGCCCGATT ATATGGTGCC AACCGCCTTT
GTTGAATTAG ATGGATTGCC GTTGAACCCC AATGGCAAAA TCGAACGCCG CGCCTTGCCC
GCCCCCAGCA ACGAGCGCAA TCTCGATAGT TACGTTGCGC CACAAACCGC CACTGAACAT
GAATTGGCAG GAATTTGGGC CGAGGTTTTG GGGCTTGATC AGGTTGGCAT CGACGATAAT
TTCTTCGATT TAGGCGGCGA ATCATTTAAA GCGATTCGGG TGGTGCGCAA AATTGGCAGC
CATATCAGCG TTATGACGCT GTTCAAATAT CCGACAGTGC GTGAATTGGC CGCTCATCTC
TCGGGTGCAA GCAGCGCTGA GAGCGGCGGC ATGCTCTATG AATTGAGCAA AGCACAGGCT
AAACAGCACA CCACGATTGT GGCAATTCCC TATGGTGGCG GCAGCGCGAT CACCTATCAA
CCATTGGCCC AAGCCATGCC CAAGGGCTAT CGGTTATTGG CCGCCGAGCT ACCTGGCCAC
GATTTCAGCC GCCCCGACGA GCCATTGCAA GCCTTAGAAG TCGTGGCGAG TCAGCTTGCC
AGCGAAATTC AAACCAAAAC CCAAGGGCCA ATTGTGTTGT ATGGCCATTG TGTGGGCAGC
GCCATGACCG TGGAAATTGG CCGTTTGTTG GAGCAAGCAG GCCGCGACGT TCAGGGGATT
GTGCTCGGTG GCAATTTTCC GGCGGCTCGC GTGCCAGGCC GCTTCTTTGA ATGGCTCAAC
AAACTTATGC CCGCCGATCG TTGGATGTCG GATCGGACAT ATCGCGATTT TCTCCGCGCG
TTGGGTGGCT TCACCGAAAT TGTCGATCAA GCTGAACAAA CCTTTGTGAT GCGCAGTTTG
CGCCATGATG CGCGTGAAGT TGAGCGCTAT TTCACCCAAG CCTTTGCCCA AAAACAGTCG
CAACAACTCA AAGCTCCAAT CGCCTGTATC ATCGGCGAGA TGGATCGAGC GACCGAATAT
TACCAAGAGC GCTACCGCGA ATGGGAATAT TTCAGCAACA ATGTGACGCT GCATGTAATT
CCCCACGCTG GCCATTATTT TCTCAAACAT CAGGCCAGCG AATTGGGCCA AATTATCGAG
CAACAAACCG AGCAATGGCA ACAACCACGG CCAATTCAGC CAACTGCCGC CAAATCAAAA
TCGCATAAAA CCAGCATGCC CAGCCTACGA ATTTTCTTTA TGGTTGCGTT AGGCCAATTG
GTTTCGATGC TTGGCTCAAG TTTATCAAGT TTCGCCTTGG GCATTTGGAT CTATCAACGA
ACTGGCACGG TCAGCGATTT TGCCTTTACC GCGATTGCCT CGATGCTGCC AAGTTTGCTG
GTTTCGCCAT TGGCTGGAGC AATTGCCGAC CGCTGGGATC GGCGCTGGAT TATGATTATT
GCTGATACAA TTTCGGCGCT TTCGACAATT GTGATTGCAA TGTTGCTGTG GGCCAATAAA
CTTGAGGTTT GGCATATTTA CCTCACGGCG GCAATTAGCT CGATTGCCGG AACCTTCCAG
CGCCCAGCGT ATGCCGCAGC CATGACCCAG CTTGTGCCCA AACAATATCT GGGCCATGCC
AACGGCGTGA TTCAGCTTGG CTCGGCTACG GGCGGACTGA TTGCACCGTT TATTGCTGGC
GGCATGGTGG CCTTCTTTGG CTTGGGCGGG GTCTTTTTGC TCGACTTCAT CTCGTTCAGC
TTGGGAATTG GAGTATTGTT CTTGGTGCGC TTCCCCAACA CGCTCTATCA CAAGCGCGAG
GAGCCATTGC TGCGCGAAAT TGTGCGCGGC TGGGAATATA TCATCAAGCG CCCAAGTTTG
GTGGCGATGG TGCTGTTTTT CGCCTTGGGC AACATCTGGT TTGGCATCGC CAGTATCAGC
ATGAGTCCGT TGGTGCTTTC GTTTGGCGGG CCTGCCGAAT TAGGCATTGT CAGCGCAGCT
TGTGCTTTGG GCGGCTTCCT CGGCGGCTTA TTTATGAGCT TATGGGGCGG TTTGCAGCGT
CGTGCCGAAG GCATGGTTGG CTTCGTCATT CTCGAAGGCT TTTTCATTGC CTTGGCCGGG
TTGCGGCCTT CGGTATGGTT GGTAGCCTTA GCGATGTTTG GAATGTGGTT TGCGATTTCG
CTGGTCAACG CCCACTGGCA AGTGCTCATT CAAACCAAAG TTGGCCTGGA GCTACAAGGG
CGGGTGCAAG CAACCAACCA AATGCTGGCC ATGCTCAGCA TCCCGCTGGG CTATTGGCTG
GCTGGGCCAT TGGCTGATAA CTTGTTTGGC CCTTTGCTCG AACCAAACGG CGCACTCAGC
AGCAGTTTGG GTTGGCTCTT CGGGGTCGGG CCTGATCGCG GGATTGGCCT GTTGATGGTG
GTGGTTGGCT TAGGTGCAGC AATTTGGGCC TTGATTGGCT TCAACTATCG CCCCTTGCGC
TACATGGAAG ATGCCCTGCC CGATGCCATC CCTGATGCTG AAATCGCCAG CGACCGCGAT
ACGATTCAAG CCCAAGCCGA TGGTATAATC GCTGTGACAG CAAAAGGCTA A
 
Protein sequence
MTEVARQLED LSPERRALLA QRLRQRQAAK PVPSIPALAR IGEHPAFELS FAQQRLWFLS 
QWQPESAAYH IPATFSIAGE INVSVLQTCL DKIIQRHEVL RSTIEVLNDQ PMQVIQPFRS
LDLELVDLRG LSNEQQASQR QQQIERHSQQ PFDLSKDLML RGLLLHTAAD HAELVLTIHH
IACDGWSIGV LLRELGQLYE AGLRGEQLEL PALPIQYADF AVWHKRYVLE QVYQQHLNFW
QEQLSGTLPL LNLPTDHARP AVKRDLGATV EYHLPWSLVE AVERLSRQER ATPFMLFMAA
FQVLLYRYSG QSDLIIGTPI ANRTRREIEN LIGFFVNTLP IRVNLAGSPS FRSLIAQVRQ
TSLAAFEHQD MPLEHLIDVL KVERSLSHNP LFQALFVHQT TSIQTVDSGE FGLQFGGAIE
TGSAKFDINL NLAANRATFE YNTDLFERST IERMASHFHS LLEYAVTNPD ASIEHLPLLS
SSERQQLLQT WNSTSANYPA VDSIVRLFEA QAARVPERTA LHFEGQTLSY AELNQRANQL
AHSLRQRGIG CDMRVGLFID RSLDLLVGAL GILKAGAAYV PIDPIYPQDR ISAMLEDGAV
SLLLTHAELA AELPKLDLEV LCLDQAWPTI AQAPTHNLNL ALEPRSLMYV LFTSGSTGRP
KGVAIEHHNY VNYIQGLLQR IEAEDGWSYA LVSTFAADLG TTNVYGALCS GGELHIVAYE
RATDPEAFAA YFRQHRIDVM KLVPSHFEAM RGLNNLADVI PKQRLILAGE ASLWEQLSDI
RQLQPSVQLQ NHYGPTETTV SMLTYPIPSQ PHYPSSTVPL GRPLGNVQIY VLDRRMQPTP
QGVPGELYVG GAGVGRGYIG RPDLTAERFV PNPFSTEAGA RLYRSGDLVR YQPDGAIEFL
GRIDLQVKIR GYRVELSEIE TAIQAQAQVA NSVVILREDT PGDKRLVAYI VPEAGQSLNI
GSIREALRNS LPDYMVPTAF VELDGLPLNP NGKIERRALP APSNERNLDS YVAPQTATEH
ELAGIWAEVL GLDQVGIDDN FFDLGGESFK AIRVVRKIGS HISVMTLFKY PTVRELAAHL
SGASSAESGG MLYELSKAQA KQHTTIVAIP YGGGSAITYQ PLAQAMPKGY RLLAAELPGH
DFSRPDEPLQ ALEVVASQLA SEIQTKTQGP IVLYGHCVGS AMTVEIGRLL EQAGRDVQGI
VLGGNFPAAR VPGRFFEWLN KLMPADRWMS DRTYRDFLRA LGGFTEIVDQ AEQTFVMRSL
RHDAREVERY FTQAFAQKQS QQLKAPIACI IGEMDRATEY YQERYREWEY FSNNVTLHVI
PHAGHYFLKH QASELGQIIE QQTEQWQQPR PIQPTAAKSK SHKTSMPSLR IFFMVALGQL
VSMLGSSLSS FALGIWIYQR TGTVSDFAFT AIASMLPSLL VSPLAGAIAD RWDRRWIMII
ADTISALSTI VIAMLLWANK LEVWHIYLTA AISSIAGTFQ RPAYAAAMTQ LVPKQYLGHA
NGVIQLGSAT GGLIAPFIAG GMVAFFGLGG VFLLDFISFS LGIGVLFLVR FPNTLYHKRE
EPLLREIVRG WEYIIKRPSL VAMVLFFALG NIWFGIASIS MSPLVLSFGG PAELGIVSAA
CALGGFLGGL FMSLWGGLQR RAEGMVGFVI LEGFFIALAG LRPSVWLVAL AMFGMWFAIS
LVNAHWQVLI QTKVGLELQG RVQATNQMLA MLSIPLGYWL AGPLADNLFG PLLEPNGALS
SSLGWLFGVG PDRGIGLLMV VVGLGAAIWA LIGFNYRPLR YMEDALPDAI PDAEIASDRD
TIQAQADGII AVTAKG