Gene Haur_1858 details

Gene Information

Locus tag: Haur_1858
Symbol:
ID: 5733747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2170286
End bp: 2174872
Gene Length: 4587 bp
Protein Length: 1528 aa
Translation table: 11
GC content: 53%
IMG OID: 641279002
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544629
Protein GI: 159898382
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
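
The start, end, and length fields in this record are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon. The variable names are illustrative and not part of the source database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2170286
end_bp = 2174872

# Inclusive span of the gene in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons, including the terminal stop codon.
codons = gene_length // 3

# The stop codon is not translated, so the protein is one residue shorter.
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.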


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTATGA ATGAGCGCGA AGTGTTTGCC TTCCCGATGT CGTTTGCCCA GCAACGGCTC 
TGGTTTTTGG AGCAACTTCA GCCGAATAGT GCCTTGTATC ATATCGCTAG TTTGCTGGAA
ATTCAGGGGT CGCTTGATCT GGCGGCGTTG CAGCAGAGCA TTAACCAGAT TGTTGTGCGT
CACGAGACGT TGCGTACCAC CTTTGGCATG GTCGATCAAA CGCCGATGCA ATTGATCAGC
AGTGAGTTGA CGCTCACCCC GGTTGTCCAT AATTTGCAGG CGCTTGAGCC TGAGCAACGC
TGGTTTGTGG CGCTTGAACA GGCCAAAGCC AGTTGTTTGG TGCCATTTGA TTTGAGCCAA
GGCCCGTTGG TGCGGCTTGA GTTGTTTCAA CTTGGGCCTG AGCATTGCTT GATGACGGTC
GTGCTGCACC ACATTATTGC TGATGGCTGG TCGATGGAAC TTTTCATTCA GGAACTAGTC
AGCAGCTATG AGGCCTATCG CGCGGGTTTT CATCCGCAAC TTGCGCCCTT GGCCTTGCAA
TATGCCGATT ATAGCGAGTG GCAGCGTGAA TGGCTGGCTA GCCCACGCCA GCAGCAACAA
TTGGATTTTT GGCAACAGCA ATTGGCCGAT GCCCCCAAGC GTTTGGAGTT AGCAACTGAT
CACCCTCGGC CTGCTCAGCA GAGTTTTGCG GGGAGCACGT TGAGTTTTGG CATCCCAAGC
CAACAAACTA GTCAACTGCG CAGTTTGGCC CAGCAAACCC AAACTACGCC GTTTATGCTA
GCTTTGGCGG TGTTTGCCAG CTTGTTGAGC CGTTATAGCC GCCAAGATCA GGTGTTGATT
GGCTCGCCGA TTGCCAACCG TACTACCGCC GAAGCCCAAC CGCTGATTGG CTTTTTCGTC
AATACAGTGG TTTTCAAGGT ACAACTGAGT CCAAACCTCG ATTTGCTGAG CCTAGTTGAG
CAGGTACGCG AGCAAAGTTT GGCGGTCTAT GCCAACCAAG ATGTGCCGTT TGAGCAGGTG
GTGCAGCAGT TACAGCTTGA GCGCAATCTG AGCCATACGC CCATTTTTCA GGTGATGCTA
GCTTACCAAA ATGTGCCGAG CCAGCAACTG AGCATCGCCA ACTTGACAAT CAAACAGCTG
CCGCTGGATT TGGGTTATGC CAAATTTGAC CTAACTTTAT TTATTGAGGA AACTCCAGCA
GGGCTAGTTG GGCGCTTGGA ATATAACCGT GATTTGTTTG AGCCTGCCAC GATTGCGCGC
TTGCGCGACC ATTTCTTGCG TTTGCTAGGC CACGCCTTGG CTCAGCCAAC CCAACCACTG
GCCCAAATCA GTATCTTGAG CGCTGCCGAA TGCCAACAGC TTTTGGTCGA TTGGAACCAA
ACGCAGCAGC CGTTCCCCGA CCAACTTGGT TTGCAGCATT TGGTCGCGCA GCAAGTACAA
CGTACTCCCA ATGCTCCAGC GATGCGTTGG AATAACCAAA TAATCTGCTA TACAGAGCTT
GAGCAACGTG CCAACCAATT AGCCCATTTG CTGCTGCAAC GTGGCGTTAC CCAAGGCTCA
ATCGTTGGAG TCTATGCGAC GCGCTGCCCA GAAATGATCA TCAGCTTGCT GGCGATTTTG
AAGGCTGGTG CTGCCTACTT GCCGCTTGAT CCGGCCTATC CTGCTGAACG CTTGCACTAT
TTGGTGGCCG ATTCGGCGGC GAGTTTGATT GTGCAAGCCA GCCATCAGGC GCTGCCAACC
CTCGTTAGTA CAGCTGAAAC GCTTGATGTT GTAGCCGAAG CTGAAACGCT GGCTAGCTTG
CCAACCACTG CTCCGATGGT TGATTTCGAC CCGCAGCAAT TGGCCTATGT GATTTATACC
TCTGGCTCGA CTGGCAAGCC CAAAGGTGTG CTGATTCAGC ATCAAGGGGT GGTGAATTAT
CTGCACTGGG CGATTCATTA TTATCCATTT GAGCAGGGTG CTGGTGCACC GCTGGCCTCG
TCGTTGGCCT TCGATGCCAC AATTACGGCA TTGTGGGGGC CACTCTGTAC GGGCAAAACC
ATCGATTTGC TGCCTGAGCA GGATGAGCTA GAAGTCTTGG CGCAACGCCT GAGCAGCGAA
GATTATAGCG TGCTCAAAAT CACCCCAGCG CATATGGAAG CGCTTAGTCA GCTGGTTGCG
CCCGACCAAA TTGGCTCAAG CAAGGCCTTT GTGATTGGCG GCGAGGCCTT GTTGCAGCAA
CATGTGGCCT TTTGGCAAAC CAACGCTCCC AACCTGCGCT TGATCAATGA ATATGGCCCA
ACCGAGACGG TAGTTGGCTG TGTGATCTAT CAAGCCCAAG CTGCGCCAAG CGAATGGGCT
GCCGTGCCGA TTGGCCGCCC GATTGCCAAT ACCCAGTTGT ATGTGCTTGA TCCGGCAGGT
TTGCCAGTGC CGATTGGCGT GCCTGGCGAG TTGTATATCG CTGGCTTGGG TGTTGGGCGC
GGCTACCATG GGCGGCCTGA ATTGACCGCC GAGCGCTTTG TGCGGCTGGA ACAATTGGCT
GGGGTGCAGG CAGAACTTGC CCGTTGCCAA CAGCCTCAGC CAGCGTTTGA ACGCTTGTAT
CGTTCAGGCG ATTTGGTGCG CTATCTGCCC GATGGTAATC TCGAATATCT TGGGCGGATC
GATCAGCAAG TCAAACTCCA TGGCTTTCGA ATTGAGCTTG GCGAGATCGA AGCCACGCTG
GCGAGCCATC CGACGGTGCA CGCGGCGGTG GCCATGATTC GCGAAGATCG GCCTGGACAT
AAGCGACTGG TTGCCTATGT GGTCGCTGAG CCAACTGCCA ATCAGGATAC TTCGATTGTT
TTGACCCATG TTGCCCAACA GTTGCCCCAC TATATGCTGC CAAGCGTGGT GATTTGGCTC
GATAGCTTGC CATTAACCCC CAATGGCAAA GTTGATCGTC AAGCGCTGCC CGCGCCTGAG
ATCAACCAAA CTGCGCTTGA TTCGGCCCAA ACCACCCCAC TCGATCAGTA TGAAGCTCAA
TTGATGGCTA TTTGGCAGCG AGTGTTAGGA CTGAAGGCCG TTGATCGCCA TGCCAATTTC
TTTAGCCTTG GTGGCGATTC GATTTTGGTG ATGCAGGTAG TCGGCATTGC ACGGCAGCAT
GGCCTGATTC TAACCCCACG CTTGTTGTTC CAAAACCAAA CGATTGCCAG CTTGGCCCAA
GCGATTCGCC AGCAAACCCA AGCTAAGCCC GCGCTTGATC CCTTGAGTTT GCAGGGCATT
GTGCCGCTTA GCCCAATGCA ACATTGGCTA TTTGAGCGCC AACTGGCCCA GCCCGCCCAT
GTCAACCAAA GTATTGTGCT CAAGTTACAA ACGGGGTTAG CGACCGAACA AATACAGGCA
GCGCTTGATC AATTAGTGCG TTTGCACCCA AGTTTGCGTT TGATCTTTAC CCAAACTGCG
GCTTGGCAAC AACGCTATGA GCCAGCCGCC AGCGTGCCCT TGCGCGAATT ACAGCAACCA
ACATTAAGCC AGCAACAAGT CTGTGATGCC GAATTGCAAG CTTCATTCGA TTTAGCTCAA
GTGCCGTTGT TACGAGCCTC GTTGTGGCGT GGTATCGACC ACGATCAATT GCTGTTGGTG
GCGCACCATA GCATTATCGA CGGGGTTTCG TGGCGAATTG TGCTCGAAGA TTTGGCTTTG
TTGCTCAACC AACAAGCTGT GCCAGCGGCA ACCACGCCAT TTAGTGAGTG GGCCGAATAT
CAGGTGCAGC AAGCCCAAAC CCCGCAATTG CTGAGCCAAC TCGCCTATTG GCGCTCGACG
ATTGAAGCCA TCACGCCGAT TCCTCAGCTA GCTCAAGCGG GGTTGGTTGG CGAAGCACAG
CGCTTTCAAA CCAAGCTTAA TCCTGAATTG ACCGAGCAAC TGCTGCATCA CGCACCTGAG
CGCAGCCGTA CCAGCGTGGC CGAGTTGCTG ATCACTGGCT TGGCGATAGC TTTCCAGCGT
TGGTCAAATC TACAACAATT AGTGCTTGAT ATTGAAAGCC ATGGTCGCGA ATCGCTTGAC
CCTGAGCATG ATTTCAGCCG GAGTTTGGGC TGGTTCACCA GTTTGTACCC AGTGCGCTTG
GATTTCCCCA CTACCAATGA GCCAAACCAG TGGATTAAGC AGATCAAAGA AAGCTTGCGG
GCAGTGCCGC AAGCCGGAGC GGGCTATGGC ATGTTGCGCT ATTTGCACGC TGATCCAGCG
ATTCGCGCGA GCCTTGTGCC AACTCACGCC CCAGCAATTG CCTTCAACTA CCTTGGTCAG
CTCGATAACC AACAAACTTT AGCACCATTC CAAGGGCTAA ATTTGGAGTT TGCCAGCCAA
ACCTTGGCTC CCACCAACCA ACGCAGTCAT GCCTTAGAGC TTAATTGTTA TAGCACTGAT
GGCTGTTTGG TGTTCGATTG GGAATGTCAT CAGACAGCGC GGGCAGCCGT TGAACACTTG
GCCGAGCAGT ATCAAATAGC CTTAGCCGAG TTGTTGCAAG TACCAACCAC AACTGCTAGC
TTGGCTCCCT CGGATTTTCC AGCAGCTCGC GTCAAGGCCA ACGATCTCGA TCGATTGTTG
GCTCGTTTGA AAGCAAAGGG GCAATAG
 
Protein sequence
MTMNEREVFA FPMSFAQQRL WFLEQLQPNS ALYHIASLLE IQGSLDLAAL QQSINQIVVR 
HETLRTTFGM VDQTPMQLIS SELTLTPVVH NLQALEPEQR WFVALEQAKA SCLVPFDLSQ
GPLVRLELFQ LGPEHCLMTV VLHHIIADGW SMELFIQELV SSYEAYRAGF HPQLAPLALQ
YADYSEWQRE WLASPRQQQQ LDFWQQQLAD APKRLELATD HPRPAQQSFA GSTLSFGIPS
QQTSQLRSLA QQTQTTPFML ALAVFASLLS RYSRQDQVLI GSPIANRTTA EAQPLIGFFV
NTVVFKVQLS PNLDLLSLVE QVREQSLAVY ANQDVPFEQV VQQLQLERNL SHTPIFQVML
AYQNVPSQQL SIANLTIKQL PLDLGYAKFD LTLFIEETPA GLVGRLEYNR DLFEPATIAR
LRDHFLRLLG HALAQPTQPL AQISILSAAE CQQLLVDWNQ TQQPFPDQLG LQHLVAQQVQ
RTPNAPAMRW NNQIICYTEL EQRANQLAHL LLQRGVTQGS IVGVYATRCP EMIISLLAIL
KAGAAYLPLD PAYPAERLHY LVADSAASLI VQASHQALPT LVSTAETLDV VAEAETLASL
PTTAPMVDFD PQQLAYVIYT SGSTGKPKGV LIQHQGVVNY LHWAIHYYPF EQGAGAPLAS
SLAFDATITA LWGPLCTGKT IDLLPEQDEL EVLAQRLSSE DYSVLKITPA HMEALSQLVA
PDQIGSSKAF VIGGEALLQQ HVAFWQTNAP NLRLINEYGP TETVVGCVIY QAQAAPSEWA
AVPIGRPIAN TQLYVLDPAG LPVPIGVPGE LYIAGLGVGR GYHGRPELTA ERFVRLEQLA
GVQAELARCQ QPQPAFERLY RSGDLVRYLP DGNLEYLGRI DQQVKLHGFR IELGEIEATL
ASHPTVHAAV AMIREDRPGH KRLVAYVVAE PTANQDTSIV LTHVAQQLPH YMLPSVVIWL
DSLPLTPNGK VDRQALPAPE INQTALDSAQ TTPLDQYEAQ LMAIWQRVLG LKAVDRHANF
FSLGGDSILV MQVVGIARQH GLILTPRLLF QNQTIASLAQ AIRQQTQAKP ALDPLSLQGI
VPLSPMQHWL FERQLAQPAH VNQSIVLKLQ TGLATEQIQA ALDQLVRLHP SLRLIFTQTA
AWQQRYEPAA SVPLRELQQP TLSQQQVCDA ELQASFDLAQ VPLLRASLWR GIDHDQLLLV
AHHSIIDGVS WRIVLEDLAL LLNQQAVPAA TTPFSEWAEY QVQQAQTPQL LSQLAYWRST
IEAITPIPQL AQAGLVGEAQ RFQTKLNPEL TEQLLHHAPE RSRTSVAELL ITGLAIAFQR
WSNLQQLVLD IESHGRESLD PEHDFSRSLG WFTSLYPVRL DFPTTNEPNQ WIKQIKESLR
AVPQAGAGYG MLRYLHADPA IRASLVPTHA PAIAFNYLGQ LDNQQTLAPF QGLNLEFASQ
TLAPTNQRSH ALELNCYSTD GCLVFDWECH QTARAAVEHL AEQYQIALAE LLQVPTTTAS
LAPSDFPAAR VKANDLDRLL ARLKAKGQ
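
The protein sequence corresponds to the gene sequence translated under translation table 11. The sketch below shows one way to check this on short excerpts. It assumes the standard codon-to-amino-acid assignments (which table 11 shares with the standard code for internal codons) and ignores the alternative start codons that table 11 permits, since this gene begins with ATG. The `translate` helper and the constant names are hypothetical, written for illustration only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGACTATGA ATGAGCGCGA AGTGTTTGCC TTCCCGATGT CGTTTGCCCA GCAACGGCTC"
last_line = "GCTCGTTTGA AAGCAAAGGG GCAATAG"

print(translate(first_line))
print(translate(last_line))
```

Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own for a spot check against the matching stretch of the protein sequence.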