Gene Haur_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2106 
Symbol 
ID5733994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2633481 
End bp2639264 
Gene Length5784 bp 
Protein Length1927 aa 
Translation table11 
GC content52% 
IMG OID641279247 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544874 
Protein GI159898627 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAAC ATGAAGTTGA GCATTTTCGG CTATCGCCGC AACAAACCCA TACATGGTTG 
GTTCAACCAC AAAGCCAGCA ACCCTTGGGC ACATGGCTGC TGGTTGAGTT AACCACTCCG
CTTCGTTATG AACGTTGGCA AGCTGGCTTG AATGTTGTGA TCGAGCGCCA TGAGGCGTTG
CGTACCCGTT TCGAGCAGAT CGCTGGGCTA AAGCTACCAG CCCAAGTGCT GCACAACCAA
ACAGTGGTCT TGCATCAGCA GCAAATAGCC GAATCGCATC AGATTGCTGA GCTTGCAGCG
CCAGCTGATG CTGGTTTGAT GCAGATTACG TTATTTGAGC ATGGCCAACA GCAATGGCTT
GGGTTGTGGC TAGCGGCCTT GGTTGGCGAT GCCACTAGTG CCCGATTGCT GCTCGAAGAA
TTGACCCAAG CCGCGTTGGC TCCGCATGAA TTGAGCGCAA GCGATGAGCT GATGCAGTAT
ATTGACGCTG CTGAATGGCA AAATGGCTTG CTCGAAGCAG CCGAAAGCGC CGCTGAACGG
GCATTTTGGC AAACGCAGGC GATTAAGCAA GCACCGCATG ATCTGCGGGG CTTTGCACGC
TTGACCCAAA CCCAGCCCAC TCGGATCAAG CTTAACCTTC CTGCAAGCAG CAGTATCGCG
ATTAATGCTT GGTTTACTCA ACATAATGTT GATTTAGCGA GTACGGTGCT CAGCCTTTGG
CGTTGGTTGC TCAGCCGCAG CAATTATGGG CAAACGCCAG CGTTGGCGCT GGCCTGCGAT
GGCCGTTCGT ATGCAGAGTT GGCCAATGCC CAAGGCTTGT TTGAGCGCTA TCTGCCATTG
TTACCAAACG AACTTGCTGC CGATCAGCCA ATCGCCGAGG CTATAACCAC GCTAGCCCAA
CAGCTAGCCG ATTTAGCGCA GTTCCAAGAG TATTTCAGTT GGCAGCAACT TGCGTTGGAT
CAGCCGTTAG CGTTGGCATT TGCCCATTAT CGTTGGGAAA CAGCGGCGCA CTATCAGCTT
GAACACCTGA CCAGCCATAC TGATCTGTTT CGCTGTAAAT TAAGCCTGAT CGAGCAAGCT
ACAAGCTGGC AATTGACCTT AGATTACGAT GCAACTAGCA TGCGCTCTGA GGTTGCCGAG
GCCTTGGCTG AGAGCTTAAT CACAATGCTG GTTTGGCTTG GGCAACAATC CAACCCGACC
TTTGGGCAAC TGCCAATCAT TGGGAGCAAT ACCCAAACAT TATTGACTAA GCAGGTCAAT
GCAACCGATC GGCCATTTGC TGCAACGCCA ATTCACGATC TGATTGATCA GCAGGCACTA
CACAATCCGC AAGCAATTGC TGTGCAATTT GGTGCAGAGC AACTGAGTTA TGCCGAGTTG
GCTCAGCAAG CCAACCAACT GGCCCAACAA TTAATCCAAC ACGGTATTCA ACCCGAGCAG
CGGGTTGGCT TGTATCTTGA GCGCTCGCCG CTGATGGTCG TGGCCTTGTT GGCGTGTCTC
AAGGCTGGCG CGGCCTATGT GCCCTTAGAG CCAGAGTATC CCGCCGAGCG GATTCAGTAT
ATTCTTGCTG ATGCGGCGAT TCAGTTGGTG TTGAGCCAAA CCAGCCTCAT GCCTAGTTTG
CCGTGTAGCG TTGCCCAATT GGCGGTCGAT CAGTTGCAAT TTGATCAAGC GAGTGCCGCG
CCACGTTTGA ACTATCAGCC TGCGCAATTG GCCTATCTGC TGTATACCTC TGGCTCGACC
GGCCAGCCCA AGGGTGTGAT GGTCAGCCAC GCTGGTTTGA GCAACTATGT GCAATGGGCG
ATCACGGCCT ACGATTTGGC GGCTGGTACA GGTTCGTTGG TGCATTCGCC ATTAGCCTTC
GATTTGACCG TAACCAGTTT GCTTGTGCCC TTGTGTGCTG GCCAAACCGT GCGTTTATTG
CCAAGCAATG CTGGGGTTGA AACGCTAGCC CAAGCACTGC GAGCCAGCAC TGATCTGAGT
TTGCTCAAAC TGACACCAGC GCATTTGGCG GTGCTGAATC AATTGATCAC TAGTGCTGAT
TTGGCTCAAC GCAGTAGGGC CTTGGTGATT GGCGGTGAGG CGCTTGATGC AACTACGTTG
GCTCCATGGC GCACCCACGC TCCTGAAACC CGCTTGTTCA ACGAATATGG CCCAACTGAA
ACAGTGGTTG GCTGTTCGAT CTACCAAACC CAAACCACTG ATTCGGCTGC TGGCGCGGTT
TCGATTGGTT TGCCAATTGC CAATATGCGT TTGTATGTGC TTGATGAGCG CTTGCAACCT
GTGCCATTTG GGGTTGTTGG TGAGCTGTAT ATTGGTGGAG TTGGGGTTGC CCGCGGTTAT
AATCAGCGCC CTGATCTGAC CGCTGCCCAG TTTGTACCTG ATAACCTGAG TGGAATCGCT
GGCGCACGGC TGTATCGCAC TGGCGATTTG GCGTGTTGGG CCTGGGATGG AACGCTGGAA
TATCTTGGGC GGCGTGATAC GCAAATCAAA TTGCGTGGCT ATCGAATTGA GCTGGGCGAG
ATTGAGGCAG TGCTGCAACG CTTGCCAATG GTCGCTTCAG CACTGGTCTT GCTGCGTGGC
ACAGGCGACG ATCAACGCTT GGTCGCCTAT CTCCAAGCCA CACCCGATGC CGACTCCACG
CAATTGAGTG AACAAGTGGT GTTGAAATAT GCCCAACAAT TCCTGCCACA GTACATGTTA
CCAAGCAACG TTGTGTTGGT TGAGCAATGG CCGTTGACCG CGAATGGCAA AATTGATCGG
GCGGCCTTGC CCGAACCAAC CGCTATAAAC AATTATGTTG CCCCAACGAC CCCTGAAGAA
GAAATTTTGG CAGCCATTTG GGAACAGGTG CTTGAGCACC CAATGATTGG GATTGATGAT
AATTTTTTTG CATTAGGCGG CAATTCAATT CGCAGCATTC AGGTGGTGGC CCAAGCCAAA
CAGCGCGGCT TAAATCTGAG TGTTGAAATG CTGTTCAATC AGCCGACGAT TCGTAGTTTG
GTTCAAACTA TGGTCTGCTC TACAGAAAAT CAAATAATCG AATACACACC CTTCAGTTTG
ATTAGCCCTG CTGATCATGC CTTGCTCCCA AATACTATTG TTGATGCCTT CCCGATTGCC
AAGTTGCAGG GTGGCATGAT TTTCCACAAC CAATTCAACC CTGAACAAGC GCTGTACCAC
GATATTTTTA GCTATCGGAT GCGGGTCGTG CTCGATTTGG CGTTGTTGCA ACTGATCGTC
GATGATTTAG TGGTGCGGCA TCCAGCGCTA CGCACTAGTT TTGATCTGAC CAGCGCCAGC
GAGCCGTTGC AAGTGGTGCA TGCCCAAGGC GCAAACCTGT TGAATATTAT CGATCTGCGC
AACCAGCCTG TTGAACAGCA CGATCAATTA ATTGAAGCTT GGATCGCCGC CGAAAAGCAG
CGCGGTTTTG AGCCAAGTAG CCTGCCGTTA TTGCGGTTCC AAGTGCATGT GCGGGCTGAT
GATGAATTGC AATTTTCGCT GAGCTTTCAC CATGCGGTGA TCGATGGCTG GAGCGATGCG
ATAATGCTGA CTGAGCTGTT TAGCGATTAT GCGCGGCGCT TGCAAGGCCA AACCAGTAGC
CTTGTTGCGC CTCAAATTGG CTATCACGAA TTTGTACGGC TAGAACAAGC AGCAATTCAG
AATCCTGCGA CCCAGCAATT TTGGGCTGAC CATTTGGCCC AAGCCAGCCC GATGCGCTTG
CCGCGCTGGC CGAATGTGCC GCGTTCAAAC ACCAGCCAAT CACAACCAGT TGCAATTAGT
GCTGAGCTTT CGCAAGCACT TAAAGCCTTG GCTCGCCAGC TTGCCGTGCC AATTAAAGAT
GTGCTGTTGG CAGCGCATTT ACGGGTGATT AGCATCCTGA CTGGTCAGTT CGATGTAGTG
ACCAGCATGG TTTCGAGTGG GCGGCCTGAA ACCCTTGATG GTGAACGGGT TTTGGGCTTG
TTTATCAATA GTATTCCTCT ACGAATGCAG CTGAATCAGC CAACGTGGCG TGAACTTATT
ATGCAGACCT TTGCTGCTGA ACGTGCCAGT CTGGAGCATC GGCGCTACCC AACTGCCGAG
TTGCAACGCC ACAACGGCGG TTTGGCTTGG TCGGAGAGTT TGTTCTACTT CACCCACTAC
CATATCTTCC AAGCCTTGCA AAACATCAGT GAGCTGGAGT TGCTTGATGT GCTGCCCTAC
GAAGTTTCGA GTTTTCCATT AGTTGCCAAC TTCCGCATCG ATCCCTTTAC GAATGACATT
AACTTGAGTT TGACCTGTGA TGGGCGAATT TTGACCAATG CCCAAATCGA AGCGATTGCA
GGCTATTATC AAGTCTGTCT GACCGCGATG GTTGCCGACC CTGCGGCAGA TTATCGCGCT
ATGCCATTGT TGAGTGATAC TGAGCAACAC CTATTGCTTG GATTTAATCG CACCGAAGTT
GCACAATCGT CGCCTGATCT TGTTGGTTGG CTGGCCGAAG TGGCTCAACA GCAGCCAACT
GCCCAAGCCA TCCAAGCCTA TGATGGGGCG TTGAGCTATG CTGAGCTTGA GCAACGCGCA
ACGGCTTTGG CGGGCTATTT ACAAACGCAG GGGATTGGTG CAGAAACCCG GGTTGGTATC
AGCCTTGAGC ATTCAACCAG CTTGATTGTG GCGATTTTGG CGGTGCTCAA AACAGGGGCT
GCTTATGTGC CACTTGACCC CAACTACCCA CGTGAGCGGC TTGAATTGAT GGCGAGCGAT
GCTGAATTGA AGCTCTTGAT TTGCCAACAG CCAGACATCT GGCAAAACCT ACCTGCAAAC
TCTGCCTGTT TAGGCCTTGC TGATTTAGAT TCTGCCCAAG CGCCATTTGT GCCAGTCACG
ATTCATCCGG CGCAGGCCGC CTATCTGATC TATACCTCTG GTTCGACAGG CCGCCCCAAG
GGTGTGGTGG TCAGTCATGC CAATCTGCAT AGCTCCACGT TTGCCCGAAC GCTTGCCTAT
CGCGAGCCGC TGACGAGCTT TTTATTGCTT TCATCGTATG CCTTCGATAG CTCGATCGCT
GGAATTTTCT GGACACTGAG CCAAGCTGGC TGTTTGGTAC TGCCCGATCA AGCGCAACGC
CACGATGTTC TAGCGCTAGC CAGCATGGTC GAACATCATC AGATTAGCCA TACCTTGGCA
ATTCCGTCGT TGTACGCGGT ATTGTTGGAA CAAGCCGAAT TAAGCCAATT AGCTAGTTTG
CGCGTGGTCG TGGTCGCGGG CGAGGCCTGT ACCACCAGCT TGGTCAATCG CCATTATCAA
CAACTGTCAA CGTGTGCCCT ATACAACGAA TATGGCCCAA CCGAGGCGAC GGTTTGGGCA
AGCGTTGCCA AACTAGTACC GCAACAACCG ATCTCAATTG GCGGCCCGAT TGCCACGATC
CAAGCCTATG TAGTTGATCC AAGCTTGCAG CCTGTGCCAA TTGGAGTTGC TGGCGAATTG
TTGATTGCTG GTGCGGGTAT TAGTCGCGGC TATTGGCAAC AACCAGCGCT GACCGCCGAG
CGGTTTATGC CCGACCCATG GGCCGAACAG CCAGGCCAGC GCTTGTATCG CACTGGCGAT
TTAGCCCGTT GGTTGCCCGA TGGTCAGCTT GAATTCTTAG GTCGCATCGA TCAACAGGTC
AAAATTCGCG GTTTTCGGAT TGAGCTTGAA GAAATTGCCC AACTGCTGCG CCAACACCCC
GCCTTACGCG AGGCTGTGGT TACCGCTCAG CCCGATCAGC ATGGTCAATT ACGCTTGGTG
GCCTATATCG AGCCACGCAA TTAA
 
Protein sequence
MQQHEVEHFR LSPQQTHTWL VQPQSQQPLG TWLLVELTTP LRYERWQAGL NVVIERHEAL 
RTRFEQIAGL KLPAQVLHNQ TVVLHQQQIA ESHQIAELAA PADAGLMQIT LFEHGQQQWL
GLWLAALVGD ATSARLLLEE LTQAALAPHE LSASDELMQY IDAAEWQNGL LEAAESAAER
AFWQTQAIKQ APHDLRGFAR LTQTQPTRIK LNLPASSSIA INAWFTQHNV DLASTVLSLW
RWLLSRSNYG QTPALALACD GRSYAELANA QGLFERYLPL LPNELAADQP IAEAITTLAQ
QLADLAQFQE YFSWQQLALD QPLALAFAHY RWETAAHYQL EHLTSHTDLF RCKLSLIEQA
TSWQLTLDYD ATSMRSEVAE ALAESLITML VWLGQQSNPT FGQLPIIGSN TQTLLTKQVN
ATDRPFAATP IHDLIDQQAL HNPQAIAVQF GAEQLSYAEL AQQANQLAQQ LIQHGIQPEQ
RVGLYLERSP LMVVALLACL KAGAAYVPLE PEYPAERIQY ILADAAIQLV LSQTSLMPSL
PCSVAQLAVD QLQFDQASAA PRLNYQPAQL AYLLYTSGST GQPKGVMVSH AGLSNYVQWA
ITAYDLAAGT GSLVHSPLAF DLTVTSLLVP LCAGQTVRLL PSNAGVETLA QALRASTDLS
LLKLTPAHLA VLNQLITSAD LAQRSRALVI GGEALDATTL APWRTHAPET RLFNEYGPTE
TVVGCSIYQT QTTDSAAGAV SIGLPIANMR LYVLDERLQP VPFGVVGELY IGGVGVARGY
NQRPDLTAAQ FVPDNLSGIA GARLYRTGDL ACWAWDGTLE YLGRRDTQIK LRGYRIELGE
IEAVLQRLPM VASALVLLRG TGDDQRLVAY LQATPDADST QLSEQVVLKY AQQFLPQYML
PSNVVLVEQW PLTANGKIDR AALPEPTAIN NYVAPTTPEE EILAAIWEQV LEHPMIGIDD
NFFALGGNSI RSIQVVAQAK QRGLNLSVEM LFNQPTIRSL VQTMVCSTEN QIIEYTPFSL
ISPADHALLP NTIVDAFPIA KLQGGMIFHN QFNPEQALYH DIFSYRMRVV LDLALLQLIV
DDLVVRHPAL RTSFDLTSAS EPLQVVHAQG ANLLNIIDLR NQPVEQHDQL IEAWIAAEKQ
RGFEPSSLPL LRFQVHVRAD DELQFSLSFH HAVIDGWSDA IMLTELFSDY ARRLQGQTSS
LVAPQIGYHE FVRLEQAAIQ NPATQQFWAD HLAQASPMRL PRWPNVPRSN TSQSQPVAIS
AELSQALKAL ARQLAVPIKD VLLAAHLRVI SILTGQFDVV TSMVSSGRPE TLDGERVLGL
FINSIPLRMQ LNQPTWRELI MQTFAAERAS LEHRRYPTAE LQRHNGGLAW SESLFYFTHY
HIFQALQNIS ELELLDVLPY EVSSFPLVAN FRIDPFTNDI NLSLTCDGRI LTNAQIEAIA
GYYQVCLTAM VADPAADYRA MPLLSDTEQH LLLGFNRTEV AQSSPDLVGW LAEVAQQQPT
AQAIQAYDGA LSYAELEQRA TALAGYLQTQ GIGAETRVGI SLEHSTSLIV AILAVLKTGA
AYVPLDPNYP RERLELMASD AELKLLICQQ PDIWQNLPAN SACLGLADLD SAQAPFVPVT
IHPAQAAYLI YTSGSTGRPK GVVVSHANLH SSTFARTLAY REPLTSFLLL SSYAFDSSIA
GIFWTLSQAG CLVLPDQAQR HDVLALASMV EHHQISHTLA IPSLYAVLLE QAELSQLASL
RVVVVAGEAC TTSLVNRHYQ QLSTCALYNE YGPTEATVWA SVAKLVPQQP ISIGGPIATI
QAYVVDPSLQ PVPIGVAGEL LIAGAGISRG YWQQPALTAE RFMPDPWAEQ PGQRLYRTGD
LARWLPDGQL EFLGRIDQQV KIRGFRIELE EIAQLLRQHP ALREAVVTAQ PDQHGQLRLV
AYIEPRN