Gene Haur_2092 details


Gene Information

Locus tag: Haur_2092
Symbol:
ID: 5733980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2613140
End bp: 2618125
Gene Length: 4986 bp
Protein Length: 1661 aa
Translation table: 11
GC content: 51%
IMG OID: 641279233
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544860
Protein GI: 159898613
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
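
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2613140
end_bp = 2618125

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 4986
print(protein_length)  # 1661
```

Both results match the Gene Length (4986 bp) and Protein Length (1661 aa) fields.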


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAACTG TGTTAGAGCC AATAACGCCT GATGCGCTCA GCGAAACGTT GCGCCAGTAT 
TTGCAAGCGT ATCTGCCGGA TTATATGCTG CCAGCGGCGT TTGTGCCGCT TGAACAGATT
CCGCGCTTGC CGAATGGCAA AATTGACCGT GCGGCCTTGC CAATGGTCGA TTTTGCAGCC
CAGCATGAGC AGCAAACCCA AACGGCCCCA CGTAACCCGC TTGAGCAGCA GCTCGCGGCA
ATTTGGCAGC AAACCTTGCA AGTACCAAGC GTGGGCATTC ACGATAACTT TTTTCAATTG
GGCGGCGATT CAATTTTGAG CATTCAGGTG ATTGCTCGTG CTAATCGAGC TGGCATACGG
CTGACTACGC GCCAATTGTT TGAGCAGCCA ACGATTGCCC AACTAGCAAG CTTGGCGCAA
ACCACAACAA TAGATGTTGC TCATAGCGAA TTGTCTGCTG GACAGATCGT TCCCCTAACA
CCAATTCAGC GCTGGTTGCT CGCTGATCCT ACTGATCCCA GCCAGTTTAA TCAGGCACTC
TTTTTGCAAT TTACTCAGGC TATTGATTCG AATTTGGTGG CATCGGCGGT TGAGCACGTT
GCACAGCTAC ATGTCAGTTT GCGTTTGCGC TATCGCCGTA CTGCTGATGG TTGGCAACAA
TTTGTGGCTG CTGCTGATGC ACCACTGGTT GAATTTGAGC AAATTAATGC TCAAAACCTC
AATCCAACGG AGCTGGCTGA GCTTTTCGAG GCTACAACTG AACGACTGCA ACGGCCCTTT
GATTTGGCGA AGGCTGCACT GTGGCGAATT GCCTACATCG CCATGCCTGA TGCTCAGCCC
GCCCGCTTGC TGTTGGTCAT GCATCACTTG GTGGTTGATG GGGTTTCGTG GCGAATTATC
ATTCAAGATC TCGCCCATGC CTTGCAAAAC CAGCCATTAA CCAAACCAGC TGTCGGTTTT
GCCCAATGGG CTATTGCTTT AGAACGCTAT GCCCACCGCA CCGAATTGCA GCAGCAACGT
GCATATTGGC TAGCCCAAAC CAGCACTGAT CCCTTGCCAG TCGATGATAT AACTGGACAT
AATGATTATG CGAGTGTTGC GACAATCACT AAGCAATTGA GCCAGGCCCA TACCACAGCC
TTAATTCATC AAGCATCCCA GGCCTATCAA ACCCAGATCA ACGAACTATT GCTAGCAGCC
TTGACCCAAA CCATCACCGC TTGGAGCGGC CACGCCGATG TGGTGCTGCA ACTCGAAGGC
CATGGTCGCG AGGAGCTGGA TCAGCCGCTC GATCTTTCGC AAACTGTTGG GTGGTTTACT
ACGCTGTTTC CAATAAAGTT AAGTCTGCCA CAAAAACTAG GCTCCAAAAA CCTGATCAAA
CAGATTAAAG AACAGATACG TGCAGTTCCC CAACGCGGGT TTGGCTATGG CTTGTTACGC
TCAGCTGATG CGGCGTTGCA AGCTATGCCA ACCCCAGCGA TCAGCTTTAA CTATTTTGGT
CAACTTGATC AAACGCTCCA GTCAAGCAAA TTATTCAGTG CTGCGCCCGA ATCGACCGGA
AGCGCCGTGT TGCCGCAGCG GCGGCGTGAA CAACTGCTAG CGATCAACTG CCAAGTTTTG
GCGGGCCAAT TACAGATCGA ATGGTCGTAT AGCCAACATC TGCATTCAGC AGCAACAATC
GAGCGCTTGG CTGAAGCATA TTGTGCTGAT TTAGTTGGGT TGATTGAGCA TTGTTGTCAA
CAAACCCAAC CAAGCTTTAC GCCTGCCGAT TTTCCTTTGG CGCAAATTAC CCAAGCCCAA
CTTGACCAGA TTGAGCAAAT ATATCCGCCG TTTGAGCAAT TGTACCCTCT TTCATCGCTG
CAACAGGGCA TTTTGTTCCA TCGCTTGTAT GCGCCGGACG CTGGTGATTA TATTACCCAA
ATGCAGTTTG AAATCACGGG TCAGCTCAAT CATGCGGCAT TTAGTGCTGC TTGGAATCGA
ACAATTGGCC ATTATAGAAT GCTGCGAACC GCGTTTGTTT GGCAAGATTT AGCTGAGCCA
CTGCAATTGG TGCTGCGCCA AGCTGTAATC ACGATCGATT TTCAACAGTT ACCCATGAAT
AGCCTTGAAC AAGAACAGGT GCTTGAAGCC TATTTGCAGG CTGATCGCAC ACGCGGCTTT
GAGCCAACCC AAGCGCCATT GATGCGTGTT GCTTTGTTCG AGCGTGCACC GCAACGCTAT
TGCTGTATCT GGACGAATCA TCATTTGATC ATCGATGGCT GGAGCTTACC GCTGATTCTT
GATAGCTTGT TTCGCTATTA TCAGGCTGAA ATCAATCAGC AACCACTGGA GCTTGCCCCA
GAAATTCCTT ATCAACGCTA TATTCAATGG CTGGCTCAAC ATAATGATCA GCAAGCTACA
GCATTTTGGC GTGAATTATT GCGCGGTTTT ACTGCTCCAA CAAGCTTGGC GCTTGAACGG
TTTGGCTCGA CCCACGCTGA ACGACACTAT AGTGCGAGTT GGCTCCAGCT TGATTCTGCT
ATAACTCAGC AGCTTCAACA GTTTGCCCGC GATCATGGCC TAACCGTGAA TAGCCTGTTG
CAGGCGGCGT GGGCCTTGGT TTTATCACGC TACAGTCATC AAACTGATAT CGTGTTTGGT
ACAACGACGG CTGGCCGCCC AACCGATTTG GCCGGAGTTG AGCAGATTGT CGGGATGTTC
GTGAATACGC TGCCCACGAG AGTTAAGTGG GATTTGCAGC AGCCAGTGTT GGATTGGCTA
CAAGCCTTGC AGGCTCAAGA GAGTGCTGTG CGCAGTTACG AGGCCAGTTC GCTCATTGAA
ATTCAGGCAT GCAGCGAGTT GCCACGCAAT AGCCCGCTGT TTGAAAGTAT CTTGGTCTTT
GAAAACTATC CGGTGAGCAG TAGCGATTTA ACTGGTTTGG GCGATTTAGA GTTACGTTTG
GTTCCTTCGC GCGAGCAAAC CAACTATCCT TTGACCTTAG TTGCTGTGCC AGGTGATGGG
TTGGCCTTCA AGTTGATGTA TCAACAAGGC TACATCGACC AACTTACTAG CCAGCGCATG
CTCGATTATC TCCAACAAGG CTTAGCCGCA ATGCTAGCGC AGCCCAAGGC AAGGCTTGGC
CAGCTCAATA TTGGGCATCC CAGCGAAATC CAAGCCTTGG CTGATTGGAA CGCAACCGCA
GCACCGCGCC AAACCAGCTC GTTGCTTGAA TGTTTTTACC AGCAGGTCGC AGCTCAGCCA
ACAAGCATCG CCGTCGCATG GCGTGAACAA CGCTGGAGTT ACTTCGATTT AGCACAGGCG
AGCCAAGCAA TTGCCGGCTA TTTGCGCGAT CAAGGGGTGC AACGCCAGCA AATTATCGGC
CTACGAGCTG AGCGCAACCC GCAGTTTGTC GCAGCGTTGT TGGCGATCTT GCAATTGGGC
GCGGTGTATT TGCCGATTGA TCCTCAGCAT CCAGTGCAGC GCCAACAGCA ACTTGCTCAG
CATGTCGATT GGTTATTGAC TGATGCCTTG GCTGAAGCGC AGCCTCAGCA ACTCGATTTG
GCTCAGGCGT TGGGCTACGA TCAACCTGCA TCCGACTTTG TGCAACTCCA TGATCGAGAT
TTAGCCTATG TGCTGTTTAC CTCTGGCTCG ACCGGCACAC CCAAGGGCGT GATGATCGAC
CATGCAGGGA TGCTGAATCA TATTGACGTA ATGATCGAGC GTTTGGCGCT AACCCAAACC
GATTGTATTG CCCAAAGCGC TGCCCAATCG TTTGATATTT CGGTTTGGCA GTTGCTGACA
GCGCTCGTGG TTGGCGCTCG GATGCAGATC ATTGATGATC AAACGATGCG CGATCCGCAG
GCCTTGTTAG CTAAATTGGC AGCGGCTAAC GTTTCAATCT TCGAGCCAGT GCCCAGCCTG
ATTCAAGCCC TACTCGAAAC GATTGCAAGC CTTGAGCAAA CCCCAAGTTT GGCTGCTTTG
CGTTGGGTGC TGCCAACTGG CGAACATTTG CCGCGTGAGC TAGCCCAACA ATGGTTTGCC
CACTATCCTT ATATTCCCTT GCTGAATGCC TATGGTCCGG CTGAATGCGC CGATGATGTG
ACGCTTTGGC CGATTGCCAG TGCTGTTGAG CTACCTCAAA ACGCCATTCC AATTGGCCGA
CCAGTAGCCA ATGTGCGGGC TTATGTACTT GATGCCAGTT TGCGGCCAGT ACCGATCGGC
GTGGCAGGCG AGTTGTATAT CGCTGGAATT GCGGTTGGTT GGGGTTATTT GGCCGATCCC
CAACGCACCG CTAGCCTGTT TTTGCCCGAT CCTTGGGGCG AACCAGGGGC GCGAATGTAT
CGCACTGGCG ATTTAGCGCG TTACAACCAA GCAGGTGTGT TGAGCTTCTT GGGGCGTAGC
GATCAGCAAG TCAAAATTCG CGGCTTCCGA ATTGAGCTAG GCGAGATCGA AGCCTGTTTA
TTGCAGCATC CGGCGCTGCA TTCGGTCGCA GTTGCTGTGG TTGGCGTAGC TGAGCAAGCA
CGTTTGATCG CCTATCTGGT GGCGAAAGCT AAACCAGTCT CCGATCAATT ACTACGTGAT
TTTGTCCAAG CGCGGTTGCC GCATTATCTG CAACCAAGTG GCTATTGTTG GTTGAGCCAA
TTGCCGCTCA ATGCCAATGG TAAATTAGAC CGTCAGCGCT TGCCAATTCC CCAGCTGCAA
ACCGCTGAAC GACTGATTAT CGCTCCCCAA AACGCTGATC AAGCCAAATT GGCCGAGCTT
TGGGCGGCAA TCTTGCAACG TGAGCAAGTT GGAATTAATC AGAATTTCTT TGAACTTGGC
GGTCATTCAT TATTGGCAAC CCGCTTGGTC AGCCAAATTC GCCAGTATTG GCAACTCGAT
TTGCCAATTC GGAGCGTATT TGAGGCTCCG ACAATCGAAC AACTGGCTGA TGTGCTTGAT
CTCCTACGCT GGGCGCAACA GGCTAATCAA GCTCCAGCTC AAGCCCGCGA ACAAGGAGCA
ATTTAA
 
Protein sequence
MPTVLEPITP DALSETLRQY LQAYLPDYML PAAFVPLEQI PRLPNGKIDR AALPMVDFAA 
QHEQQTQTAP RNPLEQQLAA IWQQTLQVPS VGIHDNFFQL GGDSILSIQV IARANRAGIR
LTTRQLFEQP TIAQLASLAQ TTTIDVAHSE LSAGQIVPLT PIQRWLLADP TDPSQFNQAL
FLQFTQAIDS NLVASAVEHV AQLHVSLRLR YRRTADGWQQ FVAAADAPLV EFEQINAQNL
NPTELAELFE ATTERLQRPF DLAKAALWRI AYIAMPDAQP ARLLLVMHHL VVDGVSWRII
IQDLAHALQN QPLTKPAVGF AQWAIALERY AHRTELQQQR AYWLAQTSTD PLPVDDITGH
NDYASVATIT KQLSQAHTTA LIHQASQAYQ TQINELLLAA LTQTITAWSG HADVVLQLEG
HGREELDQPL DLSQTVGWFT TLFPIKLSLP QKLGSKNLIK QIKEQIRAVP QRGFGYGLLR
SADAALQAMP TPAISFNYFG QLDQTLQSSK LFSAAPESTG SAVLPQRRRE QLLAINCQVL
AGQLQIEWSY SQHLHSAATI ERLAEAYCAD LVGLIEHCCQ QTQPSFTPAD FPLAQITQAQ
LDQIEQIYPP FEQLYPLSSL QQGILFHRLY APDAGDYITQ MQFEITGQLN HAAFSAAWNR
TIGHYRMLRT AFVWQDLAEP LQLVLRQAVI TIDFQQLPMN SLEQEQVLEA YLQADRTRGF
EPTQAPLMRV ALFERAPQRY CCIWTNHHLI IDGWSLPLIL DSLFRYYQAE INQQPLELAP
EIPYQRYIQW LAQHNDQQAT AFWRELLRGF TAPTSLALER FGSTHAERHY SASWLQLDSA
ITQQLQQFAR DHGLTVNSLL QAAWALVLSR YSHQTDIVFG TTTAGRPTDL AGVEQIVGMF
VNTLPTRVKW DLQQPVLDWL QALQAQESAV RSYEASSLIE IQACSELPRN SPLFESILVF
ENYPVSSSDL TGLGDLELRL VPSREQTNYP LTLVAVPGDG LAFKLMYQQG YIDQLTSQRM
LDYLQQGLAA MLAQPKARLG QLNIGHPSEI QALADWNATA APRQTSSLLE CFYQQVAAQP
TSIAVAWREQ RWSYFDLAQA SQAIAGYLRD QGVQRQQIIG LRAERNPQFV AALLAILQLG
AVYLPIDPQH PVQRQQQLAQ HVDWLLTDAL AEAQPQQLDL AQALGYDQPA SDFVQLHDRD
LAYVLFTSGS TGTPKGVMID HAGMLNHIDV MIERLALTQT DCIAQSAAQS FDISVWQLLT
ALVVGARMQI IDDQTMRDPQ ALLAKLAAAN VSIFEPVPSL IQALLETIAS LEQTPSLAAL
RWVLPTGEHL PRELAQQWFA HYPYIPLLNA YGPAECADDV TLWPIASAVE LPQNAIPIGR
PVANVRAYVL DASLRPVPIG VAGELYIAGI AVGWGYLADP QRTASLFLPD PWGEPGARMY
RTGDLARYNQ AGVLSFLGRS DQQVKIRGFR IELGEIEACL LQHPALHSVA VAVVGVAEQA
RLIAYLVAKA KPVSDQLLRD FVQARLPHYL QPSGYCWLSQ LPLNANGKLD RQRLPIPQLQ
TAERLIIAPQ NADQAKLAEL WAAILQREQV GINQNFFELG GHSLLATRLV SQIRQYWQLD
LPIRSVFEAP TIEQLADVLD LLRWAQQANQ APAQAREQGA I
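
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The following is a minimal sketch of that relationship, not the tool the database itself uses. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering. Table 11 shares these
# amino acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence in this record.
first_line = "ATGCCAACTG TGTTAGAGCC AATAACGCCT GATGCGCTCA GCGAAACGTT GCGCCAGTAT"
last_line = "ATTTAA"

print(translate(first_line))  # MPTVLEPITPDALSETLRQY
print(translate(last_line))   # I
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final residue, with TAA acting as the stop codon. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field.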