Gene Haur_1574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1574 
Symbol 
ID5733461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1826304 
End bp1831448 
Gene Length5145 bp 
Protein Length1714 aa 
Translation table11 
GC content50% 
IMG OID641278713 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544345 
Protein GI159898098 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATTC AAGGAGCGAC CCAGAGCAAT CCAACCGATA CAAGCCCGGA TGACGATCAA 
CCCTGGTGCA TCCATGAGCT GGTTGCCCAA CAATCACGCT ATTGGGCTGA TGCTGTGGCG
GTTATCCACG CTGACACGCA ATTAACCTAT GCCCAATTAG ACCAGCGGGC TAACCAAGTT
GCCCACGCCC TGCTTGAGCA GGGGATTAAG CCCGATCACC TCGTTGCTCT CTGCCTTGAG
CGCTCAATCG ACATGCTGAT TATCGTGTTT GGCATTCTAA AAGCCGGTGC AGCCTATTTA
CCGATAGACC CTCACTACCC GTATGAGCGT CAACGCTTTA TGATCGAACA CTCACAAGCA
CCGCTCGTAA TCACCACAGC TCCATTGGAA GGAGCGCTTG CTTCAAGGCC TGTGGAAATC
CAGCTTGAAT TATTAATGGC AATTGCTGCT CAAAAACCAA CGAGTGCCCC TAATCAACGA
GTTGATCCTG ATCAATTGGC GTATGTTATC TATACCTCAG GCTCAAGCGG TCAGCCTAAA
GGAGTGATGA TCACGCATCG AGCGTTGGTC AATCATATGC AATGGATGCA AACAACCTTT
GGCTTTAATC GCCATGATCG GTTCTTGCAA AAGACTCCAT TGAGTTTCGA TGCGTCGGTT
TGGGAATGTT ATGCCCCATT ATTGTGTGGC GGTCAGTTGA TTCTAGCCAA GCCTGATGGT
CATCACGATG CCCACTATCT GGTGGAAATG ATTCAGCGCT ATCAGATTTC GGTGCTTCAG
GTGGTTCCTT CGTTGTTGCG CATGCTCCAA ACTGAACCAC AGCTGGCCAA TTGTCGGAGC
TTACGCTACT TGTTTATTGG TGGTGAACCA CTGCATAGTG AGCTTGTGGC CCAAGTACGT
CGCGTTTTGC CAGCTAGGAT GATTAATTTG TATGGGCCAA CCGAAGCCAC CATTGATGCA
ACATGGGCTG AATGCAACCA AACGACCGAA TATCCAACCA TCCCAATTGG CTACCCGATT
GATAATCTTA CGACATGGGT GCTCGATGCC CAGATGCAGC CAGTTGCAGT TGGTAGATCT
GGCGAACTCT ATATTGGTGG AATGGGCTTA GCCCGAGGCT ATCAACGGCA ACCAGATCGC
TCGGCTGAGC GCTTTGTGCC CGATCCATTT AGTACACAAC CTGGGACGCG GCTTTATAAA
ACTGGCGATC GCGTTCGCTT GCTTGCCAAT GGTGCGCTGC TTTTTCTCGA TCGGATTGAT
CAGCAAATTA AGCTCCGTGG GTATCGGATT GAGTTGGGCG AGATTCAGGC CGGTCTCGAA
CGCCACCCTC AGATTCGCCA ATCGGTTGTT GTAGGCCAGC TCGATCAGAC TAAGACGCTT
CAATTGGTGG CCTATGTGGT ACCAATGCCC GAGGCTAGAA TTCCAATTGA GCAATTAAGA
GCGCTTTTGA AAGCCCAACT GCCGCGTTAT ATGCTGCCAA GCGCATTTGT GATCTTGGAA
CGGTTGCCAC TGCTTGCCAA TGGCAAACTT GATCGGGCTA GCTTGCCGCT TCCAGCACCA
TCAGCCTACC AATCGTCAGT CATTGTGCCG CCACGCACGC CCCAGGAAAT GCACTTGTTG
CAACAATGGC AGGAGCTTCT TGGTTTCGAT CAGATTAGTA TTGATGATGA TTTTTTCGAT
TTAGGCGGGC ACTCGTTGTT GGCAACGCAG CTTGTGGCCC GAATGCGTGA TCAGTTCCAG
CGCGATTGGT CGGTCGCAAC AATTTTCAAC TATCCAACAA TCGAGCAGTT GGCGGCGCAA
CTGCGGTCAA CCCCTGATCA TCAGCACGTC GCAACGATTC CCGTGGCTGA TCGTCAGCAA
CGGATTCCCT TGAGCACGAT GCAGGAACGC GTTTGGTTTC TTACCCAACT TAATCCTGAA
AGCCGTGCCT ATCATTTTCA GATGACAATT CATTTTACTG GGCAATTGCA TGTGCCAATC
CTTGAGCAAG CGTGTAGTGA GATTGTACGG CGGCATGAAA TTCTGCGAAC GACCTTTCCG
ACTGAAGCTG GTCAGCCATA TCAGCAGATT CATGCGCCGT GGGCTGTAAC AATTCCTAGC
ATCGATTTGC GACAGTATCC GTTGGAGCAA GCAAGTCATT TGGCTGAGCA AGCAATTGCT
GTTGCAATGT GTGAGGCCTT TGATCTTACC CAACTGCCCT TGGTGCGCTG GAGTGTTTTT
CGGTTAGCCG ACGATCAATG GATGTTATTG CAGATTGAGC ATCATTTTAT CCACGATGGT
TGGTCGATTG CGCGACTGTT GGCTGAAATA AAGACACTCT ATACTGATTA TCTTGCTGGA
TTAGCCCCAT CGTTGCCTGC GTTGCCAATT CAATATGCTG ATTTTGCAGT TTGGCAGCGT
CAGCAATTAA ACGCTGGCTT GCTCGAACGT GATTTGCGCT ATTGGGAGGC GCAATTGGCG
CATCGTCCAG TTGTGCTTGA ACTTCCGACC GATTATCAAC GACCACCTGT GCAAAGTATG
CGTGGTTCAG CTGAGCGGAT TGCAATTCCG GCTGAGTTGG CAAATGCTGC CCGTGAATTA
AGCCGTCGGG TCGGGACAAC CTTATTTATG ACCTTGCTCA CAACATTTAG CACGTTGCTC
TACCGTTATA CCGAGCAAAC CGATATTCTG CTTGGATCAG GAATTGCCAA TCGGCGGCAA
CGTGAGCTTG AGCCATTGTT GGGGATGTTT GTTAATACGG TCGTACTCCG CACTGATCTC
CAGGGGAATC CTAGCTTCCG AGAACTTTTG CTTCGCACGC GTTCGTTAAT GCTGGAGCAG
TATGAACACC TTGATGTTTT GATTGAAAAG GTGGTTGAAC GGTTACGTTT ACCGCGCGAT
TTGAGCCGCA ATCCACTGTT TCAAGTGATG TTTAGCTTTC ACGATTCGCC AGTACCAACG
CTTGATCTGC CAATGCTTCA TGGCGAGATT CTCGAACGCA ATAACGGTTC GGCGAAGGCC
GACCTGAACG TGATTGTGAT TCCATATGCT GAACAACATG GGGCGGCGGG CCATAGTGCT
GAGCAAAAAG CGATTACGAT GATCTGGGAA TATAGCACTG ATCTGTTTAC CCAAGCGACG
ATTCAAACCA TGATTGGGCA TTTTCAAGCA CTGTTGCGGT CTGTTACCCA GAATCCTGAT
CAGCGGATTA ATCAGCTAGC CATGCTCAGT CGTGCTGAAA CTGCCCAATG CCTTGAGCAG
GCCCGTGGTC CGCTTGTTCC AACTCCAACA ACGACCCTGC ATGGTTTGTT TGCCAATTAT
GTTCGCGAGC AACCAAATGC CCTTGCAATT GTCACAGACC ATGAATCTAT CAGCTATAGC
CGCTTGAATC AGCGGGCTGA CATGCTAGCG GGTGCGCTTC GCCAAGCTGG GGTTGGGCCA
GGGATGGAGG TGGGGATTGT GAGTGAGCCC TCAATTGCAA CAATTGCTGG AATTCTGGCA
GTGCTGAAAT TGGGCGCAGC CTATGTTCCA CTTGATCCGA GCCATCCTCA ACAACGCCTC
AACTTGATCA TCAACGAAGC TCAACTCCAA GCAATCTTGG TTGAATCGCA GCTTGAACAG
TTGCTGCCGA ACACATCGGC GGCAATTATT CGGCTTGATA GCGACCATGG AGCAGTAACA
GATTACCCGA TTGTAGCTGC TCAGGCGTGT GCCTATGGCT TATTTACCTC GGGTTCTACT
GGGCAACCAA AAGGAGTAGC CTGTAGTCAT GAGGCAGTGA TCAATCTTTT GGATGCCATG
CAGCAGATGC GTCCACTTCC GCAGGGATGT CGCCATAGTT TATGGACAAG CCTCAGCTTT
GATGTTTCAG TCTATGAGAT ATTTAGTGCG CTTACCCAAG GCGGCACGCT ATACTTAATC
GATCAGACTA TGCGGCTTGA TGCCGACCAG TTTTTTGCTT GGTTGGCCAA ATATGCCATT
GAAAGTGCCT ATATTCCACC ATTTATGCTC CATGATTTAG CGCTTTGGCT GATGGCGAAT
CGGAATCGAC TCCAGCTCAA ACGACTTTTA GTTGGGGTTG AACCAATTCC TGAGCAGAAT
TTAGCGATAA TTGGGCAGCT GATCCCTGGA TTAACGATCA TTAATGGGTA TGGTCCAACG
GAAACAACAA TCTGTGCGAC GTTCTATAGC GTGCCACCGT TCAACGATTC AGCTCGGGTA
ACGCCAATCG GTCGGGCAAT TCAGCAGATG GCAGTCTATG TGCTTGATCG GGAGTTGCAG
CCGATGCCAA CCGGAGTTAT TGGCGATATC TATATTGCTG GAATTGGGTT GGCGCTAGGC
TATATTGCCA AGCCAGATCT GACGGCTGAG GTATTTTTGC CTAACCCATT AAGTGCTGAG
CCAGGGATGC GTATGTATCG GAGTGGTGAT CGTGGACGGT ACTTGGCTGA TGGTTCATTA
ATGTTTGTCG GTCGGAGTGA TCGCCAAGTC AAAATTCGAG GAATGCGGAT TGAGCTTAAT
GAGATTCGTA CATGCGTGCT GCAGCATGCT CAGGTGCATG AGGCGGTCGT TAATATTTAT
AATGATCAGC CTGATAATCC TCAAATCGTT GCGTATGTTG TTCCAACCAA GGGTCAGTTG
CTGACTGAGG CTTCGCTACG AACATATATT GGTCAGAAAT TGCCGCTCGC GATGCAGCCA
CAAGCGTTTG TGCTGCTTGA TCGATTGCCG CTTACGGCCA ATGATAAACT TGATTGGGCT
GCTTTGCCTG CGCCATTTCC TGCAACCCGA TTAAGCCCCA TGGAAGCTCC ATCGACCCCG
CTTGAGCAGA TCCTTGCTGG TATTTGGAGT GAGCTATTTG CCCAACCAGC AATCAGCATT
GATGCTAACT TTTTTGAGTT AGGCGGCCAT TCATTATTGG CAACCCGAGT TGCCTCGCGG
CTCCAAGAAA CATTGCATAA AACAATTCCA GTCAGCCTCT TTTTTCAATA TCCCACGATC
AAGCAACTGG CCCATGTTCT CGATGGCTAC ACTGCTTATG AATCAGACCA TCATCGCGCC
ATGCTGCCGG AAAGCGATAG ATCACTGCTG AGTCGCGTTC ATGAGCTTTC TGAGCAGGAG
GTTGATCAAT TACTGGCTCA ATTCCTTGAT GAATCTGTTG AATAA
 
Protein sequence
MRIQGATQSN PTDTSPDDDQ PWCIHELVAQ QSRYWADAVA VIHADTQLTY AQLDQRANQV 
AHALLEQGIK PDHLVALCLE RSIDMLIIVF GILKAGAAYL PIDPHYPYER QRFMIEHSQA
PLVITTAPLE GALASRPVEI QLELLMAIAA QKPTSAPNQR VDPDQLAYVI YTSGSSGQPK
GVMITHRALV NHMQWMQTTF GFNRHDRFLQ KTPLSFDASV WECYAPLLCG GQLILAKPDG
HHDAHYLVEM IQRYQISVLQ VVPSLLRMLQ TEPQLANCRS LRYLFIGGEP LHSELVAQVR
RVLPARMINL YGPTEATIDA TWAECNQTTE YPTIPIGYPI DNLTTWVLDA QMQPVAVGRS
GELYIGGMGL ARGYQRQPDR SAERFVPDPF STQPGTRLYK TGDRVRLLAN GALLFLDRID
QQIKLRGYRI ELGEIQAGLE RHPQIRQSVV VGQLDQTKTL QLVAYVVPMP EARIPIEQLR
ALLKAQLPRY MLPSAFVILE RLPLLANGKL DRASLPLPAP SAYQSSVIVP PRTPQEMHLL
QQWQELLGFD QISIDDDFFD LGGHSLLATQ LVARMRDQFQ RDWSVATIFN YPTIEQLAAQ
LRSTPDHQHV ATIPVADRQQ RIPLSTMQER VWFLTQLNPE SRAYHFQMTI HFTGQLHVPI
LEQACSEIVR RHEILRTTFP TEAGQPYQQI HAPWAVTIPS IDLRQYPLEQ ASHLAEQAIA
VAMCEAFDLT QLPLVRWSVF RLADDQWMLL QIEHHFIHDG WSIARLLAEI KTLYTDYLAG
LAPSLPALPI QYADFAVWQR QQLNAGLLER DLRYWEAQLA HRPVVLELPT DYQRPPVQSM
RGSAERIAIP AELANAAREL SRRVGTTLFM TLLTTFSTLL YRYTEQTDIL LGSGIANRRQ
RELEPLLGMF VNTVVLRTDL QGNPSFRELL LRTRSLMLEQ YEHLDVLIEK VVERLRLPRD
LSRNPLFQVM FSFHDSPVPT LDLPMLHGEI LERNNGSAKA DLNVIVIPYA EQHGAAGHSA
EQKAITMIWE YSTDLFTQAT IQTMIGHFQA LLRSVTQNPD QRINQLAMLS RAETAQCLEQ
ARGPLVPTPT TTLHGLFANY VREQPNALAI VTDHESISYS RLNQRADMLA GALRQAGVGP
GMEVGIVSEP SIATIAGILA VLKLGAAYVP LDPSHPQQRL NLIINEAQLQ AILVESQLEQ
LLPNTSAAII RLDSDHGAVT DYPIVAAQAC AYGLFTSGST GQPKGVACSH EAVINLLDAM
QQMRPLPQGC RHSLWTSLSF DVSVYEIFSA LTQGGTLYLI DQTMRLDADQ FFAWLAKYAI
ESAYIPPFML HDLALWLMAN RNRLQLKRLL VGVEPIPEQN LAIIGQLIPG LTIINGYGPT
ETTICATFYS VPPFNDSARV TPIGRAIQQM AVYVLDRELQ PMPTGVIGDI YIAGIGLALG
YIAKPDLTAE VFLPNPLSAE PGMRMYRSGD RGRYLADGSL MFVGRSDRQV KIRGMRIELN
EIRTCVLQHA QVHEAVVNIY NDQPDNPQIV AYVVPTKGQL LTEASLRTYI GQKLPLAMQP
QAFVLLDRLP LTANDKLDWA ALPAPFPATR LSPMEAPSTP LEQILAGIWS ELFAQPAISI
DANFFELGGH SLLATRVASR LQETLHKTIP VSLFFQYPTI KQLAHVLDGY TAYESDHHRA
MLPESDRSLL SRVHELSEQE VDQLLAQFLD ESVE