Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1574 |
Symbol | |
ID | 5733461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1826304 |
End bp | 1831448 |
Gene Length | 5145 bp |
Protein Length | 1714 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278713 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544345 |
Protein GI | 159898098 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATTC AAGGAGCGAC CCAGAGCAAT CCAACCGATA CAAGCCCGGA TGACGATCAA CCCTGGTGCA TCCATGAGCT GGTTGCCCAA CAATCACGCT ATTGGGCTGA TGCTGTGGCG GTTATCCACG CTGACACGCA ATTAACCTAT GCCCAATTAG ACCAGCGGGC TAACCAAGTT GCCCACGCCC TGCTTGAGCA GGGGATTAAG CCCGATCACC TCGTTGCTCT CTGCCTTGAG CGCTCAATCG ACATGCTGAT TATCGTGTTT GGCATTCTAA AAGCCGGTGC AGCCTATTTA CCGATAGACC CTCACTACCC GTATGAGCGT CAACGCTTTA TGATCGAACA CTCACAAGCA CCGCTCGTAA TCACCACAGC TCCATTGGAA GGAGCGCTTG CTTCAAGGCC TGTGGAAATC CAGCTTGAAT TATTAATGGC AATTGCTGCT CAAAAACCAA CGAGTGCCCC TAATCAACGA GTTGATCCTG ATCAATTGGC GTATGTTATC TATACCTCAG GCTCAAGCGG TCAGCCTAAA GGAGTGATGA TCACGCATCG AGCGTTGGTC AATCATATGC AATGGATGCA AACAACCTTT GGCTTTAATC GCCATGATCG GTTCTTGCAA AAGACTCCAT TGAGTTTCGA TGCGTCGGTT TGGGAATGTT ATGCCCCATT ATTGTGTGGC GGTCAGTTGA TTCTAGCCAA GCCTGATGGT CATCACGATG CCCACTATCT GGTGGAAATG ATTCAGCGCT ATCAGATTTC GGTGCTTCAG GTGGTTCCTT CGTTGTTGCG CATGCTCCAA ACTGAACCAC AGCTGGCCAA TTGTCGGAGC TTACGCTACT TGTTTATTGG TGGTGAACCA CTGCATAGTG AGCTTGTGGC CCAAGTACGT CGCGTTTTGC CAGCTAGGAT GATTAATTTG TATGGGCCAA CCGAAGCCAC CATTGATGCA ACATGGGCTG AATGCAACCA AACGACCGAA TATCCAACCA TCCCAATTGG CTACCCGATT GATAATCTTA CGACATGGGT GCTCGATGCC CAGATGCAGC CAGTTGCAGT TGGTAGATCT GGCGAACTCT ATATTGGTGG AATGGGCTTA GCCCGAGGCT ATCAACGGCA ACCAGATCGC TCGGCTGAGC GCTTTGTGCC CGATCCATTT AGTACACAAC CTGGGACGCG GCTTTATAAA ACTGGCGATC GCGTTCGCTT GCTTGCCAAT GGTGCGCTGC TTTTTCTCGA TCGGATTGAT CAGCAAATTA AGCTCCGTGG GTATCGGATT GAGTTGGGCG AGATTCAGGC CGGTCTCGAA CGCCACCCTC AGATTCGCCA ATCGGTTGTT GTAGGCCAGC TCGATCAGAC TAAGACGCTT CAATTGGTGG CCTATGTGGT ACCAATGCCC GAGGCTAGAA TTCCAATTGA GCAATTAAGA GCGCTTTTGA AAGCCCAACT GCCGCGTTAT ATGCTGCCAA GCGCATTTGT GATCTTGGAA CGGTTGCCAC TGCTTGCCAA TGGCAAACTT GATCGGGCTA GCTTGCCGCT TCCAGCACCA TCAGCCTACC AATCGTCAGT CATTGTGCCG CCACGCACGC CCCAGGAAAT GCACTTGTTG CAACAATGGC AGGAGCTTCT TGGTTTCGAT CAGATTAGTA TTGATGATGA TTTTTTCGAT TTAGGCGGGC ACTCGTTGTT GGCAACGCAG CTTGTGGCCC GAATGCGTGA TCAGTTCCAG CGCGATTGGT CGGTCGCAAC AATTTTCAAC TATCCAACAA TCGAGCAGTT GGCGGCGCAA CTGCGGTCAA CCCCTGATCA TCAGCACGTC GCAACGATTC CCGTGGCTGA TCGTCAGCAA CGGATTCCCT TGAGCACGAT GCAGGAACGC GTTTGGTTTC TTACCCAACT TAATCCTGAA AGCCGTGCCT ATCATTTTCA GATGACAATT CATTTTACTG GGCAATTGCA TGTGCCAATC CTTGAGCAAG CGTGTAGTGA GATTGTACGG CGGCATGAAA TTCTGCGAAC GACCTTTCCG ACTGAAGCTG GTCAGCCATA TCAGCAGATT CATGCGCCGT GGGCTGTAAC AATTCCTAGC ATCGATTTGC GACAGTATCC GTTGGAGCAA GCAAGTCATT TGGCTGAGCA AGCAATTGCT GTTGCAATGT GTGAGGCCTT TGATCTTACC CAACTGCCCT TGGTGCGCTG GAGTGTTTTT CGGTTAGCCG ACGATCAATG GATGTTATTG CAGATTGAGC ATCATTTTAT CCACGATGGT TGGTCGATTG CGCGACTGTT GGCTGAAATA AAGACACTCT ATACTGATTA TCTTGCTGGA TTAGCCCCAT CGTTGCCTGC GTTGCCAATT CAATATGCTG ATTTTGCAGT TTGGCAGCGT CAGCAATTAA ACGCTGGCTT GCTCGAACGT GATTTGCGCT ATTGGGAGGC GCAATTGGCG CATCGTCCAG TTGTGCTTGA ACTTCCGACC GATTATCAAC GACCACCTGT GCAAAGTATG CGTGGTTCAG CTGAGCGGAT TGCAATTCCG GCTGAGTTGG CAAATGCTGC CCGTGAATTA AGCCGTCGGG TCGGGACAAC CTTATTTATG ACCTTGCTCA CAACATTTAG CACGTTGCTC TACCGTTATA CCGAGCAAAC CGATATTCTG CTTGGATCAG GAATTGCCAA TCGGCGGCAA CGTGAGCTTG AGCCATTGTT GGGGATGTTT GTTAATACGG TCGTACTCCG CACTGATCTC CAGGGGAATC CTAGCTTCCG AGAACTTTTG CTTCGCACGC GTTCGTTAAT GCTGGAGCAG TATGAACACC TTGATGTTTT GATTGAAAAG GTGGTTGAAC GGTTACGTTT ACCGCGCGAT TTGAGCCGCA ATCCACTGTT TCAAGTGATG TTTAGCTTTC ACGATTCGCC AGTACCAACG CTTGATCTGC CAATGCTTCA TGGCGAGATT CTCGAACGCA ATAACGGTTC GGCGAAGGCC GACCTGAACG TGATTGTGAT TCCATATGCT GAACAACATG GGGCGGCGGG CCATAGTGCT GAGCAAAAAG CGATTACGAT GATCTGGGAA TATAGCACTG ATCTGTTTAC CCAAGCGACG ATTCAAACCA TGATTGGGCA TTTTCAAGCA CTGTTGCGGT CTGTTACCCA GAATCCTGAT CAGCGGATTA ATCAGCTAGC CATGCTCAGT CGTGCTGAAA CTGCCCAATG CCTTGAGCAG GCCCGTGGTC CGCTTGTTCC AACTCCAACA ACGACCCTGC ATGGTTTGTT TGCCAATTAT GTTCGCGAGC AACCAAATGC CCTTGCAATT GTCACAGACC ATGAATCTAT CAGCTATAGC CGCTTGAATC AGCGGGCTGA CATGCTAGCG GGTGCGCTTC GCCAAGCTGG GGTTGGGCCA GGGATGGAGG TGGGGATTGT GAGTGAGCCC TCAATTGCAA CAATTGCTGG AATTCTGGCA GTGCTGAAAT TGGGCGCAGC CTATGTTCCA CTTGATCCGA GCCATCCTCA ACAACGCCTC AACTTGATCA TCAACGAAGC TCAACTCCAA GCAATCTTGG TTGAATCGCA GCTTGAACAG TTGCTGCCGA ACACATCGGC GGCAATTATT CGGCTTGATA GCGACCATGG AGCAGTAACA GATTACCCGA TTGTAGCTGC TCAGGCGTGT GCCTATGGCT TATTTACCTC GGGTTCTACT GGGCAACCAA AAGGAGTAGC CTGTAGTCAT GAGGCAGTGA TCAATCTTTT GGATGCCATG CAGCAGATGC GTCCACTTCC GCAGGGATGT CGCCATAGTT TATGGACAAG CCTCAGCTTT GATGTTTCAG TCTATGAGAT ATTTAGTGCG CTTACCCAAG GCGGCACGCT ATACTTAATC GATCAGACTA TGCGGCTTGA TGCCGACCAG TTTTTTGCTT GGTTGGCCAA ATATGCCATT GAAAGTGCCT ATATTCCACC ATTTATGCTC CATGATTTAG CGCTTTGGCT GATGGCGAAT CGGAATCGAC TCCAGCTCAA ACGACTTTTA GTTGGGGTTG AACCAATTCC TGAGCAGAAT TTAGCGATAA TTGGGCAGCT GATCCCTGGA TTAACGATCA TTAATGGGTA TGGTCCAACG GAAACAACAA TCTGTGCGAC GTTCTATAGC GTGCCACCGT TCAACGATTC AGCTCGGGTA ACGCCAATCG GTCGGGCAAT TCAGCAGATG GCAGTCTATG TGCTTGATCG GGAGTTGCAG CCGATGCCAA CCGGAGTTAT TGGCGATATC TATATTGCTG GAATTGGGTT GGCGCTAGGC TATATTGCCA AGCCAGATCT GACGGCTGAG GTATTTTTGC CTAACCCATT AAGTGCTGAG CCAGGGATGC GTATGTATCG GAGTGGTGAT CGTGGACGGT ACTTGGCTGA TGGTTCATTA ATGTTTGTCG GTCGGAGTGA TCGCCAAGTC AAAATTCGAG GAATGCGGAT TGAGCTTAAT GAGATTCGTA CATGCGTGCT GCAGCATGCT CAGGTGCATG AGGCGGTCGT TAATATTTAT AATGATCAGC CTGATAATCC TCAAATCGTT GCGTATGTTG TTCCAACCAA GGGTCAGTTG CTGACTGAGG CTTCGCTACG AACATATATT GGTCAGAAAT TGCCGCTCGC GATGCAGCCA CAAGCGTTTG TGCTGCTTGA TCGATTGCCG CTTACGGCCA ATGATAAACT TGATTGGGCT GCTTTGCCTG CGCCATTTCC TGCAACCCGA TTAAGCCCCA TGGAAGCTCC ATCGACCCCG CTTGAGCAGA TCCTTGCTGG TATTTGGAGT GAGCTATTTG CCCAACCAGC AATCAGCATT GATGCTAACT TTTTTGAGTT AGGCGGCCAT TCATTATTGG CAACCCGAGT TGCCTCGCGG CTCCAAGAAA CATTGCATAA AACAATTCCA GTCAGCCTCT TTTTTCAATA TCCCACGATC AAGCAACTGG CCCATGTTCT CGATGGCTAC ACTGCTTATG AATCAGACCA TCATCGCGCC ATGCTGCCGG AAAGCGATAG ATCACTGCTG AGTCGCGTTC ATGAGCTTTC TGAGCAGGAG GTTGATCAAT TACTGGCTCA ATTCCTTGAT GAATCTGTTG AATAA
|
Protein sequence | MRIQGATQSN PTDTSPDDDQ PWCIHELVAQ QSRYWADAVA VIHADTQLTY AQLDQRANQV AHALLEQGIK PDHLVALCLE RSIDMLIIVF GILKAGAAYL PIDPHYPYER QRFMIEHSQA PLVITTAPLE GALASRPVEI QLELLMAIAA QKPTSAPNQR VDPDQLAYVI YTSGSSGQPK GVMITHRALV NHMQWMQTTF GFNRHDRFLQ KTPLSFDASV WECYAPLLCG GQLILAKPDG HHDAHYLVEM IQRYQISVLQ VVPSLLRMLQ TEPQLANCRS LRYLFIGGEP LHSELVAQVR RVLPARMINL YGPTEATIDA TWAECNQTTE YPTIPIGYPI DNLTTWVLDA QMQPVAVGRS GELYIGGMGL ARGYQRQPDR SAERFVPDPF STQPGTRLYK TGDRVRLLAN GALLFLDRID QQIKLRGYRI ELGEIQAGLE RHPQIRQSVV VGQLDQTKTL QLVAYVVPMP EARIPIEQLR ALLKAQLPRY MLPSAFVILE RLPLLANGKL DRASLPLPAP SAYQSSVIVP PRTPQEMHLL QQWQELLGFD QISIDDDFFD LGGHSLLATQ LVARMRDQFQ RDWSVATIFN YPTIEQLAAQ LRSTPDHQHV ATIPVADRQQ RIPLSTMQER VWFLTQLNPE SRAYHFQMTI HFTGQLHVPI LEQACSEIVR RHEILRTTFP TEAGQPYQQI HAPWAVTIPS IDLRQYPLEQ ASHLAEQAIA VAMCEAFDLT QLPLVRWSVF RLADDQWMLL QIEHHFIHDG WSIARLLAEI KTLYTDYLAG LAPSLPALPI QYADFAVWQR QQLNAGLLER DLRYWEAQLA HRPVVLELPT DYQRPPVQSM RGSAERIAIP AELANAAREL SRRVGTTLFM TLLTTFSTLL YRYTEQTDIL LGSGIANRRQ RELEPLLGMF VNTVVLRTDL QGNPSFRELL LRTRSLMLEQ YEHLDVLIEK VVERLRLPRD LSRNPLFQVM FSFHDSPVPT LDLPMLHGEI LERNNGSAKA DLNVIVIPYA EQHGAAGHSA EQKAITMIWE YSTDLFTQAT IQTMIGHFQA LLRSVTQNPD QRINQLAMLS RAETAQCLEQ ARGPLVPTPT TTLHGLFANY VREQPNALAI VTDHESISYS RLNQRADMLA GALRQAGVGP GMEVGIVSEP SIATIAGILA VLKLGAAYVP LDPSHPQQRL NLIINEAQLQ AILVESQLEQ LLPNTSAAII RLDSDHGAVT DYPIVAAQAC AYGLFTSGST GQPKGVACSH EAVINLLDAM QQMRPLPQGC RHSLWTSLSF DVSVYEIFSA LTQGGTLYLI DQTMRLDADQ FFAWLAKYAI ESAYIPPFML HDLALWLMAN RNRLQLKRLL VGVEPIPEQN LAIIGQLIPG LTIINGYGPT ETTICATFYS VPPFNDSARV TPIGRAIQQM AVYVLDRELQ PMPTGVIGDI YIAGIGLALG YIAKPDLTAE VFLPNPLSAE PGMRMYRSGD RGRYLADGSL MFVGRSDRQV KIRGMRIELN EIRTCVLQHA QVHEAVVNIY NDQPDNPQIV AYVVPTKGQL LTEASLRTYI GQKLPLAMQP QAFVLLDRLP LTANDKLDWA ALPAPFPATR LSPMEAPSTP LEQILAGIWS ELFAQPAISI DANFFELGGH SLLATRVASR LQETLHKTIP VSLFFQYPTI KQLAHVLDGY TAYESDHHRA MLPESDRSLL SRVHELSEQE VDQLLAQFLD ESVE
|
| |