Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1805 |
Symbol | |
ID | 5733707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2094746 |
End bp | 2098987 |
Gene Length | 4242 bp |
Protein Length | 1413 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641278948 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544576 |
Protein GI | 159898329 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain [TIGR01746] thioester reductase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.323791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAGC TGGCGCTATC GGCGGCACAA CATGGCATTT GGCTGGGTCA ACAGCTCGAT CCGAGTAGTC CGCTGTATAA CACAGCCGAA TATGTGGCTC TGCGCGGTGC GGTTGAGCTT ACCAATTTGA CCGCTGCGAT TAAGCAGGCC TTTGCTGAAG CCGCAACCCT GCATTTGCGT TTTGGGCTTG AGCATGATCA GCCGTATGCG CTGGTTGAGC CACAGCCAAT TAACTTGACC GTGCATGATT TGCGTGATTT ACCCGATGCT GAAGTACGAG CCATAGCTTG GATGCAGCAC GATTTGGGCA ATGTGGTCGA TCTAGCAACC GGCCCGTTGT TCAACACGGC GATTTTGCAA CTTGCTGATG ATCAGGTGTG GTGGTATTTG CGAGCGCATC ACATTGCCTT AGATGGCTAT AGTTTTGCTT TGCTCACCAA GCGCGTCGCC GAAATTTACT CGGCATTGCA AACCAAGGCC ACGCTCAGCC CAAGCTTTGG CGAATTGGTC CCAGTAATTG CTGAAGATCA CGCCTATCAA GTCTCAATTC AGGCCACGCT TGATCGCGAA TTTTGGGTTA ATCGCTTTGC AGATAATCCG CAAGTGGTCA GCCTGACTCA GCAAACTGCC CTATCGCAGC CGCGCAGCAT TCGCTTGAGC ACGGCTTTAG CGAACGACTT GATCGAGCGA TTGACTGCAA TCGCCAAGCC CAGCCGTAGC ACATGGCCCG ATGCCTTGAT GGCGGTGGTG GCAGCCTATC TCGCCCGCTG GAACAACAGC GAGAGCGTCG TTTTGGGCAT GCCCTTGATG AGCCGTTTGG GTTCGGTGGC GTTGCGTGTG CCGTGTATGG CCATGAATAT TGTGCCGCTG TGTCTTAACG TTGCGGCTGA GCATGATTTG GCCCAATTAA CTGCGGTAGT GGCAGCCGAA CGCAATGCCT TCCGCAAGCA TGGCCGCTAT CGCTATGAGC AGTTGCGCCG CGATTTGGGC TTTGTTGGCG CTGGGCGGCG CTTGTTTGGG CCTGTCGTCA ATATTATGCC CTTCGATCAT CCGCTGAATT TTGGTGATTG CCAAGCCCAG AGCACTACAC TCACCGCTGG CCCAGTCGAA GATTTGGCCT TCAACGTGAT TTTGCGCGGC AACCAACTCT ATCTGACGCT TGAGGCCAAT CCGGCTTGCT ACAGCCAAGC AGCGCTTGAA TATCATTTTG CGGCAATTCA ACACCTATTA AATGGATGGC TGGCAAATCC AAGCATACCT GTGGCTGAGC AGCAGGTTTT GCCAGCGCCG CTTGTGCTTG ATGGCGGCGA GTTGCGCTTG CCGCTGACCA GCGTGATCGA GCGAATTTTA CATAATGCCA GGCAACAGCC GCACGCTTTG GCTTTGGTTA CCGACACTGA GCAACTGAGC TATGCCGAGT TGGCGAGCCA CGTCCATGCG TGGGCAGGCC AATTGGTGCA GCGCGGGGTA ACTGCTGGCA GCGTGGTTGG CGTGGCTTTG CCGCGTAGCC GCGAGGCAAT TGTCGCAATT TTGGCGACGC TTTGTTGTGG GGCAGCCTAT CTGCCACTTG ACCCGCAATG GCCGCAAAGC CGCTTGGCGA GTGTCGTGGC GCAAGCCCAA CCAGTGCTAG TTTTGGCACA GCAAGCTTTT GATCTGCCCA ATTTGTTGTT GGTCGAGCAG TTGAGCAAGG CCAATGCATG GTTCGAGGCA CGGGTCGATT TGGCCCAACC AGCCTACATC ATGTATACCT CTGGCTCGAC TGGCGAGCCA AAAGGTGTGG TGATTAGCCA TCAAGCCTTG GCGGGTTTTG TGCAGGCTGC GGCTGAGCGT TACGCAATCA GCGCCGCTGA TCGGGTGCTG CAATTTGCCC CATTAGCTTT TGATGCTAGC GTTGAAGAAA TTTTTGTGAC GCTTTGCCAA GGCGCGACCT TGGTGTTGCG CAACGATGCC ATGCTCGAAT CGTTACAGCG CTTTGTGGCC GCCTGCCAAG CGCATGCGAT TAGTGTGCTC GATTTGCCAA CCGCCTTTTG GCATGAATTA GCCGATAGTG TGGCCCAAGG CGCGGTGCAG TTGCCCGAAT GTTTGCGGGT GGTAATTATC GGGGGCGAGG CGGCTCTGCC AGAGCGGGTT CAAGGCTGGT TGAACGTGGT TGCGCCGAAT GTGCGTTTGT TCAACACCTA TGGCCCAACC GAGGCGACCG TGGTGGCGAC CGTGGCCGAA TTGAGCGACC CCAACCAGCC AATTACGATT GGCCGACCAT TGGCTGGGGT GCAAGCAGCC ATTTTGGGCA GCGACCAGCG GCCAATTTTT GCAGGCGATG TTGGCGATTT ATATCTGCTG GGCAATGGCT TAGCAACTGG CTACTATCAA CGCCCCGATT TGGATGCGCT GAATTTTGGC CAACTTAGCC AATTGCCGCA TGCGCCCCGC GCCTATCGCA CTGGCGATCG AGTGCGTTTG TTCGCAGGTC AGTTACAGTT TGTGGGTCGC AGCGACGACG AATTCAAAAT TAGCGGCCAG CGTGTTACGC CTGCCGAAAT TGAATCGGTC TTTTTGCGGC ATACAGCGGT GCGCGAAGTA GCGGTGATTG GCCAGCAGCT TGGCAATGCG AGCAAGCGCT TGTTTGCAGC AGTCGTTGTC AGCGATGCTA GTTTGAGCGT GGCTGAATTG CGCAATCACG CCAGCCAACA TCTGCCAGCG GCGGTCATTC CGGCGGCGAT CACGATTGTT GAACGCTTGC CGCGCAGCAG TGCAGGCAAG ATCGATCGCA AGGCTGTGGC GGCCTTAGCG CCAGCACCAG TGATGGTGAA TGCTGCGATC AACGATACGC CAGCATTAAT TCGTCAAGTT TGGGCCGAAG TTTTGGGCCA AACTGAATTC AACGATGAAG CCGATTTCTT TGGCTTGGGC GGTCAATCGC TGCAAACCAT TCAGGTTGCC AATCGTTTGG GTATGGCCTT GGGTCGCGAA GTAACCGCCG CCTTGATCTT CCGCTATCCC ACGATTGCGG GCTTGAGCCA AGCGCTCGAC CCTGAATTTG AGCAGGCTCC TGAGGCAGCG CCGCAATTTT TGAGCGATGC CAATTTGCCT GAGCAGATTG TGCCCAAACA ACTGAATGCC CAGCCACGGC CAATCCAAAC CGTGCTGTTG ACTGGGGCAA CTGGCTTTGT CGGGGCACAT CTGTTGGCCG AATTGCTTAG CACAACCACC ACCAACGTGA TTTGTGTGGT GCGAGCTGGC TCGAATGCGG CAGCCTTTGA GCGGTTGCAA GCAAGTTTGC AACACTACGA ATTGCCAAGC GAGCAGCTTG CCGAGCAGGT TGAAGCTTGG CAGGGCGATT TGGCTCAGCC CCAATTTGGG CTTGACGATC AGCAATGGCA AAGCTTGATC GAACGTTGCG ATCTGATTTA TCACAATGCG GCGATGGTCA GCGTGGTTCG CGAGTATAGC AGCTTGCGGG CGGTCAACGT CAACGCCACC AGCGAAATTT TGCGTTTGGC AGCGGTGCAT TGCACCCCAG TGCATTACGT TTCGACCTTG GCAGTTTCAC CACCGCAAAG CGTGATGCAC CGCGTGCCCG AAGATTTTGT GGCGGCGCAT GCTGGCCTAC GCGATGGCTA TAGCCAAAGC AAATGGGTTG CCGAACGCTT GCTCGAACAA GCGGCTACCC GTGGTTTGCC GGTTGCTGTT TATCGTTTGG GGCGGGTAGT TGGCCCAAAT CAAAGCAATT TCGTCAATCA AGATGATTTA TTTTGGCGGA TTGTCCAAGC AGGTGTGCCG CGTGGCTTAT TGCCCAGCCT GCCTGTCGAG GAAATCTGGA ATCCAGTTGA TTTTGCTGCA CAGACAATCG TGCAATTTAG CCATAATCAT CGCGGCGTGC GCGTGTATAA CCTTGCTCCC AACGAACCAA TCAGCTTTGC CCAACTTTTG GGCTGGGTTG GCGAGTATGG CTATGCCGTG CAATTGTGCA GGGTTGAGCA ATGGTATCAA GCGTTGCGTA ACGCCGACGA TGCGATGAGT CAGGCGACTC TGACCTTCTT TGAGCGCCAG GCTGATGGTG GGGAACTGCC CAGCGCAATT GGTACGATTG AAAACAAACG CTTGCTGCAA ACGCTTGCAG CGCATGGCAT TGCTGTGCCT GTGATCGATC GCGAGCGCTT CTTTGGCTAT CTTGAGCGGT GTATTCGAAC GGGTTTATTG CCCGCACCCG ATTTACGCCA GACTAGTATT GGTATTCGCT AA
|
Protein sequence | MAKLALSAAQ HGIWLGQQLD PSSPLYNTAE YVALRGAVEL TNLTAAIKQA FAEAATLHLR FGLEHDQPYA LVEPQPINLT VHDLRDLPDA EVRAIAWMQH DLGNVVDLAT GPLFNTAILQ LADDQVWWYL RAHHIALDGY SFALLTKRVA EIYSALQTKA TLSPSFGELV PVIAEDHAYQ VSIQATLDRE FWVNRFADNP QVVSLTQQTA LSQPRSIRLS TALANDLIER LTAIAKPSRS TWPDALMAVV AAYLARWNNS ESVVLGMPLM SRLGSVALRV PCMAMNIVPL CLNVAAEHDL AQLTAVVAAE RNAFRKHGRY RYEQLRRDLG FVGAGRRLFG PVVNIMPFDH PLNFGDCQAQ STTLTAGPVE DLAFNVILRG NQLYLTLEAN PACYSQAALE YHFAAIQHLL NGWLANPSIP VAEQQVLPAP LVLDGGELRL PLTSVIERIL HNARQQPHAL ALVTDTEQLS YAELASHVHA WAGQLVQRGV TAGSVVGVAL PRSREAIVAI LATLCCGAAY LPLDPQWPQS RLASVVAQAQ PVLVLAQQAF DLPNLLLVEQ LSKANAWFEA RVDLAQPAYI MYTSGSTGEP KGVVISHQAL AGFVQAAAER YAISAADRVL QFAPLAFDAS VEEIFVTLCQ GATLVLRNDA MLESLQRFVA ACQAHAISVL DLPTAFWHEL ADSVAQGAVQ LPECLRVVII GGEAALPERV QGWLNVVAPN VRLFNTYGPT EATVVATVAE LSDPNQPITI GRPLAGVQAA ILGSDQRPIF AGDVGDLYLL GNGLATGYYQ RPDLDALNFG QLSQLPHAPR AYRTGDRVRL FAGQLQFVGR SDDEFKISGQ RVTPAEIESV FLRHTAVREV AVIGQQLGNA SKRLFAAVVV SDASLSVAEL RNHASQHLPA AVIPAAITIV ERLPRSSAGK IDRKAVAALA PAPVMVNAAI NDTPALIRQV WAEVLGQTEF NDEADFFGLG GQSLQTIQVA NRLGMALGRE VTAALIFRYP TIAGLSQALD PEFEQAPEAA PQFLSDANLP EQIVPKQLNA QPRPIQTVLL TGATGFVGAH LLAELLSTTT TNVICVVRAG SNAAAFERLQ ASLQHYELPS EQLAEQVEAW QGDLAQPQFG LDDQQWQSLI ERCDLIYHNA AMVSVVREYS SLRAVNVNAT SEILRLAAVH CTPVHYVSTL AVSPPQSVMH RVPEDFVAAH AGLRDGYSQS KWVAERLLEQ AATRGLPVAV YRLGRVVGPN QSNFVNQDDL FWRIVQAGVP RGLLPSLPVE EIWNPVDFAA QTIVQFSHNH RGVRVYNLAP NEPISFAQLL GWVGEYGYAV QLCRVEQWYQ ALRNADDAMS QATLTFFERQ ADGGELPSAI GTIENKRLLQ TLAAHGIAVP VIDRERFFGY LERCIRTGLL PAPDLRQTSI GIR
|
| |