Gene Information |
Locus tag | Haur_2089 |
Symbol | |
ID | 5733977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2602175 |
End bp | 2606266 |
Gene Length | 4092 bp |
Protein Length | 1363 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279230 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544857 |
Protein GI | 159898610 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins; [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
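The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are inclusive, and that the CDS is complete and ends in a single stop codon. The constant and function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2602175
END_BP = 2606266
GENE_LENGTH_BP = 4092
PROTEIN_LENGTH_AA = 1363


def span_length(start, end):
    """Length of a coordinate span, assuming both ends are inclusive."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from start to end matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.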
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAA CCTTGCCCCA AGGCTTTTCG ACCGATGACC TTGAGTTGCT GGCCTATTTG CTTGAAGAAG CAGGCATTGA TCATGCCGTG CCAAACCAAA TTCAGCCTCG TCCAGCCAAC CAGCCAGTTC CACTGTCGTT TGCCCAAGAA CGCTTGTGGT TTATCGACCA GCTAGAACCT GGCAATCCAG CCTATAATAT TTTGTTTGCG GTGCAGATTG ACGGCCCGTT GCATGTCGCA TACTTGCAGC AGAGCTTTGA TGCGGTGATT GCCCGCCACG AGAGCCTGCG TACCAGCTTT CCAGTGCTCA ACGATCAGCC GATTCAGGCG ATTGCTGAAA AACATGATTT TGACTTAACA ATCGTCGATC TCCGCTATTT AGCAGCAATT GAACAAGCAT CAACAATTGA ACAACTATCA ATTGAACAAC GCTCAATTAT CGAACAGCAA TTACTGATCG ATAGTGCCCA TCGTTTTGAT TTGGCGCAAG GCCCATTGCT GTATGGTCGT TTGCTCTGGC TGGCCGAGCA GCAATATGTG CTGATTCTCA ACTTGCACCA TGCGATTTTT GATGGCTGGT CGTTGGCAAT TTTTATTGAG GAATTGCGCC ATTGCTATAG CGCCTTGCTT GCAGGTCAAG CGCTCGATTT AACCCCAGCA GCGTTGCAAT ATGCCGATTT TAGTTATTGG CAGCGTGAAT ATTTGCAAGG CGAGGTTTTG GCCGCGCAAT TAGCCTTTTG GCAAAACCAA TTTGCTGGAC GTTTGCCCAC CTTGGCCTTA CCAACCGATC GACCACGCCC AAAACACGAG ACTGGTCGTG GCGCAGCCTT GCCATTTCGT GTTGATCAGG TATTAACGGA GCAGTTGCAA CACCTCGCCC AAAACGAACA TGCCACGATG TTTATGCTGT TATTGGCGGC ATTTCAGCTG ATGCTGGCTC GCTATAGCCA ACAGCAAGAG TTTGTGGTTG GCTCGCCAAT TGCCAATCGT GATCGGGTTG AAATTGAGCA TTTGATCGGC TTTTTCGTCA ATATGCTGCT GCTGCGTTGT GATGTTCAGC CGCAGCTGAG CTTTCGCGAA TTTCTGGCGC AAGTGCGCGA AACGACGCTC GAAGCCTATG CCCACCAAGA TTTGCCCTTT GAGCAGTTGG TCGAGGTGCT TCAGCCCGAC CGTGGCGCAG GCTATGGTTC GTTGTTTCAG GTGATGTTTG TGCTGCAAAA TACGCCTAAG GTCAACTATG AGATCGCCGA TTTGCAGCTG AGCTTTCTCG ATACCGAGGC GCATAGTACC AAATATGACC TGACCATGAC CTTGACTGAA ACCGCAACTG GGCTGGAAGG TTGGTTTGAA TACAACACCG ATTTGTACGA TCAGGCCACG ATTCAGCGCA TGCTTGGGCA TTATCAACAA GTATTACGGG TGGTTGGCGA AGCGCCTGAT CAAGCACTTA ATGCGATTAG CGTATTTGAT GAGCAAGCCC AAACGGCTTT ATTCAAGCTA AGCAATCAAA CTCAGCACGA TTTTGGCCCT GCTGATTTCT TAGAACGCTT TGCCAATCAA GTGGCAGCGA CACCCAACGC GATTGCGGTG CGCGATGCCC ATCAGCAGTA TTCGTATCAG GCTTTGCAGC AACGAGCTAT GGCTTTGGCG GCCCAACTGC AACAGCATGG CGTTAGGCAA GAAACCCTCG TGCCAATTTT GTTGCCGCGT ACCAGCGATT TTGTGGTTGC GGTTTTGGGT GTTTTTTACG CTGGGGCAGC CTATTTACCG CTTGACCCTG CGTGGCCAGC CCAGCGTAGC 
GCCCAGATTT TGCAGGGATT GGCGATTCCT GCCTTGATTT GCGAACCCGA TTTAGCCCGT TGGTTTGCTA AGCATGTTCA GCCGTTGTTT AGGCTTCACA ATCAGCCGCA GTTAATCGAA CAATGGAATG ATGCAGCAAC CAAGCTTGTT GTAAGCCAAA CCCATCCGCA GCAATTGGCC TATACCCTGT TTACCTCTGG CTCAACTGGC ACACCCAAAG GAGTGATGAT CGATCAGGCT GGGATGCTCA ACCATCTGTT GGTGATGAAT CAGGTGCTGG AAATCCAAGC CCATGATGTG GTGGCCCAAA CCGCTTCGCA ATGCTTCGAT ATCTCAGTGT GGCAGATGCT GTCGGGCTTG TTGGTTGGCG CAACGGTGGC AATCATTGAT GATCAGACGA TGCGTGACCC GTTGGCCTTA GCCCAAACTC TGGCTGAACA GCAAGTCACA ACTTTCGAGC CGGTGCCAAG CCTGTTGCAA GCCTTGCTTG AAACATTGCA AACCCCTGCT GAACAAGCCT TATTGCACCG CTTGCGCTGG GTTTTGCCAA CCGGCGAGGC CTTGCAACCA GTTCAGGCCC GTCAATGGTT TGCCACCTAT CCGCAGATTC CGTTGCTGAA TGCGTATGGG CCTGCCGAAT GTGCCGACGA TGTAACCCTG CAACGGCTTG ATTCTGCTCC GACCGAAGGC CATAGCACCA TGCCAATCGG CAAGCCTGTC GCCAATATGC AGGTGTTTGT GCTTGATCCA AACTGGCAAT TGTTGCCATT GGGCGCAGTC GGCGAATTGT ATATCGGTGG AGTTGGGGTT GGTCGGGGCT ATTTGAATGA TCCAGCCCGC ACTGCCAGCG CCTTCGTACC CAACCCATTT GCTGATAATG GCAGTCGGCT CTATCGTACT GGCGATTTGG TGCGCCAAAC TGCTGATGGG GCTTTGCACT TCATTGGTCG CGCTGATCAG CAAGTTAAAG TGCGCGGCTA TCGGATTGAA CTAGGCGAGA TCGAGGCGGT CTTGGCGGAA TTGAGCTGGT TGCGCGAGGC GGCGGTGCAC CCTTGGCAGC AACAATTAGT TGCCTATCTG GTTCCGGTTT CCGATACTCC TGATTTGATC AGCCTTGTGC AGCCTGCGCT CCAACAGCGA TTGCCCAGCT ATATGCTGCC AAACCAATAT CTGGTTTTAG ATCAATTGCC GCGTAACCGT AATGGCAAGC TTGATCGTCA ACAACTGCCA GCGCCGAATC CTGCCAACCT TGGCTTTCAA ACACCGCTGG TTGTGCCACG TACCCAGCAC GAGGCCGAAC TCGCGGCGAT TTGGGCTGAC GTACTTCAAC TTGACGTAAT CAGTATTGAT GCCAATTTCT TCAGCAGCGG CGGCCACTCG CTCTTAGCAA CGCGGGTGAT GCTACGCACA CGCCAGCATT ATGGCCGTGA TTTACCATTG CGCATGATTT TTGAAGCACC AACTATTCGT GAATTTGCTG CCTTATTGGA ACAGCAACAA GCAGTGTCAG CGCTCCCTAA CCTACTCGTG CCTATTAAAC CCCAAGGCTC ACGCACGCCA CTGGTGTGTG TCCATGCAAT TGCGGGCACG GTTGGCTGTT ATAGCGAACT AGCAATTGCA CTTAATCCTG AACAACCCGT GTATGCCTTG CAAGCACCTG GCATTGACGG CGGCACAACC CATGCCAAGG TTGAAGCAAT TGCCCAAGAC TATTGCCAAG CATTGCGACA ACTGCAACCT CAGGGGCCAT ATCGTTTGGC TGGTTGGTCA TTTGGAGGTT TGGTTGCGCT TGAAATGGCG CGACAACTGC 
AACTTGCTGG CGAGCAGGTA TCTATGCTCA GTTTGATCGA TAGCTTTCTG GCCGAGCCAA CGCCTGATCC ATTCCCATTG ATTCAGAGCT TCGCCGCCGA TCTGTTTGCT GATGTTGATC CATTGGCGGC GCAACAGATC GATTGGCCCG CGATTGTAGT GCTGCCTGCT GAGCAACAAT TGGCGGCGCT CTATCAACAA GCCCAACAGG CTGGCCTAAT TGATCGTGAT CTGCCGTTCG ATCTGGCACA ACGCTTATAC GCGGTTTTTA CCAGCCATGC CCACGCCATA CAAGCCTATC AGCCCGCTAT ATATCTTGGG GAAGCTCAAT TGCTGCAAGC TCAAGCCAAC CCAGCAGCAG CTCGCCGCTG GCAAGCAGTC ATTCCAAATC TGCACATTCA GGTAATCGGC GGCGATCATA TCAGCATTCT GCGGCAGCCC CATGTGCATG GTTTAGCAAA TGCCATAGAG CAGAGAACAT AG
|
Protein sequence | MTQTLPQGFS TDDLELLAYL LEEAGIDHAV PNQIQPRPAN QPVPLSFAQE RLWFIDQLEP GNPAYNILFA VQIDGPLHVA YLQQSFDAVI ARHESLRTSF PVLNDQPIQA IAEKHDFDLT IVDLRYLAAI EQASTIEQLS IEQRSIIEQQ LLIDSAHRFD LAQGPLLYGR LLWLAEQQYV LILNLHHAIF DGWSLAIFIE ELRHCYSALL AGQALDLTPA ALQYADFSYW QREYLQGEVL AAQLAFWQNQ FAGRLPTLAL PTDRPRPKHE TGRGAALPFR VDQVLTEQLQ HLAQNEHATM FMLLLAAFQL MLARYSQQQE FVVGSPIANR DRVEIEHLIG FFVNMLLLRC DVQPQLSFRE FLAQVRETTL EAYAHQDLPF EQLVEVLQPD RGAGYGSLFQ VMFVLQNTPK VNYEIADLQL SFLDTEAHST KYDLTMTLTE TATGLEGWFE YNTDLYDQAT IQRMLGHYQQ VLRVVGEAPD QALNAISVFD EQAQTALFKL SNQTQHDFGP ADFLERFANQ VAATPNAIAV RDAHQQYSYQ ALQQRAMALA AQLQQHGVRQ ETLVPILLPR TSDFVVAVLG VFYAGAAYLP LDPAWPAQRS AQILQGLAIP ALICEPDLAR WFAKHVQPLF RLHNQPQLIE QWNDAATKLV VSQTHPQQLA YTLFTSGSTG TPKGVMIDQA GMLNHLLVMN QVLEIQAHDV VAQTASQCFD ISVWQMLSGL LVGATVAIID DQTMRDPLAL AQTLAEQQVT TFEPVPSLLQ ALLETLQTPA EQALLHRLRW VLPTGEALQP VQARQWFATY PQIPLLNAYG PAECADDVTL QRLDSAPTEG HSTMPIGKPV ANMQVFVLDP NWQLLPLGAV GELYIGGVGV GRGYLNDPAR TASAFVPNPF ADNGSRLYRT GDLVRQTADG ALHFIGRADQ QVKVRGYRIE LGEIEAVLAE LSWLREAAVH PWQQQLVAYL VPVSDTPDLI SLVQPALQQR LPSYMLPNQY LVLDQLPRNR NGKLDRQQLP APNPANLGFQ TPLVVPRTQH EAELAAIWAD VLQLDVISID ANFFSSGGHS LLATRVMLRT RQHYGRDLPL RMIFEAPTIR EFAALLEQQQ AVSALPNLLV PIKPQGSRTP LVCVHAIAGT VGCYSELAIA LNPEQPVYAL QAPGIDGGTT HAKVEAIAQD YCQALRQLQP QGPYRLAGWS FGGLVALEMA RQLQLAGEQV SMLSLIDSFL AEPTPDPFPL IQSFAADLFA DVDPLAAQQI DWPAIVVLPA EQQLAALYQQ AQQAGLIDRD LPFDLAQRLY AVFTSHAHAI QAYQPAIYLG EAQLLQAQAN PAAARRWQAV IPNLHIQVIG GDHISILRQP HVHGLANAIE QRT
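The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch (the helper names `clean_sequence` and `gc_fraction` are hypothetical, not part of any database tooling), the blocks can be joined and the GC fraction of a nucleotide string computed as follows. The sample input is only the first two blocks of the gene sequence, so its GC fraction is not expected to equal the value listed for the whole gene.

```python
def clean_sequence(raw):
    """Join a blocked sequence by removing spaces and line breaks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence, used as a small input.
    sample = clean_sequence("ATGACCCAAA CCTTGCCCCA")
    print(len(sample), gc_fraction(sample))
```

Running the same two helpers on the full gene sequence would give a length and GC fraction to compare with the Gene Length and GC content fields.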