Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1877 |
Symbol | |
ID | 5733766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2233390 |
End bp | 2237967 |
Gene Length | 4578 bp |
Protein Length | 1525 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279021 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544648 |
Protein GI | 159898401 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720 [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGATG CGATGATCGA GGGATTTCGG CTTTCCCCAC AGCAACAGCA TGTTTGGGCG CTCCAGCAAT TGGATTCAGC CCAGCCTTAT CGTACTCAGG GAACTATCGT GATTGAAGGC TCCCTCGATA TTGCTCGCTT GCAGGCTAGT TTGTTGCAGG TTGTACAGCG CTACGAAATT TTGCGCACAA CCTTTCACTA TCTCCAAGGG ATGGCCTTGC CGTTGCAGGT GGTTACCGAA TTAAGCGAAC TAGCATTGCC AAGCTATGAT CTGAGCGAAC ATAGTTTGAC TGATTTGCAA AGCCAATTGG CTCAGCAAGC CTTTGATTTT GCTGCTGGCC CATTGCTGCA TGCCTGGCTT GGGCGTGAAC ATGCGCTAAA ACATTATTTG CTGTTGAGTG TGCCAACGCT CTGTGCTGAT AACCTGAGTT TGGTCAACTT AACCAGCGAG CTGGCGGCGA TTTATGCTGC CGAGCCAACT ACCGATGAGC CAATGCAATA TATCGATATT GCTGAGTGGC AGCACGAACT ATTGGAAGCT GAAGAAACCG CTGCTGGCCG CAGTTACTGG CAACAACAAA CCTGGCACGA TGCGATTACC GTGCGGCCAG CGTTTGTTGC CAATCAAGCA GCGGCCCAAA CGTTTCAACC GCAGCAATTG CCAATTCCGC TGACTGAACA ACTAATTGAG CTGCTTAATC AGGCAAGTCA AACTTTAGCT GTGCCTGCTT CAGCGCTTGC CTTGGCTGCT TGGCGAACCT TGTTGTGGCG CTTGAGTGAT CAAACCAATG GTGTGGTTGG CGTGGTTTGT GATGGGCGCA AATATGCCGA GCTTGAAACC ACGCTTGGTT TATTTGCCAA AGCCGTACCG CTGGCTAGCC CGTTGGCAGC AGACCAGCAG TTTGGTCAAT TGGTCAAACA AGTGCAGCAA GAATATGTGG AGGCCTATGC TTGGCAAGAA AGTTTTCATT GGCCTGCAGA AGTGTCCAGT CAATTGGCCT TTTTCCCCTT TGGCTTTGAA GCGCTGACCA CACCCAAACC GCTGGTGATG GCTAATCTCA GCTTTCAAGT CGTGCAGCAA CGGGCTACGC TCAATCCATA TGCAGCCCAC TTAACTTGGT TCAACCAACC CCAAGGATTT GCCGCTAGCC TGAATTTTGA TGCTGGGTTG ATTGCGTCAT CGAGTGCTGA ACGGTTGATT GAGCAATATC AAACCCTTTT GACTGCTGCA CTGAGCAACT TGAACACAAG CTTGGCTCAA CTGCCAATTG TGGGTACAAA CGAACGTCAA CAGCTATTAA TTGAATTTAA TCAAACAGCC GAGCCATTTG ACGCTGCACG TTGCTTTCAT GAGTTGTTTA GCGCTCAAGC AGCGATTACA CCTGATCACC CAGCGGTTGT GGTTGAAGAC CAACAACTGA GCTATGCAGA ACTTGAGGCA CGTTCCAATC AACTGGCGCG AGAGTTGCTG GCCCGGGGAG TTCGCCCTGA TCAACCAGTT GCTTTAGCCT TGGATCGCTC GCTCAACCTG CTGGTGGGTA TTTTGGGTAT TCTCAAGGCT GGCGGAGCTT ATGTACCGCT GGATTTGGGA TTGCCCAAAG AGCGTTTGGG CTTTATGCTG GGCGATATTC AAGCCTCAAT TGTGGTGAGT GAAACCAGCC TGCAAGCCCA ATTGCCTGAG CATGCTGCCG ACTATCTTTG GCTGGATCAA GCTTGGCCGA CGATTGCCGA GCATTCAAGC GAACCTGTTG CTGCTTCGGC GGTTCCTTCT AACTTGATGT ATATCATTTA TACCTCTGGT TCGACTGGCC AACCCAAAGG CGTTGGGGTC AGCCATCAAA GTCTTTATAA CTATATTTCA AGTATTAGCC AACGGCTTAA TCTGCCGCCA CAGGCTAGTT TTGCCAGTGT CTCGACCTTT GCCGCTGATC TTGGACATAC GGCGATTTTT CCAACCCTAA CAAATGGTGG CACGCTGCAC CTTATTACTG CTGAGCGAGC GAGCAATGCT AGCCAATTGG CCGATTATAT GCAGCAGCAT GCCGTTGATT GTCTTAAAAT TGTGCCATCG CATTTGGCGG CGCTCTTGGC TGTGGCTGAA CCTGCGCGAG TCTTGCCACG TCAGCGCTTG ATTCTGGGCG GCGAGGCGGT TAGTTGGAAG TTGCTACAAA CCTTGGCGCT ACTTGCACCT GATTGCCAAG TATTCAACCA CTACGGACCG ACCGAAACAA CGGTGGGTGT GCTAACCAAT CCACTGAGTG CGAATTTGCC AAGTGCTCAA TCGGCAATAC CAGCCTTGGG TCGTCCCATC GCTAACACCC AAATCTATCT GCTCGATGTT CACGGTCAGC CTGTGCCATT GGGTATGACT GGCGAGCTGT ATGTGGGGGG CGCGGCGCTG GCACGTGGTT ATTGGCAGCG ACCTGCGATT ACGGCTGAGC GGTTTGTGCC CGATGGCTTA AGTGGCCAAA CTGGCAGTCG CTTGTATCGT ACTGGCGATG TAGCTCGCTA TTTGCCTGAT GGCAAACTTG AGTTTTTAGG CCGCGCTGAT GATCAGGTGA AAATTCGCGG CTTCCGGATT GAATTGGGTG AAATTGAAGC AGCGTTGCGT AACCACACGG CGATTGAACA AGCAGCGGTG ATAGTGCGTG ATGATCCTGC TGGCGATAAG CGTTTGGTAG CTTATTTGGT TGCAGGCCAA CAACGCCCAC TTTCGTTACG CGAGTTGCGT AACTTTTTGA AACAGAGCCT GCCCGATTAC ATGGTTCCGG CGGCATTTGT GATGTTGGAA CGACTGCCAT TGAACGCTAA CGGCAAACTT GATCGTCAAG CCTTACCAGC GCCTGAACAG CAACAAACCA AGGCTAGCAC TCAGATTGTG GCTCCACGAA CCCCCGTTGA GACAACATTG GTTGACATTT GGAGCCAAGT GTTGGGTGGC AAGTCGGTGG GCATTAACAA TAACTTTTTT GAGCTGGGCG GCGATTCAAT TCTGAGTATT CAAATTATTG CCCGCGCCAG CCAAGCAGGC CTTAAGTTGA CACCCAAGCA ATTATTTGAT CATCCGACAA TTGCCGATTT GGCGCAAGTG GTGGCCACCA CAACAGCAGA TCAACAAGCT CAGCAGCAAT TGATAACTGG CCCTGTGCCG TTGACTCCGA TTCAACATTG GTTTTTTGAG CAAGCCCTCG CCGAGCCGCA GCATTATAAT CAAGCAGTCT TTTTCGAAGT GCGCTTTGAT CTCGATCCGG CGATTTTGGC TCAGGTGTTG CCTGAACTTG TGCGCCATCA CGATGCCTTG CGCCTACGAT TTAGCCCGAG CGAACAAGGC TGGCAACAAG TTAATAGTGC CGATGTAGCG GTCGAGTTGC TGCACATTAA TCTGGCCGCT GCGCCAGCCG AGCAGCAGCG CCAGTTGATG GAGCAAAAAG CCACTGAACT GCAAACTAGC CTTGATCTGA TCAATGGCCC GTTGTTGCGT ATGGCCTTGT TTGAGCTTGG ATCCAATCAA CCAAGTCGCT TGTTGGTGAT TGTGCATCAC TTGGCGATTG ATACTGTCTC GTGGCAGATT TTGTTTGCCG ATTTACCGCT GGTATACGAG CAAATTCGTC AGCAACAACC AATTAATTTA CCAGCCAAAA CCAACTCATT CAAAGATTGG GCTGAGCGCT TGCAACGCTA TGCAGGTTCG GCTGAACTTG AGCGCGAAGT TGCCTATTGG CTTGATCCTA CCCGCCAACA GGTTCGCCCA CTACCAGTCG ATTATGCTGC TGAAGCCCAT GCCAATACGG TTGCAAGCAC CCAAAATCTG AGCCTACATT TGAGTGTTGA GGAAACGAAG GCCTTATTGG AAGTGGTTCC TCCGGTCTAT AACACCCAAA TTAACGATGC GCTATTGGCA GCCTTAACCC AAAGCATCAG TCAATGGCAG GGCAATCCAA GCGTGTTAGT CGAGCTCGAA GGCCATGGTC GTGAAGATAT CTTGGATGAT TTGGATATTT CGCGCACGGT TGGATGGTTT ACTAGTCGCT TTCCGGTGTT GTTGCAAGCG AGCAAATCAG CCAATGCTGG CGATAGCCTA CGCGCAACTA AAGAACAGTT GCGCCAAATT CCACAGCGCG GGATTGGCTA TGGTTTATTG CGCTATTTGC GTGGCGATGC CCAGCTGAGC CAGCAATTAG CCAATCTGCC CCAACCGCAA CTGAGCTTTA ATTATTTGGG CACGGTCGCC CACGATGTTT CGCAAACTGG TCCATTGGCT TGGACGAGCG AATCGAGTGG GCCAACCCGT AGCCCCGCAG CCTTACGCCG CCATTACCTT GATCTGACGA TCTTGGTAAC CGATCACATG TTACAGATGA ATTGGACATA TAGCCAAGCA TTGCATAGTG CAGCGACGAT TCAGCGCTTG GCTGAACGTT TTGTGTCCGC CTTACAAGCG ATTATTCAGC ATTGCCAACA ACCCAATGCT GGTGGTTATA CCCCTTCGGA TTTTCCATCG GCCAACTTAA ACCAAAAGAA TTTGGATAGC TTTATCGCCA AATTACGCAA CAGCGAGAAC AGCACTCATG AAAGTTGA
|
Protein sequence | MSDAMIEGFR LSPQQQHVWA LQQLDSAQPY RTQGTIVIEG SLDIARLQAS LLQVVQRYEI LRTTFHYLQG MALPLQVVTE LSELALPSYD LSEHSLTDLQ SQLAQQAFDF AAGPLLHAWL GREHALKHYL LLSVPTLCAD NLSLVNLTSE LAAIYAAEPT TDEPMQYIDI AEWQHELLEA EETAAGRSYW QQQTWHDAIT VRPAFVANQA AAQTFQPQQL PIPLTEQLIE LLNQASQTLA VPASALALAA WRTLLWRLSD QTNGVVGVVC DGRKYAELET TLGLFAKAVP LASPLAADQQ FGQLVKQVQQ EYVEAYAWQE SFHWPAEVSS QLAFFPFGFE ALTTPKPLVM ANLSFQVVQQ RATLNPYAAH LTWFNQPQGF AASLNFDAGL IASSSAERLI EQYQTLLTAA LSNLNTSLAQ LPIVGTNERQ QLLIEFNQTA EPFDAARCFH ELFSAQAAIT PDHPAVVVED QQLSYAELEA RSNQLARELL ARGVRPDQPV ALALDRSLNL LVGILGILKA GGAYVPLDLG LPKERLGFML GDIQASIVVS ETSLQAQLPE HAADYLWLDQ AWPTIAEHSS EPVAASAVPS NLMYIIYTSG STGQPKGVGV SHQSLYNYIS SISQRLNLPP QASFASVSTF AADLGHTAIF PTLTNGGTLH LITAERASNA SQLADYMQQH AVDCLKIVPS HLAALLAVAE PARVLPRQRL ILGGEAVSWK LLQTLALLAP DCQVFNHYGP TETTVGVLTN PLSANLPSAQ SAIPALGRPI ANTQIYLLDV HGQPVPLGMT GELYVGGAAL ARGYWQRPAI TAERFVPDGL SGQTGSRLYR TGDVARYLPD GKLEFLGRAD DQVKIRGFRI ELGEIEAALR NHTAIEQAAV IVRDDPAGDK RLVAYLVAGQ QRPLSLRELR NFLKQSLPDY MVPAAFVMLE RLPLNANGKL DRQALPAPEQ QQTKASTQIV APRTPVETTL VDIWSQVLGG KSVGINNNFF ELGGDSILSI QIIARASQAG LKLTPKQLFD HPTIADLAQV VATTTADQQA QQQLITGPVP LTPIQHWFFE QALAEPQHYN QAVFFEVRFD LDPAILAQVL PELVRHHDAL RLRFSPSEQG WQQVNSADVA VELLHINLAA APAEQQRQLM EQKATELQTS LDLINGPLLR MALFELGSNQ PSRLLVIVHH LAIDTVSWQI LFADLPLVYE QIRQQQPINL PAKTNSFKDW AERLQRYAGS AELEREVAYW LDPTRQQVRP LPVDYAAEAH ANTVASTQNL SLHLSVEETK ALLEVVPPVY NTQINDALLA ALTQSISQWQ GNPSVLVELE GHGREDILDD LDISRTVGWF TSRFPVLLQA SKSANAGDSL RATKEQLRQI PQRGIGYGLL RYLRGDAQLS QQLANLPQPQ LSFNYLGTVA HDVSQTGPLA WTSESSGPTR SPAALRRHYL DLTILVTDHM LQMNWTYSQA LHSAATIQRL AERFVSALQA IIQHCQQPNA GGYTPSDFPS ANLNQKNLDS FIAKLRNSEN STHES
|
| |