Gene Information |
Locus tag | Haur_1874 |
Symbol | |
ID | 5733763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2221657 |
End bp | 2226318 |
Gene Length | 4662 bp |
Protein Length | 1553 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279018 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544645 |
Protein GI | 159898398 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720; [TIGR01733] amino acid adenylation domain |
|
|
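The length fields in the record above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(2221657, 2226318)
    print(length, "bp")                      # gene length from Start bp / End bp
    print(protein_length_aa(length), "aa")   # protein length after dropping the stop codon
```

With the values listed for Haur_1874, this yields 4662 bp and 1553 aa, matching the Gene Length and Protein Length fields.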
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT TTGCCAAACG CCTAGCTGCT CTTTCGCCAG AGCAACGAGC CTTGCTTGAA AAACGGCTCA ACCAGAAAAA AGCCGAACCA AACAAACAGC AGATTCCCCA CGTCGAACGG CAAGGTGCAG CCTATGCCTT ATCGTTTGCC CAAGAACGTT TGTGGTTGAT GTACCAACTT GATCCGCAGA GTGCGCTGTA TAACGTGCCG GTCGTGATTC GCTTTGGGCC AAACTTTGAC GTGGCCTTAG AGCAACGGGT TTTTCAGGCG ATTATTGAGC GCCATGAAAT TCTGCGTACC ACGTTTAAAA CCGTCGATCA ACAACCAGTC CAAATGCTTG AGCCTGTGCC AGATTTTAGG CTCCCAGTGA TCGATTTGCG GTCTTTTCCC GAGGAGCAGC AAGCGTCCGA GGCCCAAAAA CACACCCTTG CTGAAGCTCA AAAGCCCTTT GATTTGAGCA ATGGGCCATT GTTGCGCGTG GTTTTGCTGC GCTTAACTCA TGAACATCAC TTAATTATTA ATTTGCACCA TATTGTTTCT GACGATTGGT CGCCTGGGGT TTTGGTCCAA GAGATTGGGG CTTTGCATCA GGCATTCAGT CAAGGCCAGC CTTCACCATT GGCTCCGTTG GCAATACAGT ATCTTGATTT TGCCGTATGG CAGCGCCAAC GCCTAAACCA GGCCAGTGTC TCCCAGCAAT TGGAATATTG GCAAAACCAA CTCTCGGGTT CATTACCCTT GTTGGCCTTG CCAACCGATC GCCCACGCCC ACGGATTCAA ACCTTCAACG GGGCAAAAAT CAGTCGGCGA ATGCCTGCCA GCCTACTTAA ATCGCTCAAA CAGTTGACTG CCCAAACTGG CGCAACCTTG TTTATGAGCG TTTTGGCGGC CTATAAGATC TTGTTGCAGC ACTATACAGG CCAAACAGAT TTGCTGATTG GTACGCCAAT TACCAGTCGA ACCCGCCCTG AATTAGAGCC ATTGATCGGC TGTTTTATTA ATATGTTGGT CTTGCGTAGC CAACTTAATC TTGAGCAAAC CTCCCGCGAG TTTATTCAGC AGGTGCGCCA AATGGCCTTG GATGCTTATG CCAATGCTGA AGTGCCGTTT GAGCGCTTGG TGCAGGTGTT GCGGCCTGAG CGTAATCCGA GCTATACCCC AATTTTTCAG GCAGCCTATA TTCTGCAAGA TCCCCAAGAA GCTCGTCAGA AGATGCCTGA AAATGTACTT GGTTATGCCG AGGTTGATAC TGGGACTTCT AAGTTTGATT TGACCTTGGA GATCGATGAA GTTGATGGTG AATTAGCTAC AGCTTTTGTG TATAGTACTG ATCTCTTCGA TCAAGCAACC GTCGAACGCA TGCTTGGGCA TCTGCAACAG ATTCTTGAAA CCATGGTTGC TCAGCCGGAT TTGCCGATTG CTGCGATTGA ATTGGTTACC AGTGCCGAAC GCCAGCAATT GTTGATCGAG TGGAATGCTA CTGAAACAGC CTTTGAAACA GATTTTGTTC AGCATGCAAT TGCCCGTCAC GCTCAAACCC AACCAGATCA ACTGGCCTTG CGCTATGGTG ATCAGCAGTA TTCCTATGCT GAATTGAACC AGCATGCTGA GCGCTTGGCG ACCTATTTAC AGCAATTAGG CGTAAAACCA GAATGTGTTG TTGGTTTGTG CGTTGAGCGC ACTCCTGCAA TGGTGATTGC GATTCTAGCG ATTTTTAAAG CTGGCGGCCT GTTTTTGCCA CTTGACCCCA GCTTTCCTGC TGATCGTTTG GCTTATATTG TTGCCGATGC CAAGCCTTTG 
GTCGTGCTAA CAACTGCTGC CTTAGCTGCT GAATTGCCCT TAGAAGCTCC CCATATTGTT GCACTTGATC AAGCTTGGCA TGCCCATATT CAGCAGGTTG ATGCGCCAAA TCATCAACTG CAACCCAGCA ATTTGGCCTA TATGATTTAT ACCTCGGGCA CGACTGGCAC GCCTAAAGCG GTTTTGGTTA CCCATCAAAA TCTGTTGAAT GTGTTGTTGG CAAGTCAGCA AGCGTTTGGC TTCAATCCTC GTGATGTGAT GCCGTGTATT GCCCCATTCT CATTTGATAT TTTTCTGTTT GAATTGCTCA ACCCATTGCT TGCTGGCGGC ACATCGTGGA TGCTTACCCG CGAAGAAATT TTGGATATTG CTGGCTTGAT CGAGTCGCTG GCTTCGATGA GTGTGATTCA CACTGTCCCA AGTTTGATGC GTCAATTGGT CAATGCCTTG GAAACTGAAG GCTATACTGC CGCTGCTTGT CAAAGCATTC GGATGATTTT TATTGGTGGC GATTTAGTGC CGCCTGAATT GTTAAATGCG ATGCGGCTCG CCTTTCCGCA AGCCGCAATT CATGTGTTGT ATGGCCCAAC CGAAGCCACG ATTATTTGTA CCAGCTATCG TGTGCCCCAA CAGGGCTTGC TTGAGCGCCA TTTGATTGGG CGACCACTAC CAAATATGGC GATTCGCTTG TATGACCCAC AGCAAAACCT CGTACCAATT GGTATGCCAG GCGAACTGTA CATCGGCGGG GCTGGGGTTA GTCGGGGCTA TTTGAATCGC TCGGAATTGA CTGACGAGAA ATTTGTCGAG CTTGATCAGC AGCGTTGGTA TCGTACTGGC GATTTGGCGC GTTATCAGGT TGATGGAAAT CTCGAATTTT TAGGCCGCAT CGATCAACAA GTTAAAATTC GTGGCTTTCG GATTGAGCTT GGCGAAATTG AAGCAGTACT GGCGCAACAT CCTAGCATTC GGGAAGCGGT GGTGGTTGCC CGTGAAGATC TGCCTGGCGA TAAGCGACTC GTAGCTTATT TGATTGCGGA ATCAGAACAA ATGCCGCATA TTGGTGAATT ACGGGCATTT TTGCAAACCA AACTGCCTGA ATATATGCTG CCTGCAGCGT TTATGGTGTT GGAGAGCCTG CCGCTGACCC GCAATGGGAA GGTTGATCGT CAAGCGTTGC CTGTGCCGCC CACCACCCGT GAGCATTTGG CCAATCAATT TGCTGCACCA ACCAATCAAC TCGAAACGCT GCTGAGCACG ATTTGGGCCG AGGTGTTGGG GCGCGAACAG GTTGGCATTC ACGACAATTT CTTTGAACTC GGCGGCGATT CGATTCTGAG TTTGCAAATT GTAGCGCGAC TCAATCAGGC TGGCTATCAT GTGCTAACCA AAGATATATT TCAGTATCAA ACGATTGCTG AATTGGCACA GGTGGTTTCG AGCACCACGC TGGTTGTGGC CGATCAAGGC TTGATCGAAG GTGCCGTGCC GCTTACGCCC ATTCAACAGT GGTTCTTTTG CCAAAATCTA CCTAATCCAC ACTATTTTAA TAGTATGCCA GTGTTGCTTG AAGCGCCTGC CGAGCTTACT CAGGCTGATT TGCACTCGAT CGTTGCCCAA CTTTTGCAGC ATCACGATGG CCTGCGCTTG CGCTTCGAGT TGGTTGCTGA CCAATGGCAA CAAACCCACG GCTCGCTTGA GGCTGATTTA CCACTAGCGA TAATTGATCT CAGGGGCTTG AATCAAGGAA CCCAAACCCA AACGATTGAA GCAACCGCGA TTGAATACCA AACCAAACTC GATTTGAGCA 
CTGGCCCGCT GATTTGGTTT GTGCTATTCG AAGCAGAGTT GAGCAAGCGC CTATTAATTG TGGCGCATCA CCTTGTATTT GATGGAATTT CGCTGCGGAT TCTGTTGGAA GATTTACAAA CGGCCTATGC TCAACTACAG GCAGGGCAAT CGATTAATTT ACCGCTCAAA ACCAGTTCGT TCAAAACTTG GGCTTTGGCG CTTCAGGAGT ATGCCCAATC ACCAGAAGTC GCCCAACAAG CCAGCTATTG GCAGACGATT CAGCATACTC AATCACCTTT GCCGCTTGAT CATAGCGGTC AGGCCAATAC TGAAGCCTCA AGCAGTATTG TCTTGGCCCG ACTTGAGGTC GCAGAAACCG ACGTATTGCT GAACCAATTG CCAACGCTCT ATCATGCCAG CCTTGAAGAA GTATTGCTAA CCGCTTTAGC CCAAACGATT GGCGAATGGA CGTATAGCCA AAGCCTTGTG GTTGATTTAG AAAGCCATGG CCGCGCCGAA TCGATTGCTG AAAATCTCAA TCTTTCGCGC ACGATTGGCT GGTTTACCAG TCTTTATCCA GTAATTTTGG ATTGGACTGG CTTTGATGGG CCGCTTGAAA TGCTGAAAGT GATTAAAGAA ACCCTGCGCC AAGTGCCTGA ATATGGGCTT AGTTATGGCT TATGGCAGTT CAATCAACCT AATCCTAGCG CCAATTCACA CGCTGAACTA CGCTTTAATT ATCTTGGTCA ATTAGGGGGT GCGGCTCAAA AAGCAGCCTT TGAGTTATTG CCCCAACTTG AAGTGCCACT GCGTGACCCT GCTAGCACGC GATCGCATGT TTTAGACGTT GATGTGGTGG TGGTGCAACA GCAATTGTGG GTGCGCTGGA CGTATAGCAA TCACTTACAT GAGCCAGCCA CGATTACCCA GCTTGCCGAG CGCTTTATGG CGGCATTACG CGATTTATTG CAAGGCGATA GCGCAACTAG CGCCGCGTAT GTTCCATCCG ATTTCCCGAT GGCAAACCTT GATCAGCAGA CATTGAACTC ACTTATGAAG AAACTTCGCT AG
|
Protein sequence | MSDFAKRLAA LSPEQRALLE KRLNQKKAEP NKQQIPHVER QGAAYALSFA QERLWLMYQL DPQSALYNVP VVIRFGPNFD VALEQRVFQA IIERHEILRT TFKTVDQQPV QMLEPVPDFR LPVIDLRSFP EEQQASEAQK HTLAEAQKPF DLSNGPLLRV VLLRLTHEHH LIINLHHIVS DDWSPGVLVQ EIGALHQAFS QGQPSPLAPL AIQYLDFAVW QRQRLNQASV SQQLEYWQNQ LSGSLPLLAL PTDRPRPRIQ TFNGAKISRR MPASLLKSLK QLTAQTGATL FMSVLAAYKI LLQHYTGQTD LLIGTPITSR TRPELEPLIG CFINMLVLRS QLNLEQTSRE FIQQVRQMAL DAYANAEVPF ERLVQVLRPE RNPSYTPIFQ AAYILQDPQE ARQKMPENVL GYAEVDTGTS KFDLTLEIDE VDGELATAFV YSTDLFDQAT VERMLGHLQQ ILETMVAQPD LPIAAIELVT SAERQQLLIE WNATETAFET DFVQHAIARH AQTQPDQLAL RYGDQQYSYA ELNQHAERLA TYLQQLGVKP ECVVGLCVER TPAMVIAILA IFKAGGLFLP LDPSFPADRL AYIVADAKPL VVLTTAALAA ELPLEAPHIV ALDQAWHAHI QQVDAPNHQL QPSNLAYMIY TSGTTGTPKA VLVTHQNLLN VLLASQQAFG FNPRDVMPCI APFSFDIFLF ELLNPLLAGG TSWMLTREEI LDIAGLIESL ASMSVIHTVP SLMRQLVNAL ETEGYTAAAC QSIRMIFIGG DLVPPELLNA MRLAFPQAAI HVLYGPTEAT IICTSYRVPQ QGLLERHLIG RPLPNMAIRL YDPQQNLVPI GMPGELYIGG AGVSRGYLNR SELTDEKFVE LDQQRWYRTG DLARYQVDGN LEFLGRIDQQ VKIRGFRIEL GEIEAVLAQH PSIREAVVVA REDLPGDKRL VAYLIAESEQ MPHIGELRAF LQTKLPEYML PAAFMVLESL PLTRNGKVDR QALPVPPTTR EHLANQFAAP TNQLETLLST IWAEVLGREQ VGIHDNFFEL GGDSILSLQI VARLNQAGYH VLTKDIFQYQ TIAELAQVVS STTLVVADQG LIEGAVPLTP IQQWFFCQNL PNPHYFNSMP VLLEAPAELT QADLHSIVAQ LLQHHDGLRL RFELVADQWQ QTHGSLEADL PLAIIDLRGL NQGTQTQTIE ATAIEYQTKL DLSTGPLIWF VLFEAELSKR LLIVAHHLVF DGISLRILLE DLQTAYAQLQ AGQSINLPLK TSSFKTWALA LQEYAQSPEV AQQASYWQTI QHTQSPLPLD HSGQANTEAS SSIVLARLEV AETDVLLNQL PTLYHASLEE VLLTALAQTI GEWTYSQSLV VDLESHGRAE SIAENLNLSR TIGWFTSLYP VILDWTGFDG PLEMLKVIKE TLRQVPEYGL SYGLWQFNQP NPSANSHAEL RFNYLGQLGG AAQKAAFELL PQLEVPLRDP ASTRSHVLDV DVVVVQQQLW VRWTYSNHLH EPATITQLAE RFMAALRDLL QGDSATSAAY VPSDFPMANL DQQTLNSLMK KLR
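The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. The sketch below is a minimal, dependency-free illustration and not the database's own tooling. It builds the codon table from the amino acid string of NCBI translation table 11 (identical to the standard code for sense codons), strips the spaces that group the sequence into blocks of ten, and stops at the first stop codon. It also includes a simple GC percentage helper. The names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGAGCGATT TTGCCAAACG CCTAGCTGCT"))
```

Applied to the first thirty bases of the gene sequence, `translate` returns `MSDFAKRLAA`, the first ten residues of the listed protein sequence. Passing the full gene sequence to both functions allows comparison against the Protein sequence and GC content fields.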