Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1881 |
Symbol | |
ID | 5733770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2248321 |
End bp | 2253390 |
Gene Length | 5070 bp |
Protein Length | 1689 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279025 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544652 |
Protein GI | 159898405 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAGC ATTCAGATGA TTCGTTGCGC AACCAGACTG ATCCTGCATC TCGGCGAGTT CCGCTTTCGG CGGCCAAAAA AGCGCGGCTC GAAAAACGCC TGCGCGGTGA TAGCGACAAT CAGCAGGCTG AATCAAGCAT TCCTCGCCTC GCGACCTATG ATCAGGTGCC ATTATCGTTT GCGCAAGAAC GGCTGTGGTT TCTCAGCCAA TACGAACCAA CGAGCAGCAA CGCCTATATT ATTCCCTTGG CCGTGCGGAT CGATGGCCCC ATTGATCCAT CGCTGTTGGA GCAGGCTTTG CAATTGGTGG TGGATCGCCA TGCCAGCTTC CGCACCACGT TTCATGCGCA GAATGGGGTG CCGTTTCAAC GGGTTGCCCC ACAGCTGCCG CTCAGGTTGC CCGTGCTTGG GCTGGACGTG GCTGATGCGA GCGATGAGGC GGCGGTGTTG CAGGTCGTGC TGGATCAGCT TGTACCGCTG TTGCAGCTAC CGTTTGATCT TGAGCATGGG CCGTTGTTGC GAGCGACCTT ACTGCGTTTA GCTGCCGAAT CGCATGTGCT GCTGCTGATT TGCCATCATA TTATTAGTGA TGGCTGGTCG ATGGGGGTGC TGCTGCGCGA TTTTGCCAGC TTTTTAGGCG CGTTACGCAC CAATACTGCG CCGGATGTAC CGCCGTTGGT GGTGCAAGCC CCGGATGTCG CGGTGTGGCA ACGGCAACGC TTGCAAGGCC ACTATCTGAC GACACTCCAA GATTTTTGGA AGCAGCAGTT GGCGGATCTC GAACCATTGA ATGTGCCTAC CGATTTTGTC CGACCGGCCC AGCAATCCTA TCGTGGGGCG ACATTGAGTT TTCAGCTCCC CGCTGCGCTC AGCACCCAGC TTCAGCGTAT GGCGCAACAG CATGATGTGA CCCCATTTAT GCTGCTGCTT GCCGCATTTC AGGCTTTTTT GGCGCGACTG AGTGGGCAAC AGGATCTGGC GATTGGTTCC GTGCTAGCGA GTCGGGCCGA TGCTGACCTT GATCCGGTGA TTGGTTTTTT GGTGAATACC TGGACGTTGC GGAACAACAT AGATGTAGCG CAGCCATTGG CGCAGCTCTT GCCCACGGTG CGACGTACCG TCCTGGCAGC GTTTGAGCAC CGCGATTTAC CGTTTGAGCA GGTGGTGCAA CTGGTGCAAC CGGAGCGTGA TCTGAGCCGT TCGCCCGTGT TTCAGGTGAT GATGACCTAT CAAAACGTGC CGCAACGGCA GATGGAATGG GGGGATGTTC GGTTGACCCC GATTAGCCTG CCCAGCACGG TGGCGAAATT CGACCTAACT CTGGCGCTGA GCGAAACCCC CGAGGGTTTT CGTGGGGTGA TGGAATATCG GAGCGATTTA TTTCGGCGCA GCACGATTGC CACGATGGTT GCGCGTTGGG AAATGTTTTT ACACGGGATT GTGGCAGACT TTACCACGTC CATTGCCCGA TTGCCGTTGG TCTTGCCTGC GGAACGCAGC TTATTGCTTG ATACGTTGAA TGCGACCACA ACCGCCTACC CACACGATCA AAGCGTAGCA AGTTTGTTTG CCGAACAAGC CCGCCTGTGG CCGGAGCGGA TTGCGCTTCG TTTTGGTGAG CACAGCCTCA GCTATCACGC GCTTGAGCAA CGGGCCAACC AGCTAGCGCA CCATCTGCAA CTGCTGGGTG TTGGGCCAGA GCATGTGGTT GGTTTGTGTG TTGAGCGCTC GTTGGACTTA GTGGTGGCGA TTCTGGCGAT TCTCAAGGCT GGCGCAGCCT ATGCCCCGGT CGATCCGAGT TATCCCGTTG AGCGTTTGGC CTGGATGCTG AGTGATTTAC AGCCAACGGT GGTGATTGCA CAGCACGGCG TGCTCGACCG CTTACCGTCG GTTGCGTGTT CCGTGGTTGT GCTTGAAACC ATAGCCGCGC ACCTCGCAGC GTATCCCACG ACTGCGCCAA CCGTGGACAT CAGCCCCGAA AATTTGGCCT ATGTGATGTA TACCTCTGGT TCAACAGGCC GACCCAAAGG GATTATGATC AATCAGCGGA ACATTGTGCG ATTGGTCCGC AACACCACGT ATGCGGCATT TGGGCCAGAC CAGGTTGGGT TATTGCTGGC AACAGTGGCA TTTGATGCTT CGACGTTCGA ACTTTGGGGG TGTTTGCTGA ATGGTGGACG CTTAGTGATC GCCCCACCGC AGCAACTCAG CCTTGCCGAA TTGGGCCACT TGGTGGAGCG CGAACAGATT ACGACGCTCT GGTTGACCGC CGGATTGTTC CATCAAATGG TGGATCATGC GCTGGATCGA TTGGGTTCGT TGCGTCAATT ACTGGCCGGT GGCGATCGAC TGTCGCCCGT GCATGTACAC AAAGTGCTGG AACGCTGGCC GCAGTGTCGC CTGATTAATG GGTATGGCCC AACGGAAAAC ACCACATTTA GCTGTTGTCA GCAGCTTAGT GCAACCACTG ACCTGGCGCA GGGCGTGCCG ATTGGGCAGC CGATTGCGAA CAGCACGGCC TATATTCTTG ACCGGTTGTT GCAACTGGTT CCCATAGGGG TTGTAGGCGA ACTGTATTTG GGTGGCGCAG GCTTAGCGCG AGGGTATTTA GCGCGTCCAG ACCAGACGGC GGCGGCATTT ATCCCGAACC CCATGAGCCA AACGGCGGGC GAACGCCTGT ATCGCTCGGG GGATCTGGCG CGGTATCGCG ATGATGGGAC GATCGAATTT ATTGGACGAC GGGATCAGCA AGTCAAGGTA CGCGGGTATC GGATTGAGCT GGAAGAAATC GTTGGCGTGT TGCTGGCACA ACCACAGGTG GATGATGCGG TGGTGGTGGT GCGGGAGGAT CGGGTTGGTG ATCAGCGCTT GGTGGCCTAT CTGGTGGGTG ACAATCCGGC GATTGAGCTG ATTGAACAAG CGGTGCAAGG CCAGGTCCCG AGCTATATGC TCCCGAGTGC CTATGTTGTG CTTGATGCCT TGCCGTTGAC GGCGAATGGC AAGGTTGATC GGCGGCGGTT GCCAGCGCCG AGCTATGCCG CCATCGCGAA CGATGATCCG CCACAAACCG ATTTAGAGCA GGCGATAGCG GCGATTTGGG CCGAGGTCTT GGCGGTGCCG AGCATTCAAC GCCAGACCAA CTTTTTCCAA GTAGGTGGAC ATTCGTTGAG TGCAACTCAG TTGATTGTGC GGTTACGCCA AATGCTGAAT CGCGATCTGC CCTTGCAATT ATTGTTCGAT TATCCTTATT TATATCAGCT TGCCGAGCAG CTTGAGCAGC AACCAACAGC CCTGCCAACC GCGATTCAAC CAATTCCACG TCATCAGCGC CTGCCGTTAA CGTCTGCCCA GCAACGGGTT TGGTTTTTTG AGCAATTAGT GCCCAATACG GCGATGTATA CGATTGCTTT GCAACTACGA TTAAGTGGCA AGCTTGAGCC AGCCTTGTTG CAACAGGCGA TTAATCTGCT GATTGCTCGC CATGAGATCT TGCGAGCGTC GTTTCATGGG CAGGCTGGGC AGCCTTGGCT ACACATTGCT AACCACTTAA CGCTTGATCT AGAGTTGATT GATCTTCGGA AAATAGATAT TGAACAGCAA TCGATCGATG TTCAGAAAAT ATGCCAACGT TTAGCATATG CCAGCTATCA ACTTGAGTAT GCGCCCTTGC TGCGTTTTTG CTTAATCCAG CTTGGCGTAC AAAGCGCGAT ACTATTATTC ACTATTCAGC ATAGTATTAC CGATGCTTGG TCGATTGATC TGCTTTTGCA GGAGTTATGG CAAATCTATG CTGATTTAAT CCACCAGCGC CCGCTGAGCC TTGCTGAATT ATCAGTGCAA TATGTTGATT TTGCTGCTTG GCAAGCTGCT TGGTTGGCTA CGCCCGCAGC GCAACGCCAG TTAAGCTACT GGCTTGAACA GCTCGCCGAA GCTCCACGCT TGTTGGCCTT ACCAACCGAT TATCCCCGCC CCGACAACCA AACCTTTGCT GGCTCGGTGG TGAGCCTTGA ATTACCTCAG CCTTTGACCC ACGAGTTGCG TAGTTTGAGT CAACATCAGC ATGTAACCTT GTTTATGTTG TTGTTGGCGG CTTTCAAGAG CTTGCTTTAT CGCTACACTG GGCAAACTGA TCTCTGCGTT GGTTCGCCGA TTGCCAATCG TGGCCAGCCT GAGCTGCAAA CAATGCTTGG CTTTTTTATC AATACCTTGG TGCTGCGTAG TCGCATTCAG CCAGCTTGGT CTTTTCTTGA GCTCTTGGCT ACGGTGCGAG CCACAACCTT GGCTGCCTAT GCCCAGCAAG ATCTCCCCTT AGAAAAGATT ATTGAACAGC TCAAACTAGA GCGCGATTTG AGTTATAACC CATTGTTTCA GGTGATGTTT AATTTTCGTC ACGATTTTGC GATCAATCGC CAGCATTATG AGCTAGCGAT TGAGGCTGAA ATGTTGGCGA ATGGTACATC AAAATTTGAT TTAACTTTAG ATGTAGCCGA TCGTGGCTCC ACGTTATTGC TATGGGTCGA ATATAACTCG GCCTTGTTTG CCCCAGCGAC GATTGAGCGG ATGCTGGCAC AATTTCAGGT CTTACTGAAT AGCATTTGTG CGCATCCTGG GCAACAATTA AGTACACTGG AGCTGCGAAC ACCCAATCAA ATTGCTCAGT TAGAACAAGC GCGAGTTGCG CTATTACAGC AACTTGAGCA GCATCCAGCG ATTAGCCAAG CAGTTGTATT GGTTCGGCCC GATTTGCCGC AAGCCAACTG GGTCGTTGGC TATGTTGTCA AACAGCCAAA TCAGCAGCTT GAGCTTGGTC AATTGCAGGC AAGTATCCAA ACAACCTACC CATTGGTACA GTTGGCACTT TATGAAGTGC CCGATATGCC CTTGGATGCG GCTGGCACGG TTGACCACGC TGGATTAAGC AAGTATGGTC AGCCATTGCT TAACCAGCAA TTACAGCAGC CCGCCGTGCA ATCCCCGACT GAAGCAATGC TTGCCGAAAA ACGGGCTAAA CTTTCCGCAG ATAAACGGTC ACTCCTAGAG AAGCGTTTGA AACAATTACG AGGTGAATAA
|
Protein sequence | MSEHSDDSLR NQTDPASRRV PLSAAKKARL EKRLRGDSDN QQAESSIPRL ATYDQVPLSF AQERLWFLSQ YEPTSSNAYI IPLAVRIDGP IDPSLLEQAL QLVVDRHASF RTTFHAQNGV PFQRVAPQLP LRLPVLGLDV ADASDEAAVL QVVLDQLVPL LQLPFDLEHG PLLRATLLRL AAESHVLLLI CHHIISDGWS MGVLLRDFAS FLGALRTNTA PDVPPLVVQA PDVAVWQRQR LQGHYLTTLQ DFWKQQLADL EPLNVPTDFV RPAQQSYRGA TLSFQLPAAL STQLQRMAQQ HDVTPFMLLL AAFQAFLARL SGQQDLAIGS VLASRADADL DPVIGFLVNT WTLRNNIDVA QPLAQLLPTV RRTVLAAFEH RDLPFEQVVQ LVQPERDLSR SPVFQVMMTY QNVPQRQMEW GDVRLTPISL PSTVAKFDLT LALSETPEGF RGVMEYRSDL FRRSTIATMV ARWEMFLHGI VADFTTSIAR LPLVLPAERS LLLDTLNATT TAYPHDQSVA SLFAEQARLW PERIALRFGE HSLSYHALEQ RANQLAHHLQ LLGVGPEHVV GLCVERSLDL VVAILAILKA GAAYAPVDPS YPVERLAWML SDLQPTVVIA QHGVLDRLPS VACSVVVLET IAAHLAAYPT TAPTVDISPE NLAYVMYTSG STGRPKGIMI NQRNIVRLVR NTTYAAFGPD QVGLLLATVA FDASTFELWG CLLNGGRLVI APPQQLSLAE LGHLVEREQI TTLWLTAGLF HQMVDHALDR LGSLRQLLAG GDRLSPVHVH KVLERWPQCR LINGYGPTEN TTFSCCQQLS ATTDLAQGVP IGQPIANSTA YILDRLLQLV PIGVVGELYL GGAGLARGYL ARPDQTAAAF IPNPMSQTAG ERLYRSGDLA RYRDDGTIEF IGRRDQQVKV RGYRIELEEI VGVLLAQPQV DDAVVVVRED RVGDQRLVAY LVGDNPAIEL IEQAVQGQVP SYMLPSAYVV LDALPLTANG KVDRRRLPAP SYAAIANDDP PQTDLEQAIA AIWAEVLAVP SIQRQTNFFQ VGGHSLSATQ LIVRLRQMLN RDLPLQLLFD YPYLYQLAEQ LEQQPTALPT AIQPIPRHQR LPLTSAQQRV WFFEQLVPNT AMYTIALQLR LSGKLEPALL QQAINLLIAR HEILRASFHG QAGQPWLHIA NHLTLDLELI DLRKIDIEQQ SIDVQKICQR LAYASYQLEY APLLRFCLIQ LGVQSAILLF TIQHSITDAW SIDLLLQELW QIYADLIHQR PLSLAELSVQ YVDFAAWQAA WLATPAAQRQ LSYWLEQLAE APRLLALPTD YPRPDNQTFA GSVVSLELPQ PLTHELRSLS QHQHVTLFML LLAAFKSLLY RYTGQTDLCV GSPIANRGQP ELQTMLGFFI NTLVLRSRIQ PAWSFLELLA TVRATTLAAY AQQDLPLEKI IEQLKLERDL SYNPLFQVMF NFRHDFAINR QHYELAIEAE MLANGTSKFD LTLDVADRGS TLLLWVEYNS ALFAPATIER MLAQFQVLLN SICAHPGQQL STLELRTPNQ IAQLEQARVA LLQQLEQHPA ISQAVVLVRP DLPQANWVVG YVVKQPNQQL ELGQLQASIQ TTYPLVQLAL YEVPDMPLDA AGTVDHAGLS KYGQPLLNQQ LQQPAVQSPT EAMLAEKRAK LSADKRSLLE KRLKQLRGE
|
| |