Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0353 |
Symbol | |
ID | 5708025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 395910 |
End bp | 399074 |
Gene Length | 3165 bp |
Protein Length | 1054 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641269879 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001535274 |
Protein GI | 159036021 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0224507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.487426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACGG CCAGCGGAGG TGTACCGGTG GACAGCGTTG GAGAGGTGTC GTTCGCGCAG GAACGGCTGT GGTTCCTCGA TCAGCTGCGC CCCGGCACGC CGGACTACCT GCTGCCGCTC GCGCTGCGTA TCCGCGGGCC GCTTGACGTC ACCGCCCTCA CCGAGGCTTT CCAAGCGATC GTGGATCGTC ACGAGGTGTT GCGCACCCGC TACGTCGAGG TCGATGGCCG GCCGGTGGCC CACGTCGACG CGCACACCAA GGTCACCATC GCGTACACCA CTGATCACCA CGTCCTGGAG CGCGAACTCG CCCGCGCCAT CGACATCGCC GAGGACCTGC CGTTCCGGCT GTCGCTGGCC CGACTCGGCG ACGACCATCT CCTGGTGTTC GTCGTGCACC ACATCGCCTT CGACGGCTGG TCCTGGGGTG TGCTCGCCCG CGAGTTGGCC GCCGGGTACG CCGGCCGGAC GGCCGAGGTG AGCAAGCAGG CCCCGCAGTA CGACGGGTTC GCCCGCTGGC AACGCGAGCG GTTCACCGAC GAGCGCAGCC GCCGTCAGCT CGACTACTGG CGAGCCCAAC TCGTTGGTGC CCCCGCCATC GACCTACCCA CCGACCGGCC GCGACCGCGG ACCTGGGACG GCACGGGCGA CGTCGTCCGC GTCGACCTGT CCGCGACACT GCTGCGGGAG GTCGACGCGT TGGCTCGTAG CCGTCGCGTC ACCCGTTTCA TGGTCCTGCT CGCCGCCTTC CAGATCGTTC TCGCCCGTGC CAGCGGTCAA ACCGACTTCG CCGTCGGCAC GCCGGTCGCC GGCCGGACGC GGGTCGCGGA CGAGGACCTG ATCGGCCTGT TCGTCAACTC GATCGTGCTG CGCGCGGACC TGTCCGGCTC ACCCACGTTC GAGGAGCTAC TCATCCGCGT ACGGGACACC GCGCTCGGCG CGTTCTCCCA TGCCGAGACC CCGTTCGAGC GGATCGTCAC GGAACTCGCT CCCGAACGCG ACCTGTCCCG CAACCCGCTC TTCCAGGTGT CGTTCAGCCT GCTCGACGTG CGGGCTCCGA TGTCGCTTCC CGGACTGGAC GTCGAACTGG TGGAGCCACC ATTGACGGGC TCCCCGCTCG ACCTCTTCCT CGACATCAAC GTGCGGACGG ACGGCACCGC GGTGGCGCGG CTGCAGTACG CCACCGCGTT GTTCGACCAT GCCCGGGTGG AGCGGCTCGC CCGGGGGTTC GTCGACCTGC TCCGCACCAT CGTCGCCGAG CCCGAGATCA GCGTCAGCGG CCTGGCGACG CGACTGGAAC TGGGACCGGA CGGGGAGCGG GACCGCCTGC TGCACGCCTG GAACGGCACC GCCGAGGATC TGCCCGACGG CACGGTCGAC GGGCTGATCG CCGTTCAGGC GCAATCCACG CCGGACGCCG TGGCGGTGCG GACGACCGCC GAGGACATCA CCTATGCCGA GCTGGACACG AGGGTCAACC GGCTCGCTCA CCATCTGCGC GCTCTCGGCG TCCGATCAGG CTCGCTGGTC GCCGTGCTGC TCGACCGTGG GCCAGATCTG CTCACCGCTC TCCTCGCCGT GCTCCGGGCC GGCGGCGCCT ACGTGCCCAT CGACCCCGAA TATCCGGACG CCCGGGTCGC CTTCATCGTG GTCGACTCCG CCGCGGAGGT GGTGATCACT CGGTCCACGC TTGCCGACCG AGTTGGTGAC ACCGACGGAA AGCTCGTCTT GCTGGACCGG GACCGGGCCG CCGTGGCAGC CCGGCGGGCA GACGCCGTCG GTCCCACGGC GACCGCTGAC GACCTCGCAT ACCTGATCTA TACGTCCGGC TCGACCGGAA CGCCCAAGGG TGTGATGGTC CACCACCGGG CGCTGACCAA CTTCGTCACC TCGATCGTGC GGCGGCCCGG GCTCACCGCC AGCCAGTCGG TCGTCGCGCT CACCACGATC TCGTTCGATC CGTCGTTGCT GGAGCTCTAT GTGCCGTTGC TCGTCGGCGC GACGGTCGTC CTCGCCGACA CCGAGCAGGC CCGCGACCCC CAGCGGCTGA CCGACCTGGT CGCACTCACT CGTCCCGCGG TTCTGCAGGC GACCCCGGCG ATGCTGCGGG CGCTGCTCGA CACCGGCTGG GTTCCGCCGG CCAGGCTCAC CGTGTTGTCC GGTGGCGAGA AGCTGCCGTC CGAGCTGGCC CGGCGGCTCG CCACGGATGG GGCTCAGGTG TGGGACCTGT ATGGCCCGAC GGAGACGACC GTGTGGGTGA CCTCGGCTCG ACTCGACCCG GCCGGCCGGG TCGTGGACTG GTCGCCGCAG GCCAACTGCA CGGTCCACCT GCTCGACCGG CATGCCGAGC CGGTGCCCAT CGGTTCGGTC GGAGAACTGT ACGTGGGCGG CACCTGCGTC GCGCTTGGCT ACCGGGGTCA GCCCGCGCTG ACCGCCGAGA GGTATGTGCC CGACCCCTAC TCCACGACAC CCGGAGGACG CCTCTACCGA ACCGGCGACC TGGCCCGCCG CCACCAGGAC GGATCGGTCG AGATCCTTGG CCGTGCCGAT CGGCAGGTGA AGATACGCGG CCATCGCATG GAACCCAGCG AGATCGAGGC GGCGTTGCTC GGTCACGACG AGATTCGCGC GGTCGCCGTG CACCCGACCT CGACTCCCGC CGGCGAGCAG CAGCTGACCG CCTACATCGT CCCGCGAGGG AACACCCCGC CGCCGGTCGA GGGACTGCGG ACGTTCCTGC GGCGGACTCT GCCCGACTAC ATGGTCCCGG CGGCGTACGT GCCGATGGAG GCACTTCCAC TGACGCCCAA CGGCAAGGTC GACTACAACG CGTTGCCGGA ACCCACGATC CGGGTAGCCG TGGAGCGGGT GTCCCCGCGT ACCACCGAGG AACGCGTGGT CGCCGGCATC TGGCAGGAGG TTCTCGGCAG CAGCACCCAG ATCGGCGTGA ACGAGAACTT CTTCGACATC GGCGGACACT CGCTACTTGC CACCCGGGTT GCCGTGCGCC TCCGCGCCCA ACTCGGTATC GACGTTCCCG TCCGCGGTCT GTTCGACCAC AGCACGGTGG CCAGCCTCGC CGCCGCGCTC ACTGACTATC CGCAAGTCTC CCAGCGCGCC GCGATGCCCA CGCTGACCGC CCGGCGCCGC CGTGTCACGC GTTGA
|
Protein sequence | MNTASGGVPV DSVGEVSFAQ ERLWFLDQLR PGTPDYLLPL ALRIRGPLDV TALTEAFQAI VDRHEVLRTR YVEVDGRPVA HVDAHTKVTI AYTTDHHVLE RELARAIDIA EDLPFRLSLA RLGDDHLLVF VVHHIAFDGW SWGVLARELA AGYAGRTAEV SKQAPQYDGF ARWQRERFTD ERSRRQLDYW RAQLVGAPAI DLPTDRPRPR TWDGTGDVVR VDLSATLLRE VDALARSRRV TRFMVLLAAF QIVLARASGQ TDFAVGTPVA GRTRVADEDL IGLFVNSIVL RADLSGSPTF EELLIRVRDT ALGAFSHAET PFERIVTELA PERDLSRNPL FQVSFSLLDV RAPMSLPGLD VELVEPPLTG SPLDLFLDIN VRTDGTAVAR LQYATALFDH ARVERLARGF VDLLRTIVAE PEISVSGLAT RLELGPDGER DRLLHAWNGT AEDLPDGTVD GLIAVQAQST PDAVAVRTTA EDITYAELDT RVNRLAHHLR ALGVRSGSLV AVLLDRGPDL LTALLAVLRA GGAYVPIDPE YPDARVAFIV VDSAAEVVIT RSTLADRVGD TDGKLVLLDR DRAAVAARRA DAVGPTATAD DLAYLIYTSG STGTPKGVMV HHRALTNFVT SIVRRPGLTA SQSVVALTTI SFDPSLLELY VPLLVGATVV LADTEQARDP QRLTDLVALT RPAVLQATPA MLRALLDTGW VPPARLTVLS GGEKLPSELA RRLATDGAQV WDLYGPTETT VWVTSARLDP AGRVVDWSPQ ANCTVHLLDR HAEPVPIGSV GELYVGGTCV ALGYRGQPAL TAERYVPDPY STTPGGRLYR TGDLARRHQD GSVEILGRAD RQVKIRGHRM EPSEIEAALL GHDEIRAVAV HPTSTPAGEQ QLTAYIVPRG NTPPPVEGLR TFLRRTLPDY MVPAAYVPME ALPLTPNGKV DYNALPEPTI RVAVERVSPR TTEERVVAGI WQEVLGSSTQ IGVNENFFDI GGHSLLATRV AVRLRAQLGI DVPVRGLFDH STVASLAAAL TDYPQVSQRA AMPTLTARRR RVTR
|
| |