Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2948 |
Symbol | |
ID | 5707826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3340693 |
End bp | 3343866 |
Gene Length | 3174 bp |
Protein Length | 1057 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272397 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001537765 |
Protein GI | 159038512 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.735724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0141341 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCACG GAGCGGAGAG CACCCTTTCG CTCGGCCAGG AGCGGTCGTG GTTCATCGAC CAGTACGCCG GCGCAGCGGT CAACACGCTG GCGCACCGGG TGCGGCTACG TGGTGAGCTG CGCACCGACG CGCTCGGAGC TGCGTTGACC GACGTCATCG GGGCCAGCGA CGTGCTTCGC TGGGCGATCA GGTCGCACGA GGGCCAGGCC AGTCGGGTAC CGGCGCAGCC CCTACCTGCG GCCGTGCCGA CGACCGACCT GTCGACCCTG CCCGCGACGG AGCGCGAGGA CGCGGTGCTC CGCGCCGTGG AGCGTGCCGC CGGCAACCCA TTCGATCTCG GCCAGGGTCC GCTGCTCCGA GCCGAGCTAC TCCGCGTCGA CGTCGACGAC CATGTTCTCC TGCTCACCAC TCACCGGGCA GTGGTGGATG ACGCGAGTCT GTCGCTGCTC GTCAACCGGT TGGGCCGGGC GTACCGGGCC CGCCTGCACC GCGCCGAGCC ACTGCCGTCC GACGCCGGCG CCTTCGAGGT GTTCGTAGCC CGGGAGCGCG CCGGGGCGCA GCAGGGGCGC AGCGTGGCCG GTGCGGCCTG GCGGGACTAC CTCGCCGACG TCGTCACACT GGAACTGCCG ACCGACCGGG TACGCGGCCT GCAGCCCCAG TACACGGCCG CTACCGTCGA GACGATCCTG TCGCCGCCGC TGGACGACGC GGTGCGGGCC GTGGCCGATC GCCTCTCCGT GCCCGCCGGT GAGGTGCTGC GCGCGGCGTT CCAGGCCCTG CTGCACCGCT ACTCCGGTCA GCCGGTCGTG GCGCTGGGCG TGCCTGCCGG GCGCCGCGAT TCGGCCACCG CGGACCTGGT TGGTCCGCTG GCCACCTGGC TCGTCCTGCG CTCCGAGTTG CACCCGGACA CGACGCTGAC CGGCCTGATC ATGCCAGCCG GCGGGGCGGA CCGGCCTCCG GTCCTGCCCT TCGAGACCCT GATCGAGGAG GTCCAACCGC CCCGCGACCC GGCCCGCAGC GCGATCGTCC AGGCAACGTT CACCGCTCGG GAGGCTGCCA CACCCGCGGA CTTCGGCGCG GCTACCGTCG TCGACCAGCG GACCGTCCCG GCCACGGCCA CCCTGCACGA CGTGTCCGTG CTCGCCCAGC CGGGCCCGGC AGGCACCCGG ATCGGGCTCA CCTATCGCGC CGACCTGTTC GACCGGGCCA CCGCGGGGCG GCTGCTGGGT CACTACGTCA GCCTGCTCGA CGCGGCGGTT GCCCGCCCCG ACGAGCCGGT CGCCCGACTG CCCTACCTCA GTGTCACGGA GCGGACGCGG ATCCTCGACG AGTTCAACCG GACGGAGGCG CCGTTCCCCC GGGACGCCAC CGTGCACGAA CTCTTCGAGG AGCAGGTGCT GCGAAACTCG GACGCGCGCG CGGTGACCAT CGAGGGACAG CACCTCACGT ATCGAGAGTT GAACGAGCGA GCGAACAAGC TGGCCCACCG GCTGCGGTCA TGCGGGGTCG GACGCGGCAC GTACGTTGCC CTCTGCCTGG AACGATCCCT TGAGCTCATG GTGGCCGTCA TGGCGGTCCT CAAGTCCGGT GGGGCCTACA TACCGCTGGA CCCGGCCTAC CCGACCGACC GGCTCGCCTT CATGCTGGCG GACACCCAGG CCCGCTTCCT GGTCACCCAG CGCCGGCTGC GCGAGATGGC ACCGATCGAC GACGCCGCCA CGGTGATCGT GCTGGACGAC CCGGCCGACG CGGCGGTCGT GGCCGACCAG TCCGCGGTGA ACCCGGTCAA CGTGAACGCC GCCGAGGACC TGACGTACAT CGTCTACACC TCCGGCTCCA CCGGTCGACC CAAGGGAGTC GAGACGGTCC ACTTCGGTGT CGTTCGCCTC GTCGTCAACA CCGACATTCT CGAGTTGGAC GAGCGAACCA GCTACCTGCA GATCTCGCCG CTGTCCTTCG ACGCCTGCAC CCTCGAAATC TTCGGCCCAC TGCTCAACGG TGGCCGGGTC GTCCTGCTCC CGCCGGGCGT GCCGACACCA GCGCGGGTGG CCCACACCGT CCGGGAACAG GGTGTCGACA CCCTGTGGCT GGTGGCTCCC CTGGCCAACC TCACCATCGA CACGCACCTC GACGACCTGC GGGGGCTGCG CCAGTTCATG GCCGGTGGCG ACGTGCTCTC CATCCCGCAC ATCCGGCAGG TGCTGGACAA GCTGCCGCAC ATAAAGTTGA TCAACGGATA CGGCCCCACC GAGGTTACCG CCTTCAGCGT CAGCCACAAG ATCGACTACA TCGACCCGGA CTGGCCCTCG ATTCCGATCG GCCGGCCGAT GCACAACACC ACGGCCTACA TTCTCGACCC CCTCGGCCAG CCGGTGCCGA TTGGTGTGTG GGGCGAAATG TACCTGGGCG GCCCGGGCGT CGCGCTCGGT TACCACAACC GACCTGATCT CAATGCCGAG CGGTTTCTGC CGGACAACTT CCGCCCCGGG CCCGGGGCAC AGCTGTACCG TACCGGCGAC CGGTGCCGGT GGCTGCCCGA CGGCACCATC CAGTTCCACG GCCGACTCGA CACACAGGTG AAGATCGACG GCCTGCGGGT CGAGTTGGGC GAGATTCAGA GCGTGGTGGC GGGGCACGGG TCGGTGGCGG CGGCGGTGGT CACCGCGCCG GTGATCGGCA CCCGGCGCAC CCTCGTGGCG TACGTGGTGC CGGCGGATCC CGACGGTTTC GACGCTTCCG TACTGCGGGC GCACCTCACC GGCGTCCTGC CGAGCGTGAT GGTGCCCGCC CATTTCGTCA CCATGTCGAC GATTCCGCTG ACTCCGAACA ACAAGGTCGA CTTCCAGGCA CTACCCGAGC CGCAGTTCGG CACCGTGCGG GGGCACCGAC CACCGGAGAC GAGCACTCAG CAGGCACTGG CGGAGATCTG GCGTGAAATC CTGGGAGTGC CCGCCGTCGG GCTGGACGAC AACTTCTTCG AACTCGGCGG TCATTCACTA CGCGCGGTGC CGATGATCGC CGCCATCAGC ATGCGGTTCG GAATCGACCT TGCCGTTCAG GACATCTTCG AGGCACCCGG GCTGGAAGCT CTGGCGAGCC GAGTGGAGGA GCGGATGCTT GCGGCGATTC CTGCCGAGGA GTTGGAGAGA ATGTTCTCCG AGCTCGGCAA TTGA
|
Protein sequence | MRHGAESTLS LGQERSWFID QYAGAAVNTL AHRVRLRGEL RTDALGAALT DVIGASDVLR WAIRSHEGQA SRVPAQPLPA AVPTTDLSTL PATEREDAVL RAVERAAGNP FDLGQGPLLR AELLRVDVDD HVLLLTTHRA VVDDASLSLL VNRLGRAYRA RLHRAEPLPS DAGAFEVFVA RERAGAQQGR SVAGAAWRDY LADVVTLELP TDRVRGLQPQ YTAATVETIL SPPLDDAVRA VADRLSVPAG EVLRAAFQAL LHRYSGQPVV ALGVPAGRRD SATADLVGPL ATWLVLRSEL HPDTTLTGLI MPAGGADRPP VLPFETLIEE VQPPRDPARS AIVQATFTAR EAATPADFGA ATVVDQRTVP ATATLHDVSV LAQPGPAGTR IGLTYRADLF DRATAGRLLG HYVSLLDAAV ARPDEPVARL PYLSVTERTR ILDEFNRTEA PFPRDATVHE LFEEQVLRNS DARAVTIEGQ HLTYRELNER ANKLAHRLRS CGVGRGTYVA LCLERSLELM VAVMAVLKSG GAYIPLDPAY PTDRLAFMLA DTQARFLVTQ RRLREMAPID DAATVIVLDD PADAAVVADQ SAVNPVNVNA AEDLTYIVYT SGSTGRPKGV ETVHFGVVRL VVNTDILELD ERTSYLQISP LSFDACTLEI FGPLLNGGRV VLLPPGVPTP ARVAHTVREQ GVDTLWLVAP LANLTIDTHL DDLRGLRQFM AGGDVLSIPH IRQVLDKLPH IKLINGYGPT EVTAFSVSHK IDYIDPDWPS IPIGRPMHNT TAYILDPLGQ PVPIGVWGEM YLGGPGVALG YHNRPDLNAE RFLPDNFRPG PGAQLYRTGD RCRWLPDGTI QFHGRLDTQV KIDGLRVELG EIQSVVAGHG SVAAAVVTAP VIGTRRTLVA YVVPADPDGF DASVLRAHLT GVLPSVMVPA HFVTMSTIPL TPNNKVDFQA LPEPQFGTVR GHRPPETSTQ QALAEIWREI LGVPAVGLDD NFFELGGHSL RAVPMIAAIS MRFGIDLAVQ DIFEAPGLEA LASRVEERML AAIPAEELER MFSELGN
|
| |