Gene Information |
Locus tag | Sare_4894 |
Symbol | |
ID | 5707546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5555847 |
End bp | 5559098 |
Gene Length | 3252 bp |
Protein Length | 1083 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641274289 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001539634 |
Protein GI | 159040381 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
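The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the final codon of the CDS is a stop codon. The function names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 5555847
END_BP = 5559098
GENE_LENGTH_BP = 3252
PROTEIN_LENGTH_AA = 1083


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one for the stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 5559098 - 5555847 + 1 gives the 3252 bp gene length, and 3252 / 3 - 1 gives the 1083 aa protein length.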
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.950239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0263577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCG AGACCTACAT CCTGCCGTCG TCGTCCTCCC AGCGGCGACT GTGGATGATC GATCAACTGG CGCCGGGGGC GGTGACCTAC CACATCGCCT GGGCGGTCGA GCTGACCGGC CCGCTCGACG TGGCCGCGCT GGAGTCCACG CTGAGCTGGC TGGTCGACCG GCACGAGACC CTACGGACCC ACTTCACCTC GGTCGACGGA GAACCGGCCC AGGTGGTCGT ACCGGCCGCG CCGGTCCGGC TGCCCATCGT GGACGCCAGC TCCGACGACG CCTGGCCGGC GCTGGTGGAC GAGGCCGCCC GGGAGCCCTT CGACCTGGCG ACTGGCCCGT TGGCCCGGTT CGGACTCGTC CGGCGCAGCC GCGGGAAGCA CGTGCTCACC ATCGTGGTGC ACCACAGTCT CGCCGACGGC TGGTCGTTCG GGATCCTCTT CCGTGAGCTG GCCGCCGGGT ACGCGGCGGC GGTCGCCGGC AGCCACCCCG ACCTGCCCGC ACCCGAGGTG CAGTACGCGG ACTTCGCGGT CTGGCAGCGG GAGCAGGCCG GCCGCGGCGC CTTCGAGGCC GACGTCGACT TCTGGCACGC CGAGTTGGCC GGCGCCCCGA CCCTGCTCGA CCTCCCCGCC GACCGGCCCC GCCCGGCCGA GCAGTCCGAC GCCGGCGGCG AGGTGGTCTT CGACGTCCCC GACGAGCTGA CCGCGCGGCT GCGGGCGAGC CGGGACGGCA CCCTGTTCAC CCGGCTCCTG GCCGGCTTCC AGACCCTGCT GCACCGGCTG ACCGGGGCCG ACGACCTGCT GGTAGCGGTC CCGGTGGCCG GACGCACCCG GCCGGAGACC CGGAACGTGG TGGGCTTCTT CGCCAACACC CTCGCCCTGC GGGCACGCTT CACCGGCCGG CCCGGCTTCA CCGAGATCCT CGCCCAGGCC CGAGCCTCGA CCACCGCCGC ACAGACCCGG CAGGACGTGC CCTTCGACCG GATCGTCGAC CGGCTCGCCC CGACCCGTAG CCTCGCCCAC AACCCCCTGG TGCAGGTGAT GTTCGCCCTC GACGAGCCAC CGCCGGAGGC CACCTCGGCC GGGCTGCGGA TCACCCCCCG GCTCTGGGAG AACGGCACGG TCAAGTTCGA CCTCACCCTC ACCGTGGAGG ACCGCCCCGA CGGGCTGCGC GGGCGCCTCA CCTACCGCAC CGATCGCTAC GAGGCGAACC GGATTCGCCG CTTCGCGCAG CGGTATCTCA CCCTGCTCAC CGCCGCCCTC GACCGGCCCG GCACCCCCGT CGGCGAGCTG CCCCTGCTCG ATCCGGCCGA ACGCGAGCAG ATCCTGCGGG ACGGCAACGA CACCGAGCTG CCCCTGCCCG ATGTGGCCAG CATCAGTGAC CTGCTCGACC GGTTTCCGCC GGCCGAACCG GACGCGGTCG CGGTCACCGG CCCGGACGGC ACCCTGCGCC ACCAGGACCT CGCCGCTCGC GTCAACCGCC TCGCCCACCT GCTCCGCGCC CATGGCGTCG GCCCGGACGT GCCGGTCGGG CTCTGCCTTG GCCGGAGCAC CGACCTACCG GCCGCGCTCC TCGCCGTCTG GCGCGCCGGC GGTGGGTACC TGCCGCTCGA CCCGACGTTG CCGGCCGGCC GGCTGGCCAC CATGCTGGCC GACGCGGCCC CGCCGGTGCT CCTCACCGAC TCCGCCGGGA CGACCGTCCT CGGCGATGCC GTCGCCGCGG CCGGCACCAC CCCGGTGGTG CTCCGGGTCG ACCAGCTCGA CCCGGCCCTG CCGACCGACC CGCCGCCGGT CGCCGGCCAT 
CCGGACGGGC TCGCCTACCT GCTCTACACC TCCGGCTCCA CCGGCACGCC CAAGGGCGTC GTGGTCACCC ACCGCTCGGT GGTCAACCAC CTGGTCGGCT GTCACCGGCT GTTCGGGCTC ACACCCGAGG ACCGGGTCGC GGCGATCACC ACCCCGGCCT TCGACATCTC CGTGGTCGAG CTGGTGCTGC CGCTGCTGGC CGGGGCGCGC GTCGACGTCC TGGACGCGGC AACCGCCCGG GACGCGACCT TGCTGCGGGC CGCCTGCGAG GCGCGGGGGG TCACCGTCGT CCAGGCCACC CCGGCGAGTT GGCGGATGCT GGTCACCGCA GCCGGCGTAC CGGCCGGGGT GCGGTTGCGG ATCAGCGGCG GCGAGGCGCT GACCCGCGAC CTGGCCGACG CGTTGCGCAC CGACGGGGCT CGGGTCGTCA ACGGGTACGG ACCGTCGGAG ACGACCGTCT ACTCCTCGGC TGGAGTGGTG GGGGAAAGCG GCCCGGTCGA CCTGGGGCGT CCCCTCGCCA ACACCCGGAT TCAGCTGCTC GACCCCGCGG GCGAGCCAGT CCCGGACGGT GTGGTCGGAG AGATCCACAT CGGCGGCACC GGAGTGGCGC GGGGCTACCA CGGTGACCCT GGCCGGACCG CGGCCCGATT CCGCCCCGAC CCGTTCAGCC CGATCCCGGG CGGTCGGCTC TACGCCACCG GCGACCTGGC CCGACGGCTC CCGGACGGGC GTCTCGACTA CCACGGCCGC GCCGATCAAC AGGTCAAGGT GCGTGGATTC CGGATCGAGC TCGGCGAGAT CGAGTCGGTG CTGCGCGACC AGCCCGGCAT TCGGGACGCG ATGGTGACCA CCTGGGGAAC GGGCGGCGAT GTGCGGCTCG CCGCGTACGC GGTCACCGAA CCGGCCGCCG CCGACCCGGC ATCGGTCTGG CCGGCGCTCC GTACCGGCCT GGCCCGGCGG CTGCCGGAGT ACATGGTGCC GGCCACCCTG GTCCTGCTCG ACGTGCTGCC CCGCACCGCG AGCGGCAAGC TGGACCGGCG GGCGCTGCCC GAGCCGACCT GGCGCGAGAC CACCGGTAGC GGCCCGACCG CCCCCCGCAC CCCGGCTGAG GAGCAACTCG CCACGCTCTG GCAGGACGTG CTCGGCCGTA CCGACGTCGG CGTGCACGAC AACTTCTTCG CCCTCGGTGG ACACTCGCTC ACCGCGACCC GGCTGATCGC CCGTATCCGG ACCACCTTCG GGGTCGACCT GACGCTGCGG AGCCTCTTCG CCGCGCCCAC CGTCGCCGAG CTCGCCGTCG AGGTCGCCGC CACCGCGGAT TCCCGCGGCG CGCCCCACCG GATCGGTCCC GCCGTCACCA CCCCAGAGGA CCTGCTCGCC TCGCTCGACG ACCTCTCCGA CCGTGAGGTC GACGAGCTCC TGGACAGTCT GATCGCCGAG GAGGGCGTAT GA
|
Protein sequence | MTTETYILPS SSSQRRLWMI DQLAPGAVTY HIAWAVELTG PLDVAALEST LSWLVDRHET LRTHFTSVDG EPAQVVVPAA PVRLPIVDAS SDDAWPALVD EAAREPFDLA TGPLARFGLV RRSRGKHVLT IVVHHSLADG WSFGILFREL AAGYAAAVAG SHPDLPAPEV QYADFAVWQR EQAGRGAFEA DVDFWHAELA GAPTLLDLPA DRPRPAEQSD AGGEVVFDVP DELTARLRAS RDGTLFTRLL AGFQTLLHRL TGADDLLVAV PVAGRTRPET RNVVGFFANT LALRARFTGR PGFTEILAQA RASTTAAQTR QDVPFDRIVD RLAPTRSLAH NPLVQVMFAL DEPPPEATSA GLRITPRLWE NGTVKFDLTL TVEDRPDGLR GRLTYRTDRY EANRIRRFAQ RYLTLLTAAL DRPGTPVGEL PLLDPAEREQ ILRDGNDTEL PLPDVASISD LLDRFPPAEP DAVAVTGPDG TLRHQDLAAR VNRLAHLLRA HGVGPDVPVG LCLGRSTDLP AALLAVWRAG GGYLPLDPTL PAGRLATMLA DAAPPVLLTD SAGTTVLGDA VAAAGTTPVV LRVDQLDPAL PTDPPPVAGH PDGLAYLLYT SGSTGTPKGV VVTHRSVVNH LVGCHRLFGL TPEDRVAAIT TPAFDISVVE LVLPLLAGAR VDVLDAATAR DATLLRAACE ARGVTVVQAT PASWRMLVTA AGVPAGVRLR ISGGEALTRD LADALRTDGA RVVNGYGPSE TTVYSSAGVV GESGPVDLGR PLANTRIQLL DPAGEPVPDG VVGEIHIGGT GVARGYHGDP GRTAARFRPD PFSPIPGGRL YATGDLARRL PDGRLDYHGR ADQQVKVRGF RIELGEIESV LRDQPGIRDA MVTTWGTGGD VRLAAYAVTE PAAADPASVW PALRTGLARR LPEYMVPATL VLLDVLPRTA SGKLDRRALP EPTWRETTGS GPTAPRTPAE EQLATLWQDV LGRTDVGVHD NFFALGGHSL TATRLIARIR TTFGVDLTLR SLFAAPTVAE LAVEVAATAD SRGAPHRIGP AVTTPEDLLA SLDDLSDREV DELLDSLIAE EGV
|
| |
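The gene sequence above is shown in blocks of 10 bases separated by spaces, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and how a GC fraction (compare with the GC content field) could be computed. The helper names are hypothetical, and the CDS check is deliberately simplified: it only accepts ATG as a start codon, although translation table 11 also permits alternative starts.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: ATG start, standard stop codon at the end, length divisible by 3."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above, used only as a short demo input.
    demo = clean_sequence("ATGACCACCG AGACCTACAT")
    print(len(demo), gc_content(demo))
```

To check the full record, pass the complete Gene sequence field to `clean_sequence` and compare the resulting length and GC fraction with the table values.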