Gene Sare_0353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0353 
Symbol 
ID5708025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp395910 
End bp399074 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content70% 
IMG OID641269879 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001535274 
Protein GI159036021 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0224507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.487426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACGG CCAGCGGAGG TGTACCGGTG GACAGCGTTG GAGAGGTGTC GTTCGCGCAG 
GAACGGCTGT GGTTCCTCGA TCAGCTGCGC CCCGGCACGC CGGACTACCT GCTGCCGCTC
GCGCTGCGTA TCCGCGGGCC GCTTGACGTC ACCGCCCTCA CCGAGGCTTT CCAAGCGATC
GTGGATCGTC ACGAGGTGTT GCGCACCCGC TACGTCGAGG TCGATGGCCG GCCGGTGGCC
CACGTCGACG CGCACACCAA GGTCACCATC GCGTACACCA CTGATCACCA CGTCCTGGAG
CGCGAACTCG CCCGCGCCAT CGACATCGCC GAGGACCTGC CGTTCCGGCT GTCGCTGGCC
CGACTCGGCG ACGACCATCT CCTGGTGTTC GTCGTGCACC ACATCGCCTT CGACGGCTGG
TCCTGGGGTG TGCTCGCCCG CGAGTTGGCC GCCGGGTACG CCGGCCGGAC GGCCGAGGTG
AGCAAGCAGG CCCCGCAGTA CGACGGGTTC GCCCGCTGGC AACGCGAGCG GTTCACCGAC
GAGCGCAGCC GCCGTCAGCT CGACTACTGG CGAGCCCAAC TCGTTGGTGC CCCCGCCATC
GACCTACCCA CCGACCGGCC GCGACCGCGG ACCTGGGACG GCACGGGCGA CGTCGTCCGC
GTCGACCTGT CCGCGACACT GCTGCGGGAG GTCGACGCGT TGGCTCGTAG CCGTCGCGTC
ACCCGTTTCA TGGTCCTGCT CGCCGCCTTC CAGATCGTTC TCGCCCGTGC CAGCGGTCAA
ACCGACTTCG CCGTCGGCAC GCCGGTCGCC GGCCGGACGC GGGTCGCGGA CGAGGACCTG
ATCGGCCTGT TCGTCAACTC GATCGTGCTG CGCGCGGACC TGTCCGGCTC ACCCACGTTC
GAGGAGCTAC TCATCCGCGT ACGGGACACC GCGCTCGGCG CGTTCTCCCA TGCCGAGACC
CCGTTCGAGC GGATCGTCAC GGAACTCGCT CCCGAACGCG ACCTGTCCCG CAACCCGCTC
TTCCAGGTGT CGTTCAGCCT GCTCGACGTG CGGGCTCCGA TGTCGCTTCC CGGACTGGAC
GTCGAACTGG TGGAGCCACC ATTGACGGGC TCCCCGCTCG ACCTCTTCCT CGACATCAAC
GTGCGGACGG ACGGCACCGC GGTGGCGCGG CTGCAGTACG CCACCGCGTT GTTCGACCAT
GCCCGGGTGG AGCGGCTCGC CCGGGGGTTC GTCGACCTGC TCCGCACCAT CGTCGCCGAG
CCCGAGATCA GCGTCAGCGG CCTGGCGACG CGACTGGAAC TGGGACCGGA CGGGGAGCGG
GACCGCCTGC TGCACGCCTG GAACGGCACC GCCGAGGATC TGCCCGACGG CACGGTCGAC
GGGCTGATCG CCGTTCAGGC GCAATCCACG CCGGACGCCG TGGCGGTGCG GACGACCGCC
GAGGACATCA CCTATGCCGA GCTGGACACG AGGGTCAACC GGCTCGCTCA CCATCTGCGC
GCTCTCGGCG TCCGATCAGG CTCGCTGGTC GCCGTGCTGC TCGACCGTGG GCCAGATCTG
CTCACCGCTC TCCTCGCCGT GCTCCGGGCC GGCGGCGCCT ACGTGCCCAT CGACCCCGAA
TATCCGGACG CCCGGGTCGC CTTCATCGTG GTCGACTCCG CCGCGGAGGT GGTGATCACT
CGGTCCACGC TTGCCGACCG AGTTGGTGAC ACCGACGGAA AGCTCGTCTT GCTGGACCGG
GACCGGGCCG CCGTGGCAGC CCGGCGGGCA GACGCCGTCG GTCCCACGGC GACCGCTGAC
GACCTCGCAT ACCTGATCTA TACGTCCGGC TCGACCGGAA CGCCCAAGGG TGTGATGGTC
CACCACCGGG CGCTGACCAA CTTCGTCACC TCGATCGTGC GGCGGCCCGG GCTCACCGCC
AGCCAGTCGG TCGTCGCGCT CACCACGATC TCGTTCGATC CGTCGTTGCT GGAGCTCTAT
GTGCCGTTGC TCGTCGGCGC GACGGTCGTC CTCGCCGACA CCGAGCAGGC CCGCGACCCC
CAGCGGCTGA CCGACCTGGT CGCACTCACT CGTCCCGCGG TTCTGCAGGC GACCCCGGCG
ATGCTGCGGG CGCTGCTCGA CACCGGCTGG GTTCCGCCGG CCAGGCTCAC CGTGTTGTCC
GGTGGCGAGA AGCTGCCGTC CGAGCTGGCC CGGCGGCTCG CCACGGATGG GGCTCAGGTG
TGGGACCTGT ATGGCCCGAC GGAGACGACC GTGTGGGTGA CCTCGGCTCG ACTCGACCCG
GCCGGCCGGG TCGTGGACTG GTCGCCGCAG GCCAACTGCA CGGTCCACCT GCTCGACCGG
CATGCCGAGC CGGTGCCCAT CGGTTCGGTC GGAGAACTGT ACGTGGGCGG CACCTGCGTC
GCGCTTGGCT ACCGGGGTCA GCCCGCGCTG ACCGCCGAGA GGTATGTGCC CGACCCCTAC
TCCACGACAC CCGGAGGACG CCTCTACCGA ACCGGCGACC TGGCCCGCCG CCACCAGGAC
GGATCGGTCG AGATCCTTGG CCGTGCCGAT CGGCAGGTGA AGATACGCGG CCATCGCATG
GAACCCAGCG AGATCGAGGC GGCGTTGCTC GGTCACGACG AGATTCGCGC GGTCGCCGTG
CACCCGACCT CGACTCCCGC CGGCGAGCAG CAGCTGACCG CCTACATCGT CCCGCGAGGG
AACACCCCGC CGCCGGTCGA GGGACTGCGG ACGTTCCTGC GGCGGACTCT GCCCGACTAC
ATGGTCCCGG CGGCGTACGT GCCGATGGAG GCACTTCCAC TGACGCCCAA CGGCAAGGTC
GACTACAACG CGTTGCCGGA ACCCACGATC CGGGTAGCCG TGGAGCGGGT GTCCCCGCGT
ACCACCGAGG AACGCGTGGT CGCCGGCATC TGGCAGGAGG TTCTCGGCAG CAGCACCCAG
ATCGGCGTGA ACGAGAACTT CTTCGACATC GGCGGACACT CGCTACTTGC CACCCGGGTT
GCCGTGCGCC TCCGCGCCCA ACTCGGTATC GACGTTCCCG TCCGCGGTCT GTTCGACCAC
AGCACGGTGG CCAGCCTCGC CGCCGCGCTC ACTGACTATC CGCAAGTCTC CCAGCGCGCC
GCGATGCCCA CGCTGACCGC CCGGCGCCGC CGTGTCACGC GTTGA
 
Protein sequence
MNTASGGVPV DSVGEVSFAQ ERLWFLDQLR PGTPDYLLPL ALRIRGPLDV TALTEAFQAI 
VDRHEVLRTR YVEVDGRPVA HVDAHTKVTI AYTTDHHVLE RELARAIDIA EDLPFRLSLA
RLGDDHLLVF VVHHIAFDGW SWGVLARELA AGYAGRTAEV SKQAPQYDGF ARWQRERFTD
ERSRRQLDYW RAQLVGAPAI DLPTDRPRPR TWDGTGDVVR VDLSATLLRE VDALARSRRV
TRFMVLLAAF QIVLARASGQ TDFAVGTPVA GRTRVADEDL IGLFVNSIVL RADLSGSPTF
EELLIRVRDT ALGAFSHAET PFERIVTELA PERDLSRNPL FQVSFSLLDV RAPMSLPGLD
VELVEPPLTG SPLDLFLDIN VRTDGTAVAR LQYATALFDH ARVERLARGF VDLLRTIVAE
PEISVSGLAT RLELGPDGER DRLLHAWNGT AEDLPDGTVD GLIAVQAQST PDAVAVRTTA
EDITYAELDT RVNRLAHHLR ALGVRSGSLV AVLLDRGPDL LTALLAVLRA GGAYVPIDPE
YPDARVAFIV VDSAAEVVIT RSTLADRVGD TDGKLVLLDR DRAAVAARRA DAVGPTATAD
DLAYLIYTSG STGTPKGVMV HHRALTNFVT SIVRRPGLTA SQSVVALTTI SFDPSLLELY
VPLLVGATVV LADTEQARDP QRLTDLVALT RPAVLQATPA MLRALLDTGW VPPARLTVLS
GGEKLPSELA RRLATDGAQV WDLYGPTETT VWVTSARLDP AGRVVDWSPQ ANCTVHLLDR
HAEPVPIGSV GELYVGGTCV ALGYRGQPAL TAERYVPDPY STTPGGRLYR TGDLARRHQD
GSVEILGRAD RQVKIRGHRM EPSEIEAALL GHDEIRAVAV HPTSTPAGEQ QLTAYIVPRG
NTPPPVEGLR TFLRRTLPDY MVPAAYVPME ALPLTPNGKV DYNALPEPTI RVAVERVSPR
TTEERVVAGI WQEVLGSSTQ IGVNENFFDI GGHSLLATRV AVRLRAQLGI DVPVRGLFDH
STVASLAAAL TDYPQVSQRA AMPTLTARRR RVTR