Gene Sare_4890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4890 
Symbol 
ID5707542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5545936 
End bp5550438 
Gene Length4503 bp 
Protein Length1500 aa 
Translation table11 
GC content74% 
IMG OID641274285 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001539630 
Protein GI159040377 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes 
TIGRFAM ID[TIGR01733] amino acid adenylation domain
[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.365672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCA CCGGGCCCGA CCGGGAACGG CTGATCCGGG AACTGATGGC CCGCCGGGGA 
CTGACCGGCC GGGCGCAGCC GACCGGGATT CCCCACCGGG CTCCGGGTAG CCGGGTGCCG
CTCTCCCCGA TGCAGGAGGG CATGTGGTTC CTCGACCAAC TCCAGCCAGG AAACTCGGCG
TACGTGCTCA GCCAGGCGGT TCGGGTCACC GGCCGGTTGG ACCTCGTCGC GTTGCAGTCC
GCGCTCGACG AGGTCGTCGC CGCGCATGAC GCCCTCCGGA CCACGTTCGT GCAGCGGGAC
GGGGAGGTGG AGCAGGTCGT CCTGCCCGCC ACCGATCCGG CTGCCCGGTG CCAGCTCGCC
GTGGAGGACC TGACCACGGT GGCGCCGGCC GCCCGCGAGG CCGCGGTGGT CGCCGCCGTC
AGCCAGGAAA GTGATACCCC GTTCGACCTG GGCCGGGCGC CCCTGCTGCG GGTCCGGTTG
GGGCGGCTGG ACGATGACGA ACACCTGTTG GTGGTCGCCA TGCACCACAT CATCGCCGAC
GAGCGGTCCC TCGACATCGT GCTGACCGGG CTGCTCGACG GCCACCACCG CCGGCTCACC
GGAGCGGTCG CCTCGACCAC TCCGCCGGCC ACGCACTTCC CCGACCACGT GCTCTGGCAA
CGTGACTGGT TGGCCAGCGC TGCCGCCGAC AGAGCGCGGG AGTTCTGGTC CGACCGGCTG
GCGGGCTGCG CCGGCGTGCT CGACCTGCCG ACCGACCGGC CGCGACCGGC CGCGGTGACC
TTCGCCGGCC GCACCCACGG CTTCGCCTTC GCCGACGACC TGATCGCATC GCTCGACGGC
TTGGCGCAGC GGACCGGCTG CACCCGCTTC ATGATTCTGC TCGCCGGTCT CCAGGTGCTG
CTGGCCCGCC TCGCGGGCAC CGACGACGTC TGCGTCGCCG CTCCGGTGAC GTTACGGGCC
AACCCCGAGC AGCAGCAGAT GGTCGGCCTG CTCGTCAACA GCCTGCCGCT GCGCACCGAC
CTGTCCGGGG ACCCGACTCT GGCCGAGGCG CTGACCCGGG TCCGGGGTAC CTGCGTGGAG
TCCCTGGCGC ACGCCGAGTT GCCCTTCGAG CGGATCGTCG AGACCGTCCG GATGCCCCGC
GACCCGAGCC GCAACCCGCT CTTCCAGGTG ATGCTGGTCC TGAACCAGGC CGGTGTCGCC
GGCCCTCAGG CAGGTCTCGC CGTCCGCCCG GTGCCGGTGC TGCGGGACAC CGCCCGGCTG
GACCTCACCG TGGCGGTGTA CGAGTCCCCG GCCGGTCTCA GCGCGGTGGT CGACTACAAC
ACCGACCTCT TCGACCAGGT GACCATCGGC CGGTTCGTCG ACCGGTTCCA GTCCGTGCTG
CGGGCGTTGG CCACCCGGCC GCGACTGCGC CTCTCCGCGC TGGAGCTGGT CGGCCCCGCT
GAGCGACGCG ACCTGGCCTC CTGGAACGCC ACCGGTCGGG CGTACCCGGG TGGGCTGGGC
CTGCACGAGC TGGTCAGCGC ACGGGCGGCG GCCGACCCGG ACGCGCCCGC GTTGCTGGAC
GTCAGCCCGG ACGAGCCGGA AGGGCCGGCG CTGACCTACG GCCGACTGGA CGCGGACGCC
GACCGACTCG CCGCGCACCT GCGTCGCTGC GGAGTCCGCC CCGACCAGCC GGTGGGGCTC
GCGCTGGCCG CCGGCCGCGC CGCCGTCACC GGCGTCCTGG CGATCCTCAA GGCGGGTGCC
GGCTACCTGC CGGTGGACCC CACCCACCCG CCGGCCCGGC TGCGCGCCCT GCTCACCGCC
GCGGGTACCA CGGTCTGCCT GGCCGACGCC GGGCTCGCCG CCACCCTGGC CACCCCGCCG
GGCACGTCCG ACGACGACCA GCCGTACCCG GGCACCGTGC TCGCTGTCGG CCCCGACGGC
CAGCCGGTCG ACGCCGACCC GACCGGGGCT CCGGCACCGG CCGCGCCCCG GGCGGTCCAC
CCCGACCAGC TCGCCTACGT CATCCACACC TCCGGCTCCA CCGGCACGCC GAAGGGGGTG
ATGGTCAGCC ACCGCACCGC CACCAACCTG GCCTTGGCCT TCGCCGACCT GCACGGCATC
GGCCCCGGCG ACCGGCTGCT CATGCTGCCC CCACTGAGCT TCGACGCCTC CGTCGGTGAT
CTCTTCCCGG CCCTGGTCAG CGGAGCGGCG ATCGTGGTGC ACCGGCAGCC GGCCGCGATC
ACCGGAGCGG GCCTGGTCGA ACTGTGCCGG ACCCACGGCC TCACCCTGGT CGACACCGCC
GCGCCGCTCT GGGCCCGCTG GGTCGCCGAC CTCGCCGCGC AGCCCGGCGG GGTCGACGTC
ACGCCGCTGC GCGCCATGAT GGTCGGTGGG GAGCCGGTGG ACCTGGAGAC CGTTCGCCGC
TGGGCGGGGC TCACCGGCGG CCGGGTCACC CTGCACAACC ACTACGGCCC GACCGAGGCG
ACCGTCTGCG CCACCACCTA TGCCACCGTG GACGCCGCCG AACTGCCCGG CCTGACCCGG
TTGCCGATCG GCCGCCCGGT GCCGAACGTC GAGGTGCACG TCCTCGACCC AGACCTGCGG
CCGGTGCCGA TCGGTCTCCC CGGTGAGGTC TGTGTCGGTG GCACCGCCCC CGCCCGCGGG
TACCGCGATA ATCCCGCCGA AACCGCCGGC CGCTTCGTCC CGAACCCGTA CGGCCCCGCC
GGGTCGCGGC TGTACCGCAC CGGCGACCTC GCCCGGCACC GCGCCGACGG CAGTCTCGAA
TTCCTCGGCC GGACCGACCA GCAGGTCAAG ATCCGGGGCC ATCGGATCGA GATCGGCGAG
GTCGAGGCGG CCTGCGCCGC GCTGCCCGGG GTACGCCGTA CCGCCGTGGT CGTCGACCAC
GCTCCGGCCG GTCCCCGACT CGTCGCCTAC CTGGTCGGTG ACGACGTCAC CCCAACCGGG
CGGGAGGCGC GGATCGCATT GCGCCGCCGG TTGCCGGAGT ACCTCGTCCC CAGCGCCTTC
GTCCGGGTGC CCGACCTGCC CACCACCCGG CACGGCAAGC TGGACCTGGC TGCCCTGCCC
CGGCCGGAGG ACACCGACCA GCCGGCGTAC GAGCCACCGG CCACCCCGAC TGAGAAGACA
CTCGCCGACA TCTGGGCCGA ACTGCTCGAC ACCGGCCCGG TGGGTCGGCA CGACAACTTC
TTCGACCTCG GTGGACACTC ACTGCTCGGG GCGAGTCTGG CGACCCGGAT CCGGGCCGCG
CTCGGGGCGG AACTGCCGCT GCGGGCACTC TTCTTCACCG CCGACCTCGC CGGTCTCGCC
GCTGTCCTCG ACGGAGACGG GTCCACCCGG GACGGCGACG TCGACCTGCT CCGCGCCGAG
GCCCGGCTCC CCGACGACCT GGCCGTACCG GCCGGCACGC CGACTGTGCC GGCCCGTGCG
GTGCTCTGCA CCGGGGCGAC CGGATTCCTC GGGGCGTACC TGCTCGCCGA CTGGCTCGAC
CACAGCACCG CCACCGTGCA CTGTCTGGTC CGGGCGGCCA CCCCGACCGC CGCGCTCGAC
CGGGTCCGCG CCAACCTGCG CCGGTACGGG CTCTGGCGGC CGGAGTACGC CGCCCGCCTG
GTCGGCGTCC CCGGCGACCT CGGCGCCCCC CGGCTCGGGT TGAGCGGGGC GGACTTCGCC
GAGCTGGGCG AACGGCTGGA CGCGATCGTG CACAACGGCG GCGTGGTCAA CTTCGTCGCG
CCGTATCCGG CACTGCGTCC GGCCAACGTC GGCGGCACTC TGGAGGTGCT TCGGCTGGCC
AGCACCGGCC GCCCGAGCGC GGTGCACTTC GTCTCCACCC TCGGGGTATT CGTCACCCCG
TCGCACACCG GCACGCTGGT GCGCGAAGGT GACCTCCCGG CCGACTGTGA CGGGTTCCCC
GACGGCTACA ACGCCAGTAA GTGGGTGGCC GACGCGCTGG TCCGGGCCGC CCGGGAGCGG
GGGCTGCCGG TCAGTGTGCA CCGGCCCGCG CGGATCACCG GGGACGCCGG CAGCGGTCTC
GGCAACACCG ACGACTTCTT CAGCCGGCTG CTGAAGACCT GCGTCCAGCT GGGCGCCGTA
CCGGAGATCG ACGACCCCGC CGATCTCGCG CCGGTCGACT ACGTCGGGGC CGGCATCGGG
CACCTGACCC GGGCGGGCGC GACGACGGAC CACCACTACT ACAACAACCA AACCATGTCG
TACGCCGCTC TGGCCGACGC GCTGGCCAGC TTCGGCTACC CGGTGACGTC GATGCCGTAT
CCGCGCTGGC GGGCGGCGCT GCTGGAGCGT CCGGACGCCG CGCTGGCCCG GCTCGCCCCG
CTCTTCGATG CGGACACTCC GGTTCGGACG CAGCCCGACT TCGACTGCAC CGGCACCGAG
GCGACGCTCG CGGCGGCCGG GATCACCTGT CCGCCGGCCG ACGAGCGGCT GCTGCACGCC
TACCTGGCGG CGTTCGTGGC CGCTGGCTTT CTCGACCCAC CGCCCGGGGG TATCCATGGC
TGA
 
Protein sequence
MSATGPDRER LIRELMARRG LTGRAQPTGI PHRAPGSRVP LSPMQEGMWF LDQLQPGNSA 
YVLSQAVRVT GRLDLVALQS ALDEVVAAHD ALRTTFVQRD GEVEQVVLPA TDPAARCQLA
VEDLTTVAPA AREAAVVAAV SQESDTPFDL GRAPLLRVRL GRLDDDEHLL VVAMHHIIAD
ERSLDIVLTG LLDGHHRRLT GAVASTTPPA THFPDHVLWQ RDWLASAAAD RAREFWSDRL
AGCAGVLDLP TDRPRPAAVT FAGRTHGFAF ADDLIASLDG LAQRTGCTRF MILLAGLQVL
LARLAGTDDV CVAAPVTLRA NPEQQQMVGL LVNSLPLRTD LSGDPTLAEA LTRVRGTCVE
SLAHAELPFE RIVETVRMPR DPSRNPLFQV MLVLNQAGVA GPQAGLAVRP VPVLRDTARL
DLTVAVYESP AGLSAVVDYN TDLFDQVTIG RFVDRFQSVL RALATRPRLR LSALELVGPA
ERRDLASWNA TGRAYPGGLG LHELVSARAA ADPDAPALLD VSPDEPEGPA LTYGRLDADA
DRLAAHLRRC GVRPDQPVGL ALAAGRAAVT GVLAILKAGA GYLPVDPTHP PARLRALLTA
AGTTVCLADA GLAATLATPP GTSDDDQPYP GTVLAVGPDG QPVDADPTGA PAPAAPRAVH
PDQLAYVIHT SGSTGTPKGV MVSHRTATNL ALAFADLHGI GPGDRLLMLP PLSFDASVGD
LFPALVSGAA IVVHRQPAAI TGAGLVELCR THGLTLVDTA APLWARWVAD LAAQPGGVDV
TPLRAMMVGG EPVDLETVRR WAGLTGGRVT LHNHYGPTEA TVCATTYATV DAAELPGLTR
LPIGRPVPNV EVHVLDPDLR PVPIGLPGEV CVGGTAPARG YRDNPAETAG RFVPNPYGPA
GSRLYRTGDL ARHRADGSLE FLGRTDQQVK IRGHRIEIGE VEAACAALPG VRRTAVVVDH
APAGPRLVAY LVGDDVTPTG REARIALRRR LPEYLVPSAF VRVPDLPTTR HGKLDLAALP
RPEDTDQPAY EPPATPTEKT LADIWAELLD TGPVGRHDNF FDLGGHSLLG ASLATRIRAA
LGAELPLRAL FFTADLAGLA AVLDGDGSTR DGDVDLLRAE ARLPDDLAVP AGTPTVPARA
VLCTGATGFL GAYLLADWLD HSTATVHCLV RAATPTAALD RVRANLRRYG LWRPEYAARL
VGVPGDLGAP RLGLSGADFA ELGERLDAIV HNGGVVNFVA PYPALRPANV GGTLEVLRLA
STGRPSAVHF VSTLGVFVTP SHTGTLVREG DLPADCDGFP DGYNASKWVA DALVRAARER
GLPVSVHRPA RITGDAGSGL GNTDDFFSRL LKTCVQLGAV PEIDDPADLA PVDYVGAGIG
HLTRAGATTD HHYYNNQTMS YAALADALAS FGYPVTSMPY PRWRAALLER PDAALARLAP
LFDADTPVRT QPDFDCTGTE ATLAAAGITC PPADERLLHA YLAAFVAAGF LDPPPGGIHG