Gene Sare_2948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2948 
Symbol 
ID5707826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3340693 
End bp3343866 
Gene Length3174 bp 
Protein Length1057 aa 
Translation table11 
GC content70% 
IMG OID641272397 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001537765 
Protein GI159038512 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.735724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0141341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCACG GAGCGGAGAG CACCCTTTCG CTCGGCCAGG AGCGGTCGTG GTTCATCGAC 
CAGTACGCCG GCGCAGCGGT CAACACGCTG GCGCACCGGG TGCGGCTACG TGGTGAGCTG
CGCACCGACG CGCTCGGAGC TGCGTTGACC GACGTCATCG GGGCCAGCGA CGTGCTTCGC
TGGGCGATCA GGTCGCACGA GGGCCAGGCC AGTCGGGTAC CGGCGCAGCC CCTACCTGCG
GCCGTGCCGA CGACCGACCT GTCGACCCTG CCCGCGACGG AGCGCGAGGA CGCGGTGCTC
CGCGCCGTGG AGCGTGCCGC CGGCAACCCA TTCGATCTCG GCCAGGGTCC GCTGCTCCGA
GCCGAGCTAC TCCGCGTCGA CGTCGACGAC CATGTTCTCC TGCTCACCAC TCACCGGGCA
GTGGTGGATG ACGCGAGTCT GTCGCTGCTC GTCAACCGGT TGGGCCGGGC GTACCGGGCC
CGCCTGCACC GCGCCGAGCC ACTGCCGTCC GACGCCGGCG CCTTCGAGGT GTTCGTAGCC
CGGGAGCGCG CCGGGGCGCA GCAGGGGCGC AGCGTGGCCG GTGCGGCCTG GCGGGACTAC
CTCGCCGACG TCGTCACACT GGAACTGCCG ACCGACCGGG TACGCGGCCT GCAGCCCCAG
TACACGGCCG CTACCGTCGA GACGATCCTG TCGCCGCCGC TGGACGACGC GGTGCGGGCC
GTGGCCGATC GCCTCTCCGT GCCCGCCGGT GAGGTGCTGC GCGCGGCGTT CCAGGCCCTG
CTGCACCGCT ACTCCGGTCA GCCGGTCGTG GCGCTGGGCG TGCCTGCCGG GCGCCGCGAT
TCGGCCACCG CGGACCTGGT TGGTCCGCTG GCCACCTGGC TCGTCCTGCG CTCCGAGTTG
CACCCGGACA CGACGCTGAC CGGCCTGATC ATGCCAGCCG GCGGGGCGGA CCGGCCTCCG
GTCCTGCCCT TCGAGACCCT GATCGAGGAG GTCCAACCGC CCCGCGACCC GGCCCGCAGC
GCGATCGTCC AGGCAACGTT CACCGCTCGG GAGGCTGCCA CACCCGCGGA CTTCGGCGCG
GCTACCGTCG TCGACCAGCG GACCGTCCCG GCCACGGCCA CCCTGCACGA CGTGTCCGTG
CTCGCCCAGC CGGGCCCGGC AGGCACCCGG ATCGGGCTCA CCTATCGCGC CGACCTGTTC
GACCGGGCCA CCGCGGGGCG GCTGCTGGGT CACTACGTCA GCCTGCTCGA CGCGGCGGTT
GCCCGCCCCG ACGAGCCGGT CGCCCGACTG CCCTACCTCA GTGTCACGGA GCGGACGCGG
ATCCTCGACG AGTTCAACCG GACGGAGGCG CCGTTCCCCC GGGACGCCAC CGTGCACGAA
CTCTTCGAGG AGCAGGTGCT GCGAAACTCG GACGCGCGCG CGGTGACCAT CGAGGGACAG
CACCTCACGT ATCGAGAGTT GAACGAGCGA GCGAACAAGC TGGCCCACCG GCTGCGGTCA
TGCGGGGTCG GACGCGGCAC GTACGTTGCC CTCTGCCTGG AACGATCCCT TGAGCTCATG
GTGGCCGTCA TGGCGGTCCT CAAGTCCGGT GGGGCCTACA TACCGCTGGA CCCGGCCTAC
CCGACCGACC GGCTCGCCTT CATGCTGGCG GACACCCAGG CCCGCTTCCT GGTCACCCAG
CGCCGGCTGC GCGAGATGGC ACCGATCGAC GACGCCGCCA CGGTGATCGT GCTGGACGAC
CCGGCCGACG CGGCGGTCGT GGCCGACCAG TCCGCGGTGA ACCCGGTCAA CGTGAACGCC
GCCGAGGACC TGACGTACAT CGTCTACACC TCCGGCTCCA CCGGTCGACC CAAGGGAGTC
GAGACGGTCC ACTTCGGTGT CGTTCGCCTC GTCGTCAACA CCGACATTCT CGAGTTGGAC
GAGCGAACCA GCTACCTGCA GATCTCGCCG CTGTCCTTCG ACGCCTGCAC CCTCGAAATC
TTCGGCCCAC TGCTCAACGG TGGCCGGGTC GTCCTGCTCC CGCCGGGCGT GCCGACACCA
GCGCGGGTGG CCCACACCGT CCGGGAACAG GGTGTCGACA CCCTGTGGCT GGTGGCTCCC
CTGGCCAACC TCACCATCGA CACGCACCTC GACGACCTGC GGGGGCTGCG CCAGTTCATG
GCCGGTGGCG ACGTGCTCTC CATCCCGCAC ATCCGGCAGG TGCTGGACAA GCTGCCGCAC
ATAAAGTTGA TCAACGGATA CGGCCCCACC GAGGTTACCG CCTTCAGCGT CAGCCACAAG
ATCGACTACA TCGACCCGGA CTGGCCCTCG ATTCCGATCG GCCGGCCGAT GCACAACACC
ACGGCCTACA TTCTCGACCC CCTCGGCCAG CCGGTGCCGA TTGGTGTGTG GGGCGAAATG
TACCTGGGCG GCCCGGGCGT CGCGCTCGGT TACCACAACC GACCTGATCT CAATGCCGAG
CGGTTTCTGC CGGACAACTT CCGCCCCGGG CCCGGGGCAC AGCTGTACCG TACCGGCGAC
CGGTGCCGGT GGCTGCCCGA CGGCACCATC CAGTTCCACG GCCGACTCGA CACACAGGTG
AAGATCGACG GCCTGCGGGT CGAGTTGGGC GAGATTCAGA GCGTGGTGGC GGGGCACGGG
TCGGTGGCGG CGGCGGTGGT CACCGCGCCG GTGATCGGCA CCCGGCGCAC CCTCGTGGCG
TACGTGGTGC CGGCGGATCC CGACGGTTTC GACGCTTCCG TACTGCGGGC GCACCTCACC
GGCGTCCTGC CGAGCGTGAT GGTGCCCGCC CATTTCGTCA CCATGTCGAC GATTCCGCTG
ACTCCGAACA ACAAGGTCGA CTTCCAGGCA CTACCCGAGC CGCAGTTCGG CACCGTGCGG
GGGCACCGAC CACCGGAGAC GAGCACTCAG CAGGCACTGG CGGAGATCTG GCGTGAAATC
CTGGGAGTGC CCGCCGTCGG GCTGGACGAC AACTTCTTCG AACTCGGCGG TCATTCACTA
CGCGCGGTGC CGATGATCGC CGCCATCAGC ATGCGGTTCG GAATCGACCT TGCCGTTCAG
GACATCTTCG AGGCACCCGG GCTGGAAGCT CTGGCGAGCC GAGTGGAGGA GCGGATGCTT
GCGGCGATTC CTGCCGAGGA GTTGGAGAGA ATGTTCTCCG AGCTCGGCAA TTGA
 
Protein sequence
MRHGAESTLS LGQERSWFID QYAGAAVNTL AHRVRLRGEL RTDALGAALT DVIGASDVLR 
WAIRSHEGQA SRVPAQPLPA AVPTTDLSTL PATEREDAVL RAVERAAGNP FDLGQGPLLR
AELLRVDVDD HVLLLTTHRA VVDDASLSLL VNRLGRAYRA RLHRAEPLPS DAGAFEVFVA
RERAGAQQGR SVAGAAWRDY LADVVTLELP TDRVRGLQPQ YTAATVETIL SPPLDDAVRA
VADRLSVPAG EVLRAAFQAL LHRYSGQPVV ALGVPAGRRD SATADLVGPL ATWLVLRSEL
HPDTTLTGLI MPAGGADRPP VLPFETLIEE VQPPRDPARS AIVQATFTAR EAATPADFGA
ATVVDQRTVP ATATLHDVSV LAQPGPAGTR IGLTYRADLF DRATAGRLLG HYVSLLDAAV
ARPDEPVARL PYLSVTERTR ILDEFNRTEA PFPRDATVHE LFEEQVLRNS DARAVTIEGQ
HLTYRELNER ANKLAHRLRS CGVGRGTYVA LCLERSLELM VAVMAVLKSG GAYIPLDPAY
PTDRLAFMLA DTQARFLVTQ RRLREMAPID DAATVIVLDD PADAAVVADQ SAVNPVNVNA
AEDLTYIVYT SGSTGRPKGV ETVHFGVVRL VVNTDILELD ERTSYLQISP LSFDACTLEI
FGPLLNGGRV VLLPPGVPTP ARVAHTVREQ GVDTLWLVAP LANLTIDTHL DDLRGLRQFM
AGGDVLSIPH IRQVLDKLPH IKLINGYGPT EVTAFSVSHK IDYIDPDWPS IPIGRPMHNT
TAYILDPLGQ PVPIGVWGEM YLGGPGVALG YHNRPDLNAE RFLPDNFRPG PGAQLYRTGD
RCRWLPDGTI QFHGRLDTQV KIDGLRVELG EIQSVVAGHG SVAAAVVTAP VIGTRRTLVA
YVVPADPDGF DASVLRAHLT GVLPSVMVPA HFVTMSTIPL TPNNKVDFQA LPEPQFGTVR
GHRPPETSTQ QALAEIWREI LGVPAVGLDD NFFELGGHSL RAVPMIAAIS MRFGIDLAVQ
DIFEAPGLEA LASRVEERML AAIPAEELER MFSELGN