Gene Sare_4894 details


Gene Information

Locus tag: Sare_4894
Symbol:
ID: 5707546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5555847
End bp: 5559098
Gene Length: 3252 bp
Protein Length: 1083 aa
Translation table: 11
GC content: 74%
IMG OID: 641274289
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001539634
Protein GI: 159040381
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
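
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5555847
END_BP = 5559098
GENE_LENGTH_BP = 3252
PROTEIN_LENGTH_AA = 1083


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons match protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span is 3252 bp, which is 1084 codons, or 1083 residues plus one stop codon.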


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.950239
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0263577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCG AGACCTACAT CCTGCCGTCG TCGTCCTCCC AGCGGCGACT GTGGATGATC 
GATCAACTGG CGCCGGGGGC GGTGACCTAC CACATCGCCT GGGCGGTCGA GCTGACCGGC
CCGCTCGACG TGGCCGCGCT GGAGTCCACG CTGAGCTGGC TGGTCGACCG GCACGAGACC
CTACGGACCC ACTTCACCTC GGTCGACGGA GAACCGGCCC AGGTGGTCGT ACCGGCCGCG
CCGGTCCGGC TGCCCATCGT GGACGCCAGC TCCGACGACG CCTGGCCGGC GCTGGTGGAC
GAGGCCGCCC GGGAGCCCTT CGACCTGGCG ACTGGCCCGT TGGCCCGGTT CGGACTCGTC
CGGCGCAGCC GCGGGAAGCA CGTGCTCACC ATCGTGGTGC ACCACAGTCT CGCCGACGGC
TGGTCGTTCG GGATCCTCTT CCGTGAGCTG GCCGCCGGGT ACGCGGCGGC GGTCGCCGGC
AGCCACCCCG ACCTGCCCGC ACCCGAGGTG CAGTACGCGG ACTTCGCGGT CTGGCAGCGG
GAGCAGGCCG GCCGCGGCGC CTTCGAGGCC GACGTCGACT TCTGGCACGC CGAGTTGGCC
GGCGCCCCGA CCCTGCTCGA CCTCCCCGCC GACCGGCCCC GCCCGGCCGA GCAGTCCGAC
GCCGGCGGCG AGGTGGTCTT CGACGTCCCC GACGAGCTGA CCGCGCGGCT GCGGGCGAGC
CGGGACGGCA CCCTGTTCAC CCGGCTCCTG GCCGGCTTCC AGACCCTGCT GCACCGGCTG
ACCGGGGCCG ACGACCTGCT GGTAGCGGTC CCGGTGGCCG GACGCACCCG GCCGGAGACC
CGGAACGTGG TGGGCTTCTT CGCCAACACC CTCGCCCTGC GGGCACGCTT CACCGGCCGG
CCCGGCTTCA CCGAGATCCT CGCCCAGGCC CGAGCCTCGA CCACCGCCGC ACAGACCCGG
CAGGACGTGC CCTTCGACCG GATCGTCGAC CGGCTCGCCC CGACCCGTAG CCTCGCCCAC
AACCCCCTGG TGCAGGTGAT GTTCGCCCTC GACGAGCCAC CGCCGGAGGC CACCTCGGCC
GGGCTGCGGA TCACCCCCCG GCTCTGGGAG AACGGCACGG TCAAGTTCGA CCTCACCCTC
ACCGTGGAGG ACCGCCCCGA CGGGCTGCGC GGGCGCCTCA CCTACCGCAC CGATCGCTAC
GAGGCGAACC GGATTCGCCG CTTCGCGCAG CGGTATCTCA CCCTGCTCAC CGCCGCCCTC
GACCGGCCCG GCACCCCCGT CGGCGAGCTG CCCCTGCTCG ATCCGGCCGA ACGCGAGCAG
ATCCTGCGGG ACGGCAACGA CACCGAGCTG CCCCTGCCCG ATGTGGCCAG CATCAGTGAC
CTGCTCGACC GGTTTCCGCC GGCCGAACCG GACGCGGTCG CGGTCACCGG CCCGGACGGC
ACCCTGCGCC ACCAGGACCT CGCCGCTCGC GTCAACCGCC TCGCCCACCT GCTCCGCGCC
CATGGCGTCG GCCCGGACGT GCCGGTCGGG CTCTGCCTTG GCCGGAGCAC CGACCTACCG
GCCGCGCTCC TCGCCGTCTG GCGCGCCGGC GGTGGGTACC TGCCGCTCGA CCCGACGTTG
CCGGCCGGCC GGCTGGCCAC CATGCTGGCC GACGCGGCCC CGCCGGTGCT CCTCACCGAC
TCCGCCGGGA CGACCGTCCT CGGCGATGCC GTCGCCGCGG CCGGCACCAC CCCGGTGGTG
CTCCGGGTCG ACCAGCTCGA CCCGGCCCTG CCGACCGACC CGCCGCCGGT CGCCGGCCAT
CCGGACGGGC TCGCCTACCT GCTCTACACC TCCGGCTCCA CCGGCACGCC CAAGGGCGTC
GTGGTCACCC ACCGCTCGGT GGTCAACCAC CTGGTCGGCT GTCACCGGCT GTTCGGGCTC
ACACCCGAGG ACCGGGTCGC GGCGATCACC ACCCCGGCCT TCGACATCTC CGTGGTCGAG
CTGGTGCTGC CGCTGCTGGC CGGGGCGCGC GTCGACGTCC TGGACGCGGC AACCGCCCGG
GACGCGACCT TGCTGCGGGC CGCCTGCGAG GCGCGGGGGG TCACCGTCGT CCAGGCCACC
CCGGCGAGTT GGCGGATGCT GGTCACCGCA GCCGGCGTAC CGGCCGGGGT GCGGTTGCGG
ATCAGCGGCG GCGAGGCGCT GACCCGCGAC CTGGCCGACG CGTTGCGCAC CGACGGGGCT
CGGGTCGTCA ACGGGTACGG ACCGTCGGAG ACGACCGTCT ACTCCTCGGC TGGAGTGGTG
GGGGAAAGCG GCCCGGTCGA CCTGGGGCGT CCCCTCGCCA ACACCCGGAT TCAGCTGCTC
GACCCCGCGG GCGAGCCAGT CCCGGACGGT GTGGTCGGAG AGATCCACAT CGGCGGCACC
GGAGTGGCGC GGGGCTACCA CGGTGACCCT GGCCGGACCG CGGCCCGATT CCGCCCCGAC
CCGTTCAGCC CGATCCCGGG CGGTCGGCTC TACGCCACCG GCGACCTGGC CCGACGGCTC
CCGGACGGGC GTCTCGACTA CCACGGCCGC GCCGATCAAC AGGTCAAGGT GCGTGGATTC
CGGATCGAGC TCGGCGAGAT CGAGTCGGTG CTGCGCGACC AGCCCGGCAT TCGGGACGCG
ATGGTGACCA CCTGGGGAAC GGGCGGCGAT GTGCGGCTCG CCGCGTACGC GGTCACCGAA
CCGGCCGCCG CCGACCCGGC ATCGGTCTGG CCGGCGCTCC GTACCGGCCT GGCCCGGCGG
CTGCCGGAGT ACATGGTGCC GGCCACCCTG GTCCTGCTCG ACGTGCTGCC CCGCACCGCG
AGCGGCAAGC TGGACCGGCG GGCGCTGCCC GAGCCGACCT GGCGCGAGAC CACCGGTAGC
GGCCCGACCG CCCCCCGCAC CCCGGCTGAG GAGCAACTCG CCACGCTCTG GCAGGACGTG
CTCGGCCGTA CCGACGTCGG CGTGCACGAC AACTTCTTCG CCCTCGGTGG ACACTCGCTC
ACCGCGACCC GGCTGATCGC CCGTATCCGG ACCACCTTCG GGGTCGACCT GACGCTGCGG
AGCCTCTTCG CCGCGCCCAC CGTCGCCGAG CTCGCCGTCG AGGTCGCCGC CACCGCGGAT
TCCCGCGGCG CGCCCCACCG GATCGGTCCC GCCGTCACCA CCCCAGAGGA CCTGCTCGCC
TCGCTCGACG ACCTCTCCGA CCGTGAGGTC GACGAGCTCC TGGACAGTCT GATCGCCGAG
GAGGGCGTAT GA
 
Protein sequence
MTTETYILPS SSSQRRLWMI DQLAPGAVTY HIAWAVELTG PLDVAALEST LSWLVDRHET 
LRTHFTSVDG EPAQVVVPAA PVRLPIVDAS SDDAWPALVD EAAREPFDLA TGPLARFGLV
RRSRGKHVLT IVVHHSLADG WSFGILFREL AAGYAAAVAG SHPDLPAPEV QYADFAVWQR
EQAGRGAFEA DVDFWHAELA GAPTLLDLPA DRPRPAEQSD AGGEVVFDVP DELTARLRAS
RDGTLFTRLL AGFQTLLHRL TGADDLLVAV PVAGRTRPET RNVVGFFANT LALRARFTGR
PGFTEILAQA RASTTAAQTR QDVPFDRIVD RLAPTRSLAH NPLVQVMFAL DEPPPEATSA
GLRITPRLWE NGTVKFDLTL TVEDRPDGLR GRLTYRTDRY EANRIRRFAQ RYLTLLTAAL
DRPGTPVGEL PLLDPAEREQ ILRDGNDTEL PLPDVASISD LLDRFPPAEP DAVAVTGPDG
TLRHQDLAAR VNRLAHLLRA HGVGPDVPVG LCLGRSTDLP AALLAVWRAG GGYLPLDPTL
PAGRLATMLA DAAPPVLLTD SAGTTVLGDA VAAAGTTPVV LRVDQLDPAL PTDPPPVAGH
PDGLAYLLYT SGSTGTPKGV VVTHRSVVNH LVGCHRLFGL TPEDRVAAIT TPAFDISVVE
LVLPLLAGAR VDVLDAATAR DATLLRAACE ARGVTVVQAT PASWRMLVTA AGVPAGVRLR
ISGGEALTRD LADALRTDGA RVVNGYGPSE TTVYSSAGVV GESGPVDLGR PLANTRIQLL
DPAGEPVPDG VVGEIHIGGT GVARGYHGDP GRTAARFRPD PFSPIPGGRL YATGDLARRL
PDGRLDYHGR ADQQVKVRGF RIELGEIESV LRDQPGIRDA MVTTWGTGGD VRLAAYAVTE
PAAADPASVW PALRTGLARR LPEYMVPATL VLLDVLPRTA SGKLDRRALP EPTWRETTGS
GPTAPRTPAE EQLATLWQDV LGRTDVGVHD NFFALGGHSL TATRLIARIR TTFGVDLTLR
SLFAAPTVAE LAVEVAATAD SRGAPHRIGP AVTTPEDLLA SLDDLSDREV DELLDSLIAE
EGV
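
The relationship between the two sequences above can be reproduced with a short script. This is a minimal sketch, not the tool that produced the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_content` are hypothetical. Whitespace is stripped first so the sequence can be pasted in its 10-base blocks.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a pasted sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1 up to the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    head = "ATGACCACCG AGACCTACAT CCTGCCGTCG"
    print(translate(head))  # MTTETYILPS
```

Passing the full gene sequence to `translate` should yield the listed protein sequence, and `gc_content` can be compared with the GC content field in the same way.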