Gene Sare_4891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4891 
Symbol 
ID5707543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5550435 
End bp5553749 
Gene Length3315 bp 
Protein Length1104 aa 
Translation table11 
GC content72% 
IMG OID641274286 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001539631 
Protein GI159040378 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0706407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0725779 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG CCGACTCCAC CGAACCCGCC AGCGGCGGCA AACGCGCCCT GCTGGCCAGG 
CTTCTCACCG AACGGGCCCA GGCGCCCCGC AGCTACCCGG TGTCCTTCGG CCAACAGCGG
CTGTGGTTCC TGGACCGATT CACCGGGGGC ATCCCGGTCT ACAACATCCC GGTCGCCTTC
CAGGTGCACG GGCCACTGGA CGTCGCGGCG CTGCGTACCG CACTGACCAC GCTCGTCAAC
CGGCACGCGG CGCTGCGCAC CACGTTCAGC GAATCCGGCG GCGAACCAGT GCAGGTGGTC
CGACCGACCG GCGAGGTGGA CCTCACCGAG ATCGACCTGA CCGGGCTGCC GGTGGATCGG
CGTGAGGTCG AGGCCACCCA GCTGGTCCGA GAGCACGCCG GGCAACCGTT CGACCTGGCC
GAAGGCCCGC TGCTGCGGGT GGCGGTGATC CGGACCGACG CGGACCGGTT CCACGTGGCG
TTCTGCGTAC ACCACATCGT CTTCGACGCC TGGTCGGTCG GGGTGCTCTT CCGGGAGTTG
GAGACCGCGT ACGCGGCCGC GTTGGCGGGT ACCCCCGCCA ACCTGCCCGA CCTGCCCACG
CAGTACGCCG ATTTCACCGT CTGGCAGCGG GACCGGCTCG CCGGCGACGC GCTGCGCCGA
CAGCTCGATC ACTGGGTGGC GCACCTGCGG GGGGCGCCGG CCCTGCTCAG CCTTCCCACC
GACCGGCCCC GACCGGCCAA CCGCTCGTAT CGGGGCGCCT GCCACTACGT CACGGTCCCG
GCGGCGGCCG TCCGACGGGT CGAGGAGTTC AACCAGGACG CCGGGGTCAC CATGTTCATG
ACCCTGCTCG CCGTCTTCCA GGCTGTGCTG TCCCGACACA GTGGGCAGGA CGACATCGTG
ATCGGCGCAC CGGTCGCCGG CCGGGAGCAT CCCGACCTGG AGCGGCTGGT TGGCTTCTTC
GTCAACACGC TCGCCCTGCG GGTGTCGTCC GCCGGCTCCC CGTCGCTACG GCAACTCGTC
GACCGGGTGC GGGAGGTCAC CCTCGCCGGT CTCGGCAATG CCGAGGTGCC GTTCGAGAAG
GTGGTCGAGG AACTGCAGCC GGCACGTAGC CTGGCCCACG CACCGATCTT CCAGGCCCAG
CTGATCCTGC AGAACGCACC GCACAACGCC TTTCGTCTCA GCGGCTGCAC GACCACCTCG
CTGCGGGTCG ACAGCGGCAC GGCCAAGTTC GACCTCACTC TCGCCGGTGA GATGACCGCC
GAGGGCGCCC TGCGGTTGGC CTTCGAGTAC GACACCGAAC TGTTCGATGC CGCCACGGTC
GACCGGCTGG CCCGGCACCT GTGCACCCTG CTGGACGCGG CGGTCACCGA GCCGGATCGT
CCCCTGACGC GGTTCCCGCT GCTCAGCGGA GTGGATCGGT GGCGTGCCGT GGTCGAGTGG
AATCAGACCG ACCGGGGTAC GTTGCCGGTC GACACCATCC TGGACCTGCT TCCCACCGAC
CCCGCCGAGC CCGGCGCCCC GCCTGCCGTC ACCGGTCCGG ACGGGCACCT GGACCGGGCC
GGGCTGCACC GGCGGGCCGG GCAGATCGCC CGGCAACTGC TCGCTGCCGG CGTCGCCCCG
GACACCCCGG TGGGGATCTG CCTGGACCGC GGGGTCGACA TGGTCGCCGC GGTGCTCGGC
GTGTGGCGGG CCGGGGCCGG CTACCTACCG CTCGACCCCA CCCTGCCCCC CGAGCGGCTG
CGCCACCTGC TCGCCGACTC CGGCACCCGG GTCGTGCTGA CCCACCAGGC GGTCGTCGCG
CGGCTCGGCC CGGCGCTGGA GGGCTCGGTG ACGATGCTGC TCGACGATGC CACCGATGTC
CCCGGCCCGG ACGAGCCACT CCCGGCGGTC CCGGCGCATC CGGACGGGCT GGCGTACCTG
ATCTACACCT CGGGTTCGAC CGGCCAACCG AAAGGGGTCG CGGTCCCACA CCGCAGCGTG
ACCAACCTCG TTGCCTCCTT CCACGACGAC CTGGACCTGA CGCCCGAGGA CCGGTTCGCC
GCGGTCACCA CCCTGTCGTT CGACATCTCG GTGCTGGAAC TGCTGGTGCC GCTGCTGCTG
GACGTCCCGC TGCTGGTCGT GGGCGCCGAC GAGGTCGGCG ACGGTCCGGC CCTGCGTCGC
CGGCTCACCG AAGCGGGGAT CACCGCCATG CAGGCCACCC CGGCGACGTG GCGGCTGCTG
CTGGCGTCCG GTGGTGTACC GCCGACGCTG CGGCTACGCC TCTGCGGCGG CGAGGCCCTA
CCCCGGGACC TCGCCGACGC GCTACAGGCC GACGGTGCGG CCCTGTGGAA CTGCTACGGG
CCCACCGAAA CCACCGTCTG GTCCGCGGCG ACCCCCGTGG CGCCTGCCCC GGCCGCGGTG
GACCTCGGTA AACCGATCGC CAACACCCGG ATCTACCTCC TCGACGAGGT CTACCAGCCG
GTGCCGGTGG GCGTGGTGGG AGAAATCCAC ATCGGCGGGT CCGGTGTGGT GCGCGGATAC
CACAGTCGAC CTGGCCTGAC CGCCGGTCGG TTCGTCCCCG ACCCGTTCGC CGACCAGCCC
GGCGCCCGGC TCTACGCCAC CGGTGACCTG GCCCGGCAGC GCGCTGACGG TCGGCTGGAG
TTTCTTGGCC GCACCGACCA TCAGGTCAAG GTGCGTGGGT TCCGGATCGA GTTGGGTGAG
ATCGAAGCCC TGCTCCGTGG CCACGATCTG GTCGCGGATG CGGTGGTCGG CACCTGGGCC
GGCGGGGACG GCGACACCCG CCTGGTGGCG TACGCCGTGC CGGCGCACGG TGTCGACCCG
GACTCCCTCG CCGACCAGGT CCGTGCCGAC CTGGCCGGCC GGCTGCCCGA GTACATGCTT
CCCGCTGCCC TGGTGCCGTT GACCGCGCTG CCCCTCAACG ACAACGGCAA GGTCGACCGG
AACGCCCTGC CCACCCCCCA GTGGACCGAC CCGCGGGCGG AGCGGGTCGC CCCCCGCGAC
CCCCTCGAGC AGCTACTCGC CGGGATCTGG CAGGAGGTAC TGCACGTCGA GGGGATCGGT
GTGCACGACG ACTTCTTCCG CCTCGGTGGG CATTCACTCC TCGGTGCGCA GGCGTTGAGC
CGGATCGGTG CCGCGCTGGA GACGGAGGTG CCGATCCGGA TCCTCTTCGA GGCTCCGACG
ATCGAGGCGA TGGCCCGCGC GCTGCGCTCC ACGGAGGAGG TAGCCGGCCA GACCGACGCC
ATCGCCGCCC TCCGGGTGGA GGTGGCCGAC CTCTCCGACG AGGAACTGCG GGCCCTGCTG
GGCGGCCAGG AGTGA
 
Protein sequence
MTTADSTEPA SGGKRALLAR LLTERAQAPR SYPVSFGQQR LWFLDRFTGG IPVYNIPVAF 
QVHGPLDVAA LRTALTTLVN RHAALRTTFS ESGGEPVQVV RPTGEVDLTE IDLTGLPVDR
REVEATQLVR EHAGQPFDLA EGPLLRVAVI RTDADRFHVA FCVHHIVFDA WSVGVLFREL
ETAYAAALAG TPANLPDLPT QYADFTVWQR DRLAGDALRR QLDHWVAHLR GAPALLSLPT
DRPRPANRSY RGACHYVTVP AAAVRRVEEF NQDAGVTMFM TLLAVFQAVL SRHSGQDDIV
IGAPVAGREH PDLERLVGFF VNTLALRVSS AGSPSLRQLV DRVREVTLAG LGNAEVPFEK
VVEELQPARS LAHAPIFQAQ LILQNAPHNA FRLSGCTTTS LRVDSGTAKF DLTLAGEMTA
EGALRLAFEY DTELFDAATV DRLARHLCTL LDAAVTEPDR PLTRFPLLSG VDRWRAVVEW
NQTDRGTLPV DTILDLLPTD PAEPGAPPAV TGPDGHLDRA GLHRRAGQIA RQLLAAGVAP
DTPVGICLDR GVDMVAAVLG VWRAGAGYLP LDPTLPPERL RHLLADSGTR VVLTHQAVVA
RLGPALEGSV TMLLDDATDV PGPDEPLPAV PAHPDGLAYL IYTSGSTGQP KGVAVPHRSV
TNLVASFHDD LDLTPEDRFA AVTTLSFDIS VLELLVPLLL DVPLLVVGAD EVGDGPALRR
RLTEAGITAM QATPATWRLL LASGGVPPTL RLRLCGGEAL PRDLADALQA DGAALWNCYG
PTETTVWSAA TPVAPAPAAV DLGKPIANTR IYLLDEVYQP VPVGVVGEIH IGGSGVVRGY
HSRPGLTAGR FVPDPFADQP GARLYATGDL ARQRADGRLE FLGRTDHQVK VRGFRIELGE
IEALLRGHDL VADAVVGTWA GGDGDTRLVA YAVPAHGVDP DSLADQVRAD LAGRLPEYML
PAALVPLTAL PLNDNGKVDR NALPTPQWTD PRAERVAPRD PLEQLLAGIW QEVLHVEGIG
VHDDFFRLGG HSLLGAQALS RIGAALETEV PIRILFEAPT IEAMARALRS TEEVAGQTDA
IAALRVEVAD LSDEELRALL GGQE