Gene Sare_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3052 
Symbol 
ID5707099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3459787 
End bp3461361 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content68% 
IMG OID641272494 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001537862 
Protein GI159038609 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0113678 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGATTG ACAGCTCACC CGGCCCGACG AAGGCCGGCG GACACCATCT ATACGAATGG 
TTCCATGCCT CGGCCATCCG GTACGCCGAC GAGCCCGCCC TGGAGGTGGG TGCCGAGCGG
CTTACGTACC GCAGCCTCGC GCGGCGCGCC GGCGAGCTGA CCGACGCGCT GCGGCAGGCT
GGCGTCACTG GCCGGCCCAC CCGGGTAGGG TTGCTCGCCG GGCGCAGCGT GGCCGCCTAC
GCGGGATATC TGGCGGTGCA ACGCCTCGGC GCCACGGTGG TGCCGTTGAA CCCGGCCTTC
CCGCCCGCCC GAAACGCAGC GATCGCCGCA GCGGCCGGGC TCGAGGTGAT CCTTTCCGAG
CAGGGCACCC TCCCGGGCTG CACGATGCCG ACGGTGGCGG CGGCCGTCGA CCCGCAGGCA
CCAGAGTTCG GCAAGGTGCC GGAGCTTCCG CAACTCGACA GGTCGCCGGA CGACCTGGCG
TACATCCTGT TCACGTCGGG CTCCACCGGC CGGCCGAAGG GCGTCCCCAT CACCCACCGA
AATCTCTCGT CATATCTGAC GGAGGTGATC CCCCGCTACC ACACGGGACC GGGCTGTCGG
CTCTCGCAGG CATTCGACCT GACCTTCGAC GTGTCGGTGT TCGATATGTT CGTCGCCTGG
GGATCGGGCG CCACCCTGGT CGTTCCCACT GCCGATGAGC TACTCGCCCC GGTCGGTTTT
ATATCTTCCC GGGCCATCAC GCACTGGTGC TCGGTGCCGT CGATAATCTC CTTCACCCGG
CGCATGCGGG CCCTACGCGA GCGCTCGATG CCGACGCTGC GCTACAGCGT CTTCGCCGGG
GAGCCGTTGA CCCTACAACA GGCCGAGGCG TGGCAGCGGG CCGCGCCGCA GAGCCGAATC
GACAACCTGT ACGGCCCTAC CGAGGCCACG GTAACCTGCG CTGGCTATCG GCTGCCTGCC
AACCCGGCAC AGTGGCCCCG GTCCGCCAAC GACACAGTGC CGATCGGCAT TGTTCACTCC
GGAGTAGAAC AACTAGTCCT CGATGAAGAC GGCAGGCCGG CCGGAAAGGG TGAGCTCTGC
TTGCGGGGGC CGCAGCGTTT TCCTGGCTAC CTTGACCCGA CCGACGACGC GGGACGCTTC
GTCCGGGTCC ACGACGGCGT CGTGACCGAA CTAGAACCGG AAGACCAGGT TGACCAGGGG
TGCTGGTACC GCACCGGTGA CCGAGTCGGC GTTTACGACG GAACGTTAGT CCACCTCGGC
CGACTCGACC AGCAGGTCAA AGTCCACGGC TACCGGGTCG AACTCGGCGA GGTGGAGGCC
ACGCTACGTA CACATCCTGG AGTAGGCGAC GCGGTGGTAC TCGCCCACCC CGACGATCGT
GGGGACACCG ACCTCTACGC GGTCTGCACT GGCACGGCCA CGCCTGACGA GCTCATCGCC
GGACTGCGCA CCCGGCTACC CGCGTACATG ATGCCTCGCG AGGTCACGGT GGTGGACTGC
CTGCCGCTAA ATGCTAACGG CAAGACCGAC CGCCGTGCCC TCTCCGAGCA GCTGGCCCGG
GCGATGGTCC GATGA
 
Protein sequence
MLIDSSPGPT KAGGHHLYEW FHASAIRYAD EPALEVGAER LTYRSLARRA GELTDALRQA 
GVTGRPTRVG LLAGRSVAAY AGYLAVQRLG ATVVPLNPAF PPARNAAIAA AAGLEVILSE
QGTLPGCTMP TVAAAVDPQA PEFGKVPELP QLDRSPDDLA YILFTSGSTG RPKGVPITHR
NLSSYLTEVI PRYHTGPGCR LSQAFDLTFD VSVFDMFVAW GSGATLVVPT ADELLAPVGF
ISSRAITHWC SVPSIISFTR RMRALRERSM PTLRYSVFAG EPLTLQQAEA WQRAAPQSRI
DNLYGPTEAT VTCAGYRLPA NPAQWPRSAN DTVPIGIVHS GVEQLVLDED GRPAGKGELC
LRGPQRFPGY LDPTDDAGRF VRVHDGVVTE LEPEDQVDQG CWYRTGDRVG VYDGTLVHLG
RLDQQVKVHG YRVELGEVEA TLRTHPGVGD AVVLAHPDDR GDTDLYAVCT GTATPDELIA
GLRTRLPAYM MPREVTVVDC LPLNANGKTD RRALSEQLAR AMVR