Gene Sare_3152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3152 
Symbol 
ID5706210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3595892 
End bp3600538 
Gene Length4647 bp 
Protein Length1548 aa 
Translation table11 
GC content73% 
IMG OID641272584 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001537951 
Protein GI159038698 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0379641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.36258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACG AAGACAAGCT GGTCGACTAC CTGCGCTGGG TGACGGCCGA CCTCAAGGAG 
GCCCGGGAAC AGGTGCGGGC CGCCAAGCAG CGAGAGAACG AGCCGCTTGC CATCGTCGGC
ATGTCCTGCC GGCTGCCCGG CGGGGTCAGC AGCCCGGAGC AGCTCTGGCA ACTGGTCGAC
GCGGGAGTGG ATGCGATCTC CCGTTTCCCG GCCGACCGCG GGTGGGATAT CGGTGACGTG
TTCGATCCGG AACCAGGCCG GCCTGGCAAG TCGTACGTCC GCGAGGGTGG CTTCCTCGAC
GCACCAGGCG ACTTCGATGC GGCGTTCTTT GGCATGAGCC CCAAGGAGGC CGTCGCCACT
GACCCGCAGC AGCGGCTGCT GCTGGAGACG GCGTGGGAGG CTCTGGAGCG GGCGGGCCTC
GACCCGCAGG CCCTGCGGGG CAGCCAGACC GGGGTCTTCG TCGGCAACAA CGGCCAGGAC
CACGTCATCG GACTCTCCCG GGCCCCGGTT GAGCTGTCCG GCTATACCGT CAGCGGCGCC
ACGGCCAGCA TCCTGTCCGG GCGGGTGTCG TACACGTTCG GCTTCGAGGG GCCCTCCCTG
GCAGTCGACA CCGCCTGTTC GTCGTCGCTG GTCGCGCTGC ACGTCGCGGG GCAGGCGCTG
CGGGGTGGCG AGTGTTCGCT TGCCCTGGTT GGTGGGATCA CGGTGATGAC CACCCCGACG
TTGTACGTGG GCTTCAGCCA GCAGCGCGGG CTGTCGGCGG AGGCGCGGTG CAAGGCGTTC
TCGGACGACG CCGACGGCAC CTCGATGGCG GAGGGGGTCG GCTGGCTGGT GCTCGAGCGA
CTCTCCGACG CCCAGCGCCT CGGCCACCGG GTGCTCGCGG TCGTCCGGGG CAGCGCGATC
AACCAGGACG GCGCGTCCAA CGGCATGACC GCCCCCAGCG GACCCGCTCA GCAGCGGGTG
CTCAGCCGGG CGTTGGCCAG CGCCGGCCTC GCGGCGTCGC AGGTCGACCT GGTCGAGGCG
CACGGCACCG GCACCGCGCT CGGCGACCCG ATCGAGGCGC AGGCGGTGCT GGCCGTCTAT
GGCAAGGACC GACCTGCGGG GCGGCCGCTG TGGATGGGAT CGCTGAAGTC GAACATCGGC
CACACCCAGG CGGTCGCCGG GATCGCGGGC ATCATCAAGG CGGTGCAGGC CATCCGGCAC
GGCGTGCTGC CGCGGACCCT GCACGTCAAG GAGCCGTCGA CCCAGGTCGA CTGGAGCGCC
GGCGACGTCG AATTGCTGAC CGAGCCGAAG GCGTGGCCGG AGACCGGGCA GCCACGCCGG
GCGGGAGTGT CCTCTTTCGG CGCGAGCGGC ACGAACGCGC ACGTCATCCT GGAGCAGGCA
CCCGATGTCG ACGCCGAAGC GGATTCTGGG ACCGATGCGC CATCGCCGGT GGTGCCTTGG
CTGCTGTCAG CCAGGTCGGA GCGGGCTCTG CGGGGTCAGG CGGATCGTCT GCTGGCGCAC
GTGACGGCTC ATCCGGAGCT GTCCCCGCGG GACGTGGCCT ACTCGCTGGT GCGCGGCCGG
GCAGCCTTCG AGCACCGCGG TGTGGTGTTG GGCGCGGACC GTGACGAGCT GCTGTCCGGG
CTGGCTGAGC TTGCTGTGGG CCGAGCGGCG CCGGGGGTGG TGACTGGGCG GGGCTCGGGA
GGCTTGGCTG TCCTGTTCAC CGGACAGGGC GCCCAGCGCA CCGGGATGGG CCGTGAGCTG
TACGACACGT TCCCGGTCTT CGCGGCCGCC TTCGACGCGG CGTGTGCGCA GCTGCGGCCG
GGACTGAAGG ACGCTGTCCT CAGCGGCGGC GCCGAGCTCT CCGAGACGGG GTGGGGGCAG
CCGGCGCTGT TCGCGTTCGA GGTGGCGCTG TTCCGCTTGG TCGAGTCCTG GGGGGTACGC
CCCGAGGTGC TTGGTGGCCA CTCCGTGGGC GAGATCGTCG CTGCTCATGT GGCGGGTGTG
TGGTCGCTGG AGGACGCTGC CCGGCTGGTG TCGGCCCGTG CGGAGTTGAT GCAGGCGCTG
CCGTCCGGTG GGTCGATGGT CGCCGTGGCG GCCAGCGAGG ACGAGGTGCG TACGGCCATC
GCCGGTTTCC CGGGCGTGGA CGTCGCTGCA GTGAACGGCC CCGCGTCGGT CGTGGTGGCC
GGCGCGACTG ACGCGGTCCG GGTAGTGGTG CAAGAGCTGG CCGCCCAAGG GCACCGGACC
AAGGCACTCC GGGTGAGTCA CGCGTTCCAC TCGTCGCTGA TGGAGCCGAT GCTGGCGGCG
TTCGGGGAGG TGCTGCGCTC CGTGACGTTC CACCATCCCC GAATTCCGCT GCTGTCACTG
GTCAGCGGTA CGCTCGGCGA CCCGGCAGTG TCCACTGCAG AATACTGGGT GCGGCATACT
CGGGAGGCGG TGCGGTTCGC CGACGGGATC CAGGCCGCCG TGCAGGCCGG GTGCACCACG
TTCCTGGAGC TGGGCCCGGA CGCGGCGCTG GCGGGGATGG GCCGCGATGG CCTGCCGGAC
GCGGCATGGG TGGCATCCGT ACGGAAGAAC CGTCCGGAGG TGCTCGCAAC GCTGGAGGCG
GCCGCCCGCG TCGCTGTCCA GGGAGCGACG GTCGACTGGG CTGCCCTGAC GCCGGGCGGA
CGCATGGTGG ACCTGCCCAC CTACGCCTTC CAGCACGAGC GCTTCTGGCT CGAGTACGAC
GTGCAGCCCC GGGTTCCCGA CGGTGTGGAC TGGCGCTACC GGGTCGGGTG GACGCCCGTG
CCCGACCCGG ACGCTCCGGC GGTACTCGCG GGCACCTGGC TGGTGGTCAT CCCGTCCACC
GTCACGCGGG AGGTGGACGT CTGGGCTCGT ATGGTGACCG GCGGTCTCAC CGCGGCCGGC
GCTCGTACCG AGGTCCTGCG GATTCCGGCC GGCTCCGGAC GCGCCGATCT GGCGACCCTG
CTGGACGGGC CCGGGGAGGC AGCCGGCGTA CTGAGCCTGT TGGCCTTGGC GGAGGGTCCC
ACCGACGCCG TGGTGCCGGC TGGTCTTGCC CTCACCATGG CCCTCGTGCA GGCACTCGGT
GACACCGGCC GGGAGGCGCC GCTGTGGTGT GTGACCCGGG AGGCGGTGCA CACCTCCGTC
GTCGAGGACC AGGCGATGAT CTGGGGATTC GGCCGGGTGG CCGCCCTCGA GCACCCCGGG
CGCTGGGGTG GTCTCGTCGA CCTGCCCGCA ACGCTCGATC CCGCCGCGGG CCGACGCCTG
GCCGCGGTCC TCAGGGGCGC GACGGGCGAG GACCAGGTTG CCGTCCGCAC CGACAGCCTG
TACGCGCGGC GCCTCACGGC GGTGACCGAG CCCGTCTCGG ACACCAGCGC CTGGGAGCCC
TCCGGAACGG TGCTGATCAC CGGGGGCACC GGTGCTCTCG GGGCACACGT CGCCCGCTGG
CTCGCCGGCC GCGGTGCCCG GGACCTGCTG CTGGCCAGCC GCTCAGGTCC GGCGGCGCCC
GGCGCGGCCG AGCTGGTGGC GGAGCTGGCG GAGCTTGGGG CCGCGGCCAC CATCGCCGCG
TGCGACGTCG CCGACGCGGA CGACCTCGCG GCTCTGCTGG CGACCGTCCC GGCCGACCGG
CCCCTGGGAG CCGTCTTCCA CGTCGCCGGC CGCATCGAGA CGACGGCTCT CGACGCGACG
ACCCCCGAGA TCCTGGCCGA GGTGCTGGCC GGCAAGGTGG CCGGCGCTCG TAACCTGGCC
GCCCTGGCCG GTGCTGTCGA CCGGTTCGTG CTGTTCTCGT CGGTCGCCGG GGTGTGGGGC
AGCGGCGGGC ACGCGGCGTA CGCCCCGGCC AACGCCTACC TGGACGCCCT GGCCGAGCGG
CGCCGGGCGG CCGGCCTGCC CGCTCTCGCG ATAGCCTGGG GCCCCTGGGC CGACGGCGGC
ATGAGCGCCG GGAAGGACAC CAGGCGGGAG TCCGCCCGGC ACGGTCTGCC GGTGATGCCG
ACCCGGGCCG CGCTCGGTGC CCTGGGCGCG GCCCTGGACG GCGGCGTGCC GAGCGTGACC
GTCGCCGACG TGGTGTGGGA CCGGTTCGTG CCGCTGTTCA CCAGCACCCG GCCGAGCCGC
CTGTTCGCCG ACGTGACCGT GGCGGCGCAG CCCGAACCGG ACGACCACGC CAGCCCCTGG
CTGGACCGGC TGCGGGGTCG CAGCGGGTCG GACCGCGACG CCGAACTGCT GGCCCTCGTG
CGGCACGAGG TGGCACTGAC CATCGGCCAC GTCGACGACC GGGCCGTGGA GATCGATCAA
GCTTTCCGGA ACCTCGGCTT CGACTCGATG GCGGCCGTCG AGCTGCGCGA CCGGATCGCC
ACCGGCACCG GCCTGACCCT GGCTAGCAGC CTCGTCTTCG ACCACCCGAC GGTGCAGTCG
CTGGCACGGC ACCTCGCTGG CGAGCTCGGC CCGGACGGCG GCGATCCGGT CGCCACCGTG
CTGGCCCAGC TGGACGAACT GGAGGCGGCA ATGGCCGGGC TGGCCGGTGG TGACGCCGTC
CGGGAGTCCA TTGAGCCGCG GTTGCAGGCC CTGCTCGCCG GCCTGTCCCG CCCGGTCGGG
CACTCGGCGG AGGCGGTGGC CGACCACCTC CGTACAGCGA GCGTCGAAGA CCTCTACGCC
TTCGTAGACC AAGAATTCGG CCGATGA
 
Protein sequence
MTNEDKLVDY LRWVTADLKE AREQVRAAKQ RENEPLAIVG MSCRLPGGVS SPEQLWQLVD 
AGVDAISRFP ADRGWDIGDV FDPEPGRPGK SYVREGGFLD APGDFDAAFF GMSPKEAVAT
DPQQRLLLET AWEALERAGL DPQALRGSQT GVFVGNNGQD HVIGLSRAPV ELSGYTVSGA
TASILSGRVS YTFGFEGPSL AVDTACSSSL VALHVAGQAL RGGECSLALV GGITVMTTPT
LYVGFSQQRG LSAEARCKAF SDDADGTSMA EGVGWLVLER LSDAQRLGHR VLAVVRGSAI
NQDGASNGMT APSGPAQQRV LSRALASAGL AASQVDLVEA HGTGTALGDP IEAQAVLAVY
GKDRPAGRPL WMGSLKSNIG HTQAVAGIAG IIKAVQAIRH GVLPRTLHVK EPSTQVDWSA
GDVELLTEPK AWPETGQPRR AGVSSFGASG TNAHVILEQA PDVDAEADSG TDAPSPVVPW
LLSARSERAL RGQADRLLAH VTAHPELSPR DVAYSLVRGR AAFEHRGVVL GADRDELLSG
LAELAVGRAA PGVVTGRGSG GLAVLFTGQG AQRTGMGREL YDTFPVFAAA FDAACAQLRP
GLKDAVLSGG AELSETGWGQ PALFAFEVAL FRLVESWGVR PEVLGGHSVG EIVAAHVAGV
WSLEDAARLV SARAELMQAL PSGGSMVAVA ASEDEVRTAI AGFPGVDVAA VNGPASVVVA
GATDAVRVVV QELAAQGHRT KALRVSHAFH SSLMEPMLAA FGEVLRSVTF HHPRIPLLSL
VSGTLGDPAV STAEYWVRHT REAVRFADGI QAAVQAGCTT FLELGPDAAL AGMGRDGLPD
AAWVASVRKN RPEVLATLEA AARVAVQGAT VDWAALTPGG RMVDLPTYAF QHERFWLEYD
VQPRVPDGVD WRYRVGWTPV PDPDAPAVLA GTWLVVIPST VTREVDVWAR MVTGGLTAAG
ARTEVLRIPA GSGRADLATL LDGPGEAAGV LSLLALAEGP TDAVVPAGLA LTMALVQALG
DTGREAPLWC VTREAVHTSV VEDQAMIWGF GRVAALEHPG RWGGLVDLPA TLDPAAGRRL
AAVLRGATGE DQVAVRTDSL YARRLTAVTE PVSDTSAWEP SGTVLITGGT GALGAHVARW
LAGRGARDLL LASRSGPAAP GAAELVAELA ELGAAATIAA CDVADADDLA ALLATVPADR
PLGAVFHVAG RIETTALDAT TPEILAEVLA GKVAGARNLA ALAGAVDRFV LFSSVAGVWG
SGGHAAYAPA NAYLDALAER RRAAGLPALA IAWGPWADGG MSAGKDTRRE SARHGLPVMP
TRAALGALGA ALDGGVPSVT VADVVWDRFV PLFTSTRPSR LFADVTVAAQ PEPDDHASPW
LDRLRGRSGS DRDAELLALV RHEVALTIGH VDDRAVEIDQ AFRNLGFDSM AAVELRDRIA
TGTGLTLASS LVFDHPTVQS LARHLAGELG PDGGDPVATV LAQLDELEAA MAGLAGGDAV
RESIEPRLQA LLAGLSRPVG HSAEAVADHL RTASVEDLYA FVDQEFGR