Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3152 |
Symbol | |
ID | 5706210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3595892 |
End bp | 3600538 |
Gene Length | 4647 bp |
Protein Length | 1548 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641272584 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001537951 |
Protein GI | 159038698 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0379641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.36258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACG AAGACAAGCT GGTCGACTAC CTGCGCTGGG TGACGGCCGA CCTCAAGGAG GCCCGGGAAC AGGTGCGGGC CGCCAAGCAG CGAGAGAACG AGCCGCTTGC CATCGTCGGC ATGTCCTGCC GGCTGCCCGG CGGGGTCAGC AGCCCGGAGC AGCTCTGGCA ACTGGTCGAC GCGGGAGTGG ATGCGATCTC CCGTTTCCCG GCCGACCGCG GGTGGGATAT CGGTGACGTG TTCGATCCGG AACCAGGCCG GCCTGGCAAG TCGTACGTCC GCGAGGGTGG CTTCCTCGAC GCACCAGGCG ACTTCGATGC GGCGTTCTTT GGCATGAGCC CCAAGGAGGC CGTCGCCACT GACCCGCAGC AGCGGCTGCT GCTGGAGACG GCGTGGGAGG CTCTGGAGCG GGCGGGCCTC GACCCGCAGG CCCTGCGGGG CAGCCAGACC GGGGTCTTCG TCGGCAACAA CGGCCAGGAC CACGTCATCG GACTCTCCCG GGCCCCGGTT GAGCTGTCCG GCTATACCGT CAGCGGCGCC ACGGCCAGCA TCCTGTCCGG GCGGGTGTCG TACACGTTCG GCTTCGAGGG GCCCTCCCTG GCAGTCGACA CCGCCTGTTC GTCGTCGCTG GTCGCGCTGC ACGTCGCGGG GCAGGCGCTG CGGGGTGGCG AGTGTTCGCT TGCCCTGGTT GGTGGGATCA CGGTGATGAC CACCCCGACG TTGTACGTGG GCTTCAGCCA GCAGCGCGGG CTGTCGGCGG AGGCGCGGTG CAAGGCGTTC TCGGACGACG CCGACGGCAC CTCGATGGCG GAGGGGGTCG GCTGGCTGGT GCTCGAGCGA CTCTCCGACG CCCAGCGCCT CGGCCACCGG GTGCTCGCGG TCGTCCGGGG CAGCGCGATC AACCAGGACG GCGCGTCCAA CGGCATGACC GCCCCCAGCG GACCCGCTCA GCAGCGGGTG CTCAGCCGGG CGTTGGCCAG CGCCGGCCTC GCGGCGTCGC AGGTCGACCT GGTCGAGGCG CACGGCACCG GCACCGCGCT CGGCGACCCG ATCGAGGCGC AGGCGGTGCT GGCCGTCTAT GGCAAGGACC GACCTGCGGG GCGGCCGCTG TGGATGGGAT CGCTGAAGTC GAACATCGGC CACACCCAGG CGGTCGCCGG GATCGCGGGC ATCATCAAGG CGGTGCAGGC CATCCGGCAC GGCGTGCTGC CGCGGACCCT GCACGTCAAG GAGCCGTCGA CCCAGGTCGA CTGGAGCGCC GGCGACGTCG AATTGCTGAC CGAGCCGAAG GCGTGGCCGG AGACCGGGCA GCCACGCCGG GCGGGAGTGT CCTCTTTCGG CGCGAGCGGC ACGAACGCGC ACGTCATCCT GGAGCAGGCA CCCGATGTCG ACGCCGAAGC GGATTCTGGG ACCGATGCGC CATCGCCGGT GGTGCCTTGG CTGCTGTCAG CCAGGTCGGA GCGGGCTCTG CGGGGTCAGG CGGATCGTCT GCTGGCGCAC GTGACGGCTC ATCCGGAGCT GTCCCCGCGG GACGTGGCCT ACTCGCTGGT GCGCGGCCGG GCAGCCTTCG AGCACCGCGG TGTGGTGTTG GGCGCGGACC GTGACGAGCT GCTGTCCGGG CTGGCTGAGC TTGCTGTGGG CCGAGCGGCG CCGGGGGTGG TGACTGGGCG GGGCTCGGGA GGCTTGGCTG TCCTGTTCAC CGGACAGGGC GCCCAGCGCA CCGGGATGGG CCGTGAGCTG TACGACACGT TCCCGGTCTT CGCGGCCGCC TTCGACGCGG CGTGTGCGCA GCTGCGGCCG GGACTGAAGG ACGCTGTCCT CAGCGGCGGC GCCGAGCTCT CCGAGACGGG GTGGGGGCAG CCGGCGCTGT TCGCGTTCGA GGTGGCGCTG TTCCGCTTGG TCGAGTCCTG GGGGGTACGC CCCGAGGTGC TTGGTGGCCA CTCCGTGGGC GAGATCGTCG CTGCTCATGT GGCGGGTGTG TGGTCGCTGG AGGACGCTGC CCGGCTGGTG TCGGCCCGTG CGGAGTTGAT GCAGGCGCTG CCGTCCGGTG GGTCGATGGT CGCCGTGGCG GCCAGCGAGG ACGAGGTGCG TACGGCCATC GCCGGTTTCC CGGGCGTGGA CGTCGCTGCA GTGAACGGCC CCGCGTCGGT CGTGGTGGCC GGCGCGACTG ACGCGGTCCG GGTAGTGGTG CAAGAGCTGG CCGCCCAAGG GCACCGGACC AAGGCACTCC GGGTGAGTCA CGCGTTCCAC TCGTCGCTGA TGGAGCCGAT GCTGGCGGCG TTCGGGGAGG TGCTGCGCTC CGTGACGTTC CACCATCCCC GAATTCCGCT GCTGTCACTG GTCAGCGGTA CGCTCGGCGA CCCGGCAGTG TCCACTGCAG AATACTGGGT GCGGCATACT CGGGAGGCGG TGCGGTTCGC CGACGGGATC CAGGCCGCCG TGCAGGCCGG GTGCACCACG TTCCTGGAGC TGGGCCCGGA CGCGGCGCTG GCGGGGATGG GCCGCGATGG CCTGCCGGAC GCGGCATGGG TGGCATCCGT ACGGAAGAAC CGTCCGGAGG TGCTCGCAAC GCTGGAGGCG GCCGCCCGCG TCGCTGTCCA GGGAGCGACG GTCGACTGGG CTGCCCTGAC GCCGGGCGGA CGCATGGTGG ACCTGCCCAC CTACGCCTTC CAGCACGAGC GCTTCTGGCT CGAGTACGAC GTGCAGCCCC GGGTTCCCGA CGGTGTGGAC TGGCGCTACC GGGTCGGGTG GACGCCCGTG CCCGACCCGG ACGCTCCGGC GGTACTCGCG GGCACCTGGC TGGTGGTCAT CCCGTCCACC GTCACGCGGG AGGTGGACGT CTGGGCTCGT ATGGTGACCG GCGGTCTCAC CGCGGCCGGC GCTCGTACCG AGGTCCTGCG GATTCCGGCC GGCTCCGGAC GCGCCGATCT GGCGACCCTG CTGGACGGGC CCGGGGAGGC AGCCGGCGTA CTGAGCCTGT TGGCCTTGGC GGAGGGTCCC ACCGACGCCG TGGTGCCGGC TGGTCTTGCC CTCACCATGG CCCTCGTGCA GGCACTCGGT GACACCGGCC GGGAGGCGCC GCTGTGGTGT GTGACCCGGG AGGCGGTGCA CACCTCCGTC GTCGAGGACC AGGCGATGAT CTGGGGATTC GGCCGGGTGG CCGCCCTCGA GCACCCCGGG CGCTGGGGTG GTCTCGTCGA CCTGCCCGCA ACGCTCGATC CCGCCGCGGG CCGACGCCTG GCCGCGGTCC TCAGGGGCGC GACGGGCGAG GACCAGGTTG CCGTCCGCAC CGACAGCCTG TACGCGCGGC GCCTCACGGC GGTGACCGAG CCCGTCTCGG ACACCAGCGC CTGGGAGCCC TCCGGAACGG TGCTGATCAC CGGGGGCACC GGTGCTCTCG GGGCACACGT CGCCCGCTGG CTCGCCGGCC GCGGTGCCCG GGACCTGCTG CTGGCCAGCC GCTCAGGTCC GGCGGCGCCC GGCGCGGCCG AGCTGGTGGC GGAGCTGGCG GAGCTTGGGG CCGCGGCCAC CATCGCCGCG TGCGACGTCG CCGACGCGGA CGACCTCGCG GCTCTGCTGG CGACCGTCCC GGCCGACCGG CCCCTGGGAG CCGTCTTCCA CGTCGCCGGC CGCATCGAGA CGACGGCTCT CGACGCGACG ACCCCCGAGA TCCTGGCCGA GGTGCTGGCC GGCAAGGTGG CCGGCGCTCG TAACCTGGCC GCCCTGGCCG GTGCTGTCGA CCGGTTCGTG CTGTTCTCGT CGGTCGCCGG GGTGTGGGGC AGCGGCGGGC ACGCGGCGTA CGCCCCGGCC AACGCCTACC TGGACGCCCT GGCCGAGCGG CGCCGGGCGG CCGGCCTGCC CGCTCTCGCG ATAGCCTGGG GCCCCTGGGC CGACGGCGGC ATGAGCGCCG GGAAGGACAC CAGGCGGGAG TCCGCCCGGC ACGGTCTGCC GGTGATGCCG ACCCGGGCCG CGCTCGGTGC CCTGGGCGCG GCCCTGGACG GCGGCGTGCC GAGCGTGACC GTCGCCGACG TGGTGTGGGA CCGGTTCGTG CCGCTGTTCA CCAGCACCCG GCCGAGCCGC CTGTTCGCCG ACGTGACCGT GGCGGCGCAG CCCGAACCGG ACGACCACGC CAGCCCCTGG CTGGACCGGC TGCGGGGTCG CAGCGGGTCG GACCGCGACG CCGAACTGCT GGCCCTCGTG CGGCACGAGG TGGCACTGAC CATCGGCCAC GTCGACGACC GGGCCGTGGA GATCGATCAA GCTTTCCGGA ACCTCGGCTT CGACTCGATG GCGGCCGTCG AGCTGCGCGA CCGGATCGCC ACCGGCACCG GCCTGACCCT GGCTAGCAGC CTCGTCTTCG ACCACCCGAC GGTGCAGTCG CTGGCACGGC ACCTCGCTGG CGAGCTCGGC CCGGACGGCG GCGATCCGGT CGCCACCGTG CTGGCCCAGC TGGACGAACT GGAGGCGGCA ATGGCCGGGC TGGCCGGTGG TGACGCCGTC CGGGAGTCCA TTGAGCCGCG GTTGCAGGCC CTGCTCGCCG GCCTGTCCCG CCCGGTCGGG CACTCGGCGG AGGCGGTGGC CGACCACCTC CGTACAGCGA GCGTCGAAGA CCTCTACGCC TTCGTAGACC AAGAATTCGG CCGATGA
|
Protein sequence | MTNEDKLVDY LRWVTADLKE AREQVRAAKQ RENEPLAIVG MSCRLPGGVS SPEQLWQLVD AGVDAISRFP ADRGWDIGDV FDPEPGRPGK SYVREGGFLD APGDFDAAFF GMSPKEAVAT DPQQRLLLET AWEALERAGL DPQALRGSQT GVFVGNNGQD HVIGLSRAPV ELSGYTVSGA TASILSGRVS YTFGFEGPSL AVDTACSSSL VALHVAGQAL RGGECSLALV GGITVMTTPT LYVGFSQQRG LSAEARCKAF SDDADGTSMA EGVGWLVLER LSDAQRLGHR VLAVVRGSAI NQDGASNGMT APSGPAQQRV LSRALASAGL AASQVDLVEA HGTGTALGDP IEAQAVLAVY GKDRPAGRPL WMGSLKSNIG HTQAVAGIAG IIKAVQAIRH GVLPRTLHVK EPSTQVDWSA GDVELLTEPK AWPETGQPRR AGVSSFGASG TNAHVILEQA PDVDAEADSG TDAPSPVVPW LLSARSERAL RGQADRLLAH VTAHPELSPR DVAYSLVRGR AAFEHRGVVL GADRDELLSG LAELAVGRAA PGVVTGRGSG GLAVLFTGQG AQRTGMGREL YDTFPVFAAA FDAACAQLRP GLKDAVLSGG AELSETGWGQ PALFAFEVAL FRLVESWGVR PEVLGGHSVG EIVAAHVAGV WSLEDAARLV SARAELMQAL PSGGSMVAVA ASEDEVRTAI AGFPGVDVAA VNGPASVVVA GATDAVRVVV QELAAQGHRT KALRVSHAFH SSLMEPMLAA FGEVLRSVTF HHPRIPLLSL VSGTLGDPAV STAEYWVRHT REAVRFADGI QAAVQAGCTT FLELGPDAAL AGMGRDGLPD AAWVASVRKN RPEVLATLEA AARVAVQGAT VDWAALTPGG RMVDLPTYAF QHERFWLEYD VQPRVPDGVD WRYRVGWTPV PDPDAPAVLA GTWLVVIPST VTREVDVWAR MVTGGLTAAG ARTEVLRIPA GSGRADLATL LDGPGEAAGV LSLLALAEGP TDAVVPAGLA LTMALVQALG DTGREAPLWC VTREAVHTSV VEDQAMIWGF GRVAALEHPG RWGGLVDLPA TLDPAAGRRL AAVLRGATGE DQVAVRTDSL YARRLTAVTE PVSDTSAWEP SGTVLITGGT GALGAHVARW LAGRGARDLL LASRSGPAAP GAAELVAELA ELGAAATIAA CDVADADDLA ALLATVPADR PLGAVFHVAG RIETTALDAT TPEILAEVLA GKVAGARNLA ALAGAVDRFV LFSSVAGVWG SGGHAAYAPA NAYLDALAER RRAAGLPALA IAWGPWADGG MSAGKDTRRE SARHGLPVMP TRAALGALGA ALDGGVPSVT VADVVWDRFV PLFTSTRPSR LFADVTVAAQ PEPDDHASPW LDRLRGRSGS DRDAELLALV RHEVALTIGH VDDRAVEIDQ AFRNLGFDSM AAVELRDRIA TGTGLTLASS LVFDHPTVQS LARHLAGELG PDGGDPVATV LAQLDELEAA MAGLAGGDAV RESIEPRLQA LLAGLSRPVG HSAEAVADHL RTASVEDLYA FVDQEFGR
|
| |