Gene Sare_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2029 
Symbol 
ID5705683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2319619 
End bp2323410 
Gene Length3792 bp 
Protein Length1263 aa 
Translation table11 
GC content74% 
IMG OID641271519 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001536890 
Protein GI159037637 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0480562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.193313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGG ATGACCCTGC CGGTCGCCCC TTCCAGGTGG CCGTCATCGG TGTCGGCTGC 
CGGCTTCCCG GCGACGTCGA CAGCGCCGAT GCCCTCTGGG AACTACTGCT CAAGGGCGGC
CACACCAGCG CGGAGATCCC CACCCAGCGG TGGCGGGCGT ACCGCGAGCG AGGCCCGGAG
TACGAGGCGG TCCTGCGGGA GACGGTCACC GCCGGCAGCT ATCTCGACGA CATCGCCGGC
TTCGACGCCG AGTTCTTCGG CCTGACCCCA CGCGAGGCCG CCGAGATGGA CCCGCAGCAG
CGGATCCTGC TGGAGGTCGG CTGGACAGCG CTGGAACACG CCGGCCTGCC ACCGACCGGG
CTGGCCGGCA GCGACACCGG CGTCTTCGTC GGAGTCAGTA CCACCGACTA CGGAGACCGG
CTGCTGGAGG ACCTACCCAC CGTCGAGGCG TGGACCGGCA TCGGCGCGGC CACCTGCGCC
CTGGCCAACC GCATCTCCTA CGCCCTCGAC CTGCGCGGAC CGAGTGTCGC CGTCGACACC
GCCTGTTCGG CGTCGCTGGT CGCGGTCCAC CTGGCCTGCC AGAGCCTGCT GCTCGGCGAG
AGCAGCGTCG CCCTGGCCGG CGGCGTCAAC CTGGTGCTCG CGCCCGGACA GAACGTATCG
CTGAACGCCG CCGGCACGCT CGCGCCCGAC GGGGTCAGCA AGTCCTTCGA CCGCGACGCC
GACGGCTACG GCCGAGGTGA GGGCTGCGGT GTCCTGGTGC TCAAGCGACT CGACGACGCG
GTCCGGGACG GCGACCGAGT GCTGGCCGTG ATCATTGGCA GCGCGGTCAA CCAGGACGGA
CGCACCGACG GCATCATGGC GCCCTCCGGG GAGGCCCAGC AGCACGTCGT ACGCCGTGCC
TGCGCCCGCG CCGGCATCAC ACCGGACAGC GTCGACTACG TGGAGGCCCA CGGCACCGGC
ACCCGCCTGG GCGACCCGGT CGAGGCCGGC GCGCTCTCCG CGGTCTACGG CCCCGGCCGA
CCGCCGGAAC GACCCTGCCT GATCGGGTCC ATCAAGTCGA ACATCGGCCA CCTGGAGGGC
GCGGCCGGCG TCGCCGGGCT GATGAAGGCG GTCCTAGCGT TGCACCGGGG CCAGATTCCC
GGCACCCCGC TGCGCGGCCG GTCGATACCC GCCGTCGACG GCGACGGCAC CGGGCTGCGA
CTGGTCACCA GCCCGCTGCC CTGGCCCCGA CGCGACGGGG CCAGCCGGGC CGCCGTCTCC
GGCTTCGGAT ACGGCGGCAC CATCGCCCAC GTCATCCTGG AACAAGCCCC GCCTCTGCCG
TCCCTCGACA CCGCGGACGA CGGCGAACGT CAGCCCCTGG TGCCACTGTC CGCCCGCTCC
GCCGCCGCCC TTCGGGCGCA GGCCGGCCGG CTCGCCGACC GGCTCGCCGC CGACGACCGG
ACGAACCTGG CCGACATCCG ATACACCCTG GCGCACCTAC GCGCCCACCT GAGCCACCGC
GCCGTGGTCA CCGGCGCCGA CCGCGGCGGA CTGGCCGCCG CGCTGCGGCA ACTCGCAGAC
GACCAGGCGG ACGCCAGCAC CGTGTCCGGG GTCGCACCGG GCGGCCGCTC CGTACGCCCG
GTGTGGGTGT TCTCCGGGCA CGGTTCACAC TGGCCCGGAA TGGGCCGCGA TCTGCTCAGT
CACGAGCCAG CGTTCGCCGC CGTGATCGAC GAGATCGAAC CGGTGGTCGC CGAGGAGGCG
GGCTTCTCGC TCCGCACCGC CCTCGGCGCT GCGGAACTTG GCGGCGTCGA CCGGATCCAG
ATCCTGACCT TCGCCATGCA TCTCGGCCTC GCCGCCGTGT GGCAGGCGTA CGGGGTGCGG
CCGGCGGCGG TCATCGGCCA CTCGGTCGGT GAGGTGGCCG CCGCCGTCAC CGCCGGCGTC
GTGAGCCCGG TCGATGGTGC CCGACTGATC TGCCGCCGCT CCGCCCTGCT CCGCCGCGCC
GCCGGGCGAG GCGCGATGGC CATGGTGACC CTGCCCTTCG CCGACGTCGC CGAACGCCTC
GCCGGCCGCG CCAACCTGGT CGCGGCCATC GCCTCCGCCC CCGCCGCCAC GGTGATCTCC
GGCGACATCG CCGCGGTGGA CGAGATCATC GAACAGTGGC CCGCCAACCG GATCGCCGTT
CGCCGGGTGC AGTCCGAGGT GGCCTTCCAC AGCCCGCACA TGGACCCGCT CATCGACGAG
TTGCGCGCCG CGGTGGTCGA CCTGGACCGG GCACCAGCCG TGGTGCCGAT GTACTCGACG
GTGCTCGACG ACCCGCGTGC GACACCCAGC TGCGACGGTG ACTACTGGGC GGCCAACCTG
CGTCGCCCGG TACGCCTGGT CCAAGCGGTC GAGGCCGCCC TGGCCGACGG GCATCGCGCC
TTCCTTGAGA TCTCCGCGCA CCCGGTGGTC GCCCACTCGC TGCGGGAGAC CGCCGACCAC
GCCGACGTGT ACATCGGGAC GACCTTGCGG CGGCACGCCC CCGGCCACCG CACCATGGTG
GCGGCAGTCG CCGGAGCGTA CTGCCACGGA GCCGAGGTCG ACTGGACGCA CCACTACCCG
CAGGGCCGGC TCGTCGACCT GCCCCACTAC GCCTGGCAGC ACCGCCAGCA CTGGCGTGAG
CCCGAGCCAC CGGGCACCAC GGGCGGGCAC GACATCGGGT CGCACACGCT ACTCGGCACC
CCGACCAGCG TGGCCGGCAG CGAGCTACGT CTCTGGCACA CCGTGCTCAC CGACGCCACC
CGCCCGTACC CGGGCCGCCA CCAGGTGCAG GGCGCGGAAC TCGTGCCGGC GGTGGTGTTC
GTCGCCACGT TCCTCGCCGC CGCTGCCCGC GACGGTGCTC CTGTGGCGTT GCGGGAGCTG
TCGATGCGGG TGCCGCTCGC CACTCACGTG CGCCAGGAGA TCCAGGTGGT CGACGATGAA
GGCCAGCTGC GGCTCGCCTC CCGACCGGCC GACGGAGACC CGGCACCGTG GCTGACCCAC
GCCACCGCCC TGGCCGTGCC GGCGACCGGC CCGCTCGCCG GCACCGTGGC CACACCGCCG
GCCGGCGGTG TGGTCGCCGA CGTCGGTCTG ATCGCAGCCC ACCTGAGCGC CGTGGGGGTG
CCGGCCACCG CCTTCGCGTG GACGGTGGAT CGTCTGATCA CTGCCGACGG CGGCCTGCGG
GCGCGGGTCC GCTTCCCGGA GTCGGCCGGC GGCTGGGCGG CGATCGTCGA CGCGGCGGTC
TCCATCGCAC CGGTTGTCTT CCCCGGTCCG CCCCGACTGC GCCTGGTCGA GGGCGCCGAG
TCGGTCACGA TGGCCGGCGC GCCGCCGACG GTCGCGGTGA TCGACGTGAT CCACGACGCC
TCCCGGGAGG ACACCGCCTC GGTCCTGGTC AGCGCGCCCG ACGGCACGAT CGTCGCGCAG
GTCGACGGGT TGCGGTACCC GGTGGTGCCG GCCGCACCGG ACGGGCCGGC CGACCAGCCC
GGTGCGGGCG GCGGGTCGCT GGCCGGGATG GAACCGGACG AATTGCGCGA ACGCCTGATC
GACGAAGTGC GCGCCGCGAT CGCCACGGAG ATGAAGCTTC CCGTCGAGTC ACTGGACCCG
CGTCTGCCTC TCGTACAGCA GGGCTTGGAC TCGGTGATGA CCGTCATCGT GCGGCGGCGG
CTGGAAAAGA CGTACCGCCA GCTGCTTCCG GCCTCGCTGT TCTGGCAGCA ACCGACCGTC
GTGGCGATCG CGGCCGAACT GACCGAGCTG ATCGCCGCCC CGCCGCAGCC GGCCGGGGTG
ACCGCCCGCT GA
 
Protein sequence
MTRDDPAGRP FQVAVIGVGC RLPGDVDSAD ALWELLLKGG HTSAEIPTQR WRAYRERGPE 
YEAVLRETVT AGSYLDDIAG FDAEFFGLTP REAAEMDPQQ RILLEVGWTA LEHAGLPPTG
LAGSDTGVFV GVSTTDYGDR LLEDLPTVEA WTGIGAATCA LANRISYALD LRGPSVAVDT
ACSASLVAVH LACQSLLLGE SSVALAGGVN LVLAPGQNVS LNAAGTLAPD GVSKSFDRDA
DGYGRGEGCG VLVLKRLDDA VRDGDRVLAV IIGSAVNQDG RTDGIMAPSG EAQQHVVRRA
CARAGITPDS VDYVEAHGTG TRLGDPVEAG ALSAVYGPGR PPERPCLIGS IKSNIGHLEG
AAGVAGLMKA VLALHRGQIP GTPLRGRSIP AVDGDGTGLR LVTSPLPWPR RDGASRAAVS
GFGYGGTIAH VILEQAPPLP SLDTADDGER QPLVPLSARS AAALRAQAGR LADRLAADDR
TNLADIRYTL AHLRAHLSHR AVVTGADRGG LAAALRQLAD DQADASTVSG VAPGGRSVRP
VWVFSGHGSH WPGMGRDLLS HEPAFAAVID EIEPVVAEEA GFSLRTALGA AELGGVDRIQ
ILTFAMHLGL AAVWQAYGVR PAAVIGHSVG EVAAAVTAGV VSPVDGARLI CRRSALLRRA
AGRGAMAMVT LPFADVAERL AGRANLVAAI ASAPAATVIS GDIAAVDEII EQWPANRIAV
RRVQSEVAFH SPHMDPLIDE LRAAVVDLDR APAVVPMYST VLDDPRATPS CDGDYWAANL
RRPVRLVQAV EAALADGHRA FLEISAHPVV AHSLRETADH ADVYIGTTLR RHAPGHRTMV
AAVAGAYCHG AEVDWTHHYP QGRLVDLPHY AWQHRQHWRE PEPPGTTGGH DIGSHTLLGT
PTSVAGSELR LWHTVLTDAT RPYPGRHQVQ GAELVPAVVF VATFLAAAAR DGAPVALREL
SMRVPLATHV RQEIQVVDDE GQLRLASRPA DGDPAPWLTH ATALAVPATG PLAGTVATPP
AGGVVADVGL IAAHLSAVGV PATAFAWTVD RLITADGGLR ARVRFPESAG GWAAIVDAAV
SIAPVVFPGP PRLRLVEGAE SVTMAGAPPT VAVIDVIHDA SREDTASVLV SAPDGTIVAQ
VDGLRYPVVP AAPDGPADQP GAGGGSLAGM EPDELRERLI DEVRAAIATE MKLPVESLDP
RLPLVQQGLD SVMTVIVRRR LEKTYRQLLP ASLFWQQPTV VAIAAELTEL IAAPPQPAGV
TAR