Gene Information |
Locus tag | Sare_2029 |
Symbol | |
ID | 5705683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2319619 |
End bp | 2323410 |
Gene Length | 3792 bp |
Protein Length | 1263 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641271519 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001536890 |
Protein GI | 159037637 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
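The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; neither assumption is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(2319619, 2323410)
print(length, protein_length(length))  # 3792 1263
```

Both results match the Gene Length (3792 bp) and Protein Length (1263 aa) fields.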
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0480562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.193313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCCGGG ATGACCCTGC CGGTCGCCCC TTCCAGGTGG CCGTCATCGG TGTCGGCTGC CGGCTTCCCG GCGACGTCGA CAGCGCCGAT GCCCTCTGGG AACTACTGCT CAAGGGCGGC CACACCAGCG CGGAGATCCC CACCCAGCGG TGGCGGGCGT ACCGCGAGCG AGGCCCGGAG TACGAGGCGG TCCTGCGGGA GACGGTCACC GCCGGCAGCT ATCTCGACGA CATCGCCGGC TTCGACGCCG AGTTCTTCGG CCTGACCCCA CGCGAGGCCG CCGAGATGGA CCCGCAGCAG CGGATCCTGC TGGAGGTCGG CTGGACAGCG CTGGAACACG CCGGCCTGCC ACCGACCGGG CTGGCCGGCA GCGACACCGG CGTCTTCGTC GGAGTCAGTA CCACCGACTA CGGAGACCGG CTGCTGGAGG ACCTACCCAC CGTCGAGGCG TGGACCGGCA TCGGCGCGGC CACCTGCGCC CTGGCCAACC GCATCTCCTA CGCCCTCGAC CTGCGCGGAC CGAGTGTCGC CGTCGACACC GCCTGTTCGG CGTCGCTGGT CGCGGTCCAC CTGGCCTGCC AGAGCCTGCT GCTCGGCGAG AGCAGCGTCG CCCTGGCCGG CGGCGTCAAC CTGGTGCTCG CGCCCGGACA GAACGTATCG CTGAACGCCG CCGGCACGCT CGCGCCCGAC GGGGTCAGCA AGTCCTTCGA CCGCGACGCC GACGGCTACG GCCGAGGTGA GGGCTGCGGT GTCCTGGTGC TCAAGCGACT CGACGACGCG GTCCGGGACG GCGACCGAGT GCTGGCCGTG ATCATTGGCA GCGCGGTCAA CCAGGACGGA CGCACCGACG GCATCATGGC GCCCTCCGGG GAGGCCCAGC AGCACGTCGT ACGCCGTGCC TGCGCCCGCG CCGGCATCAC ACCGGACAGC GTCGACTACG TGGAGGCCCA CGGCACCGGC ACCCGCCTGG GCGACCCGGT CGAGGCCGGC GCGCTCTCCG CGGTCTACGG CCCCGGCCGA CCGCCGGAAC GACCCTGCCT GATCGGGTCC ATCAAGTCGA ACATCGGCCA CCTGGAGGGC GCGGCCGGCG TCGCCGGGCT GATGAAGGCG GTCCTAGCGT TGCACCGGGG CCAGATTCCC GGCACCCCGC TGCGCGGCCG GTCGATACCC GCCGTCGACG GCGACGGCAC CGGGCTGCGA CTGGTCACCA GCCCGCTGCC CTGGCCCCGA CGCGACGGGG CCAGCCGGGC CGCCGTCTCC GGCTTCGGAT ACGGCGGCAC CATCGCCCAC GTCATCCTGG AACAAGCCCC GCCTCTGCCG TCCCTCGACA CCGCGGACGA CGGCGAACGT CAGCCCCTGG TGCCACTGTC CGCCCGCTCC GCCGCCGCCC TTCGGGCGCA GGCCGGCCGG CTCGCCGACC GGCTCGCCGC CGACGACCGG ACGAACCTGG CCGACATCCG ATACACCCTG GCGCACCTAC GCGCCCACCT GAGCCACCGC GCCGTGGTCA CCGGCGCCGA CCGCGGCGGA CTGGCCGCCG CGCTGCGGCA ACTCGCAGAC GACCAGGCGG ACGCCAGCAC CGTGTCCGGG GTCGCACCGG GCGGCCGCTC CGTACGCCCG GTGTGGGTGT TCTCCGGGCA CGGTTCACAC TGGCCCGGAA TGGGCCGCGA TCTGCTCAGT CACGAGCCAG CGTTCGCCGC CGTGATCGAC GAGATCGAAC CGGTGGTCGC CGAGGAGGCG GGCTTCTCGC TCCGCACCGC CCTCGGCGCT GCGGAACTTG GCGGCGTCGA CCGGATCCAG 
ATCCTGACCT TCGCCATGCA TCTCGGCCTC GCCGCCGTGT GGCAGGCGTA CGGGGTGCGG CCGGCGGCGG TCATCGGCCA CTCGGTCGGT GAGGTGGCCG CCGCCGTCAC CGCCGGCGTC GTGAGCCCGG TCGATGGTGC CCGACTGATC TGCCGCCGCT CCGCCCTGCT CCGCCGCGCC GCCGGGCGAG GCGCGATGGC CATGGTGACC CTGCCCTTCG CCGACGTCGC CGAACGCCTC GCCGGCCGCG CCAACCTGGT CGCGGCCATC GCCTCCGCCC CCGCCGCCAC GGTGATCTCC GGCGACATCG CCGCGGTGGA CGAGATCATC GAACAGTGGC CCGCCAACCG GATCGCCGTT CGCCGGGTGC AGTCCGAGGT GGCCTTCCAC AGCCCGCACA TGGACCCGCT CATCGACGAG TTGCGCGCCG CGGTGGTCGA CCTGGACCGG GCACCAGCCG TGGTGCCGAT GTACTCGACG GTGCTCGACG ACCCGCGTGC GACACCCAGC TGCGACGGTG ACTACTGGGC GGCCAACCTG CGTCGCCCGG TACGCCTGGT CCAAGCGGTC GAGGCCGCCC TGGCCGACGG GCATCGCGCC TTCCTTGAGA TCTCCGCGCA CCCGGTGGTC GCCCACTCGC TGCGGGAGAC CGCCGACCAC GCCGACGTGT ACATCGGGAC GACCTTGCGG CGGCACGCCC CCGGCCACCG CACCATGGTG GCGGCAGTCG CCGGAGCGTA CTGCCACGGA GCCGAGGTCG ACTGGACGCA CCACTACCCG CAGGGCCGGC TCGTCGACCT GCCCCACTAC GCCTGGCAGC ACCGCCAGCA CTGGCGTGAG CCCGAGCCAC CGGGCACCAC GGGCGGGCAC GACATCGGGT CGCACACGCT ACTCGGCACC CCGACCAGCG TGGCCGGCAG CGAGCTACGT CTCTGGCACA CCGTGCTCAC CGACGCCACC CGCCCGTACC CGGGCCGCCA CCAGGTGCAG GGCGCGGAAC TCGTGCCGGC GGTGGTGTTC GTCGCCACGT TCCTCGCCGC CGCTGCCCGC GACGGTGCTC CTGTGGCGTT GCGGGAGCTG TCGATGCGGG TGCCGCTCGC CACTCACGTG CGCCAGGAGA TCCAGGTGGT CGACGATGAA GGCCAGCTGC GGCTCGCCTC CCGACCGGCC GACGGAGACC CGGCACCGTG GCTGACCCAC GCCACCGCCC TGGCCGTGCC GGCGACCGGC CCGCTCGCCG GCACCGTGGC CACACCGCCG GCCGGCGGTG TGGTCGCCGA CGTCGGTCTG ATCGCAGCCC ACCTGAGCGC CGTGGGGGTG CCGGCCACCG CCTTCGCGTG GACGGTGGAT CGTCTGATCA CTGCCGACGG CGGCCTGCGG GCGCGGGTCC GCTTCCCGGA GTCGGCCGGC GGCTGGGCGG CGATCGTCGA CGCGGCGGTC TCCATCGCAC CGGTTGTCTT CCCCGGTCCG CCCCGACTGC GCCTGGTCGA GGGCGCCGAG TCGGTCACGA TGGCCGGCGC GCCGCCGACG GTCGCGGTGA TCGACGTGAT CCACGACGCC TCCCGGGAGG ACACCGCCTC GGTCCTGGTC AGCGCGCCCG ACGGCACGAT CGTCGCGCAG GTCGACGGGT TGCGGTACCC GGTGGTGCCG GCCGCACCGG ACGGGCCGGC CGACCAGCCC GGTGCGGGCG GCGGGTCGCT GGCCGGGATG GAACCGGACG AATTGCGCGA ACGCCTGATC GACGAAGTGC GCGCCGCGAT CGCCACGGAG ATGAAGCTTC CCGTCGAGTC ACTGGACCCG CGTCTGCCTC 
TCGTACAGCA GGGCTTGGAC TCGGTGATGA CCGTCATCGT GCGGCGGCGG CTGGAAAAGA CGTACCGCCA GCTGCTTCCG GCCTCGCTGT TCTGGCAGCA ACCGACCGTC GTGGCGATCG CGGCCGAACT GACCGAGCTG ATCGCCGCCC CGCCGCAGCC GGCCGGGGTG ACCGCCCGCT GA
Protein sequence | MTRDDPAGRP FQVAVIGVGC RLPGDVDSAD ALWELLLKGG HTSAEIPTQR WRAYRERGPE YEAVLRETVT AGSYLDDIAG FDAEFFGLTP REAAEMDPQQ RILLEVGWTA LEHAGLPPTG LAGSDTGVFV GVSTTDYGDR LLEDLPTVEA WTGIGAATCA LANRISYALD LRGPSVAVDT ACSASLVAVH LACQSLLLGE SSVALAGGVN LVLAPGQNVS LNAAGTLAPD GVSKSFDRDA DGYGRGEGCG VLVLKRLDDA VRDGDRVLAV IIGSAVNQDG RTDGIMAPSG EAQQHVVRRA CARAGITPDS VDYVEAHGTG TRLGDPVEAG ALSAVYGPGR PPERPCLIGS IKSNIGHLEG AAGVAGLMKA VLALHRGQIP GTPLRGRSIP AVDGDGTGLR LVTSPLPWPR RDGASRAAVS GFGYGGTIAH VILEQAPPLP SLDTADDGER QPLVPLSARS AAALRAQAGR LADRLAADDR TNLADIRYTL AHLRAHLSHR AVVTGADRGG LAAALRQLAD DQADASTVSG VAPGGRSVRP VWVFSGHGSH WPGMGRDLLS HEPAFAAVID EIEPVVAEEA GFSLRTALGA AELGGVDRIQ ILTFAMHLGL AAVWQAYGVR PAAVIGHSVG EVAAAVTAGV VSPVDGARLI CRRSALLRRA AGRGAMAMVT LPFADVAERL AGRANLVAAI ASAPAATVIS GDIAAVDEII EQWPANRIAV RRVQSEVAFH SPHMDPLIDE LRAAVVDLDR APAVVPMYST VLDDPRATPS CDGDYWAANL RRPVRLVQAV EAALADGHRA FLEISAHPVV AHSLRETADH ADVYIGTTLR RHAPGHRTMV AAVAGAYCHG AEVDWTHHYP QGRLVDLPHY AWQHRQHWRE PEPPGTTGGH DIGSHTLLGT PTSVAGSELR LWHTVLTDAT RPYPGRHQVQ GAELVPAVVF VATFLAAAAR DGAPVALREL SMRVPLATHV RQEIQVVDDE GQLRLASRPA DGDPAPWLTH ATALAVPATG PLAGTVATPP AGGVVADVGL IAAHLSAVGV PATAFAWTVD RLITADGGLR ARVRFPESAG GWAAIVDAAV SIAPVVFPGP PRLRLVEGAE SVTMAGAPPT VAVIDVIHDA SREDTASVLV SAPDGTIVAQ VDGLRYPVVP AAPDGPADQP GAGGGSLAGM EPDELRERLI DEVRAAIATE MKLPVESLDP RLPLVQQGLD SVMTVIVRRR LEKTYRQLLP ASLFWQQPTV VAIAAELTEL IAAPPQPAGV TAR
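The two sequence fields can be related to each other with a short script. The sketch below is an illustration under stated assumptions, not the database's own method: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the Strand field is "-"), and it uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 differs from
# table 1 only in which codons may act as starts, not in these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as listed in the record.
head = "ATGACCCGGG ATGACCCTGC CGGTCGCCCC"
print(translate(head))  # MTRDDPAGRP
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way. Because this gene starts with ATG, the alternative start codons permitted by table 11 need no special handling here.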