Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3086 |
Symbol | |
ID | 5671465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3640327 |
End bp | 3646194 |
Gene Length | 5868 bp |
Protein Length | 1955 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641241984 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_001507404 |
Protein GI | 158314896 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.930582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGCTA CTCCCAGAAC CGGCGAATTT CCCGCCGAAC GTGCCAGGGG TTCCGTGGGC GGGTTGCGTG ACCGCCTCGC CGGCCAACCG GATTCCGATC AGGTTCGAGC CCTGCTGGAC CTGATAGCGG AACTCGTCGG CGCCCTGACT GATCGAGAGC CCGCGGACAT AGCCGCGGAT CCGATCACGC CCTGGCGCCG CCTCGGGATC TATCGGCAGG TCGCCCGGCA GTTGCAGTCC GAGCTGGAGA CCGCGACCGG CCTGCGGCTG CCGGCGACGC TCTTCTTCGA TCTCCCGTCA CCGGACGCGC TGGCCGGCTA TCTGCGCTCG CGTCTCCTCG GCGGCGCGGA GAACCCACAG GTCACGGCGG TCGCGCCCGC AGGAAGGCCC CGGGAGGACG ACCCGGTTGC CATCGTCGGC ATGGCCTGCC GACTACCAGG TGGCGCGGAT TCCCCGGAGG CACTGTGGGA GCTGGTCCGT GACGGCCGGG ATGCCGTCGT CGGCCTTCCC ACCGACCGGG GGTGGGACGT TGACGCGCTT TATCATCCGG ATCGCGAGCA TCCTGGCACC ATTTACACCC GCGAGGGTGG CTTCCTGCCT GACATCGGCA TGTTCGACCC GGGCTTCTTC GGTATAGGCC CGCGCGAGGC GAACGCCATG GATCCCCAGC AGCGGCTGAT GCTGGAGATC TCCTGGGAGG CGTTGGAACG CGCCGGCGTG GATCCCGGAT CCCTGCGCGG CACCCGGACC GGCGTGTTCA CCGGGGTTTC GCTGCAGGAC TACGGTCCGC CCTGGCACCG CGCTCCGGCC AAGGCCCAGG GCCAGCTGCT CACCGGCAAC GCCCTCGGTG TGATCGCCGG CAGGGTCTCC TACACGTTCG GTCTCCAGGG GCCGTCCCTG TGCGTGGACA CCCAGTGCTC CAGCTCCATG GTGGCCATCC ACCTCGCGGG CCAGGCGCTG TTGTCGGGTG AGTGCGACCT GGCGCTGGCC GGCGGCGTCA CGGTCATGAC GACACCGGGC ATGCTCCTGG AGTTCAGTCG GAAGCAGGGT CTCGCTCCCG ACGGGCGGTG CAAGGCCTTC TCCTCGGACG CCGACGGCAC CGGGTGGGCG GACGGCGCCG GGGTTCTCCT GCTGGAGCGG CTCTCCGACG CCCGGCGCCA CGGCCACCCG GTGCTCGCAC TGATCCGCGG CACGGCCGTC AACCAGGACG GCGCCAGCAA CGGCCTGGCC GCACCCAACG GCCTCTCCCA GCAGCAACTG ATCCACCAGA CCCTGGCAAA CGCCGGTCTG GCGCCCGGCG ACGTGGACGT CCTCGAGGCA CACGGGACCG GCACCGCCCT CGGCGACCCG ATCGAGGCCC AGGCCGTCAT CGCCACCTAC GGCCGGCACC GCCCGGCCGG CCGGCCGCTC CACCTCGGGT CGCTGAAGTC GAACATCGGG CACACGCAGG CTGCCAGCGG AGTGGCCGGC GTCATCAAGA TGGTCCAGGC CATCCGGCAC GGAGTGCTGC CGCGCACCCT CCACGTGCGG GAGCCCTCGC CCCACGTGGA CTGGGCGGAC GGAGGCGTCG CGCTGCTCAC CGAGGAGCGT GGCTGGGAAC GCGACGGCGT CGCGCCGCGC CGGGCGGCGG TATCCGCGTT CGGTGTGAGC GGCACCAACG CCCACACCGT GCTGGAAGAG GCCCCGGCGG TGGCCGACCC AGTGCCGACG GCCCCGGTGC CGATCGTCCT GTCGGGCCGG ACCGAGGCGG CGCTGCGGGC CCAGGCGGCA CGGCTCGGCG AACGCCTGGC CCGGGACCCG GACCTCGATC CGGTTGACAT CGCCTTCACC CTCGCCCGGC GCACCCGGTT CGAGTCGCGG GCAGTCGCCG TCGTCCCGGC CGGACCCGCC GGGCGCGCAC GCCTGGCCGG CGCCCTCGCC GCGCTCGCCG AGAACCGCCC CGCGGCGGGG CTCGTCGCGG GCGCCGCCCG GCCGGCGATC GCCCACGGCC GGACCGCCGT CCTGTTCACC GGGCAGGGCA GCCAGCACCC GGGGATGGGC CGTGACCTCT ACCAGGCCTA CCCGGTGTTC GCGCGGGCGC TGGACGAGAT CTGCGGCCGG TTCGCGTCGC TGCTCGAGCG TCCGCTGCGC GAGGTCATGT TCGCTCCCGC CGGCTCGCCG GACGCGGCGC TGCTCGACCA GACCGCCTAC ACCCAGTGCG CGCTGTTCGC GTTCGAGACC GCGCTGTTCC GCCTCGCGGA GTCCTGGGGA TTCGTTCCCG ACACCCTCGC CGGGCACTCC ATCGGCGAAC TGACGGCCGC CCACGTCGCC GGCGTCTGGT CGCTCGACGA CGCGTGCCGG CTGGTCGCCG CCCGCGGCCG GCTGATGCAG GAGTGCCGAC CGGGTGGCGC GATGGCCGCC ATCGGCGCGA GCGAGGCGGA GGTCCGGGCG TCGATCGCCG ACCTCGTCGG CAAGGTCGAG ATCGCCACGG TCAACAGCCC GTCCGCCACG GTCGTCGCCG GCGACGCCGA GCTCGTCGAG CGGGTGGCCG CTGAGTGGTC GGCCCGGGGG CGGCGCACGA AACGGCTGAC CGTCAGCCAT GCTTTTCACT CCCCGCACAT GGAGGACATG CTCACGGCGT TCCGGACGGC GGCCGGGCAG GTCACCTACC ACGCCCCGGC CATACCGGTC ATGTCCAACC TCACCGGTGA ACCGGCCACC GCGACGGAAC TGACGTCCGC CGAGTACTGG GTGCGGCACG TGCGTGAGCC GGTCCGCTTC CTCGACGGGG TTCGCCGGCT GCACCGGGAC GGCGTGACGG CGTTCGTCGA ACTCGGTCCC GACGCGGTCC TGACCGCGCT CGTGCCGGCC TGCCTGCCGG CGAACGCCGC AGGCGGCGGC GCGGAGACGG TGCAGGTAGC GGCCTGCCGG ACCGGACGTC CCGAGCCCGA GACGCTGCTC GCCGCCCTGG CCGAGCTGGA CGCCGCGGGC ATCCCCGTCG CCTGGCGGGC GCTGCCGCCC GTACGCGGCG GAAAACACCT GGACCTGCCG ACCTACGCCT TCCAGCGCAC GCACCACTGG ATGGACGGGA CGCCCGTCAC AACCGCCACC GCGACGGACC GGGGAGCCGC CGAGCGGGCA ACGGATCCGG AACACTGGCG CTACCACGTG CGGTGGGAGC CGTTCTCCGA CGAGCAGGCC ACCGCGACCG TCGCCGACCA CGTCGCCAAC GGCGCGCCGA TGACCGGCCT GTGGCTGCTG GTGACCCCGC CGGCTGGGAT CGACGACGAC ACGCTGTCCC GGCTGTCCTG GGTCGTCGAA CGCCTCGGCG GAACACCGGT CCAGGTCCCC CTCTCCGGGA CGGACGCGGA GCGCGGTCTT GTCGCCGCGG CCCTGTCGAA GCACATCTCC GGCCGCGAGA AGGACATCGG CGGCGTCCTC TCGCTGCTCG CGTTCGATAG CCGGTTCAAC CCCGTGCACC CGGAACTGAC CAACGGACTC GCGCTGACCT GCGCACTCGT GCAGGCACTC GACGACCTCG GCCAGTACGC GCCGTACTGG TGCGCCACGC GCGGCGCCGT CAGCACCGGC CCCGCCGATC CGGTCAGCGC TCCGGCCCAG GCGATGCTCT GGGGACTGGG CCGGACGCTG GCCCTCGAAC GACCCCGTGG CTGGGGCGGT CTCGTCGACC TTCCGGCCGA TCTCGACGAC GAGACGCTGG CCCTGTTCGG CGCCGCGCTG ACCGGCCCCG GCGGCGAGGA CCAGCTGGCC CTGCGGGACG GCGCGCTGTT CGTCGCCCGG CTCGCCCCCG CGGGCCCCCG CCCCGGCGTC GGCACGGGAG CGGGCGTCAG CTCCGCCGGC GGGGCCGACG GGGTGGGGAC GCCCGCGTGG CGCCCGAACG GAACCGTGCT GATCACCGGC GGTACCGGAG CCCTAGGCGC GCAGGTCGCC CGCCGGCTCG CCCGCAACGG GACGTCCCGA CTCGTCCTGG CGAGCCGACG CGGCCCGGCC GCTCCCGGGG CCGCGGAACT GGTCGCCGAG CTCTCCGCGC TGGGCACCGA GGCATCCGTC GTGGCGTGCG ACCTGGCCGA CCGCGAACAG GTCATCGCTC TGCTCGACGA GGCGGCCGGC GGCGGCGAAC CGCTGACCGC GGTCGTCCAC GCCGCCGGCG TCATCGGCCG GACCGCGCCG CTGCGCGAAC TCACCCTCAG CGAGTTCGCG CAGGTGGTGA CGGGCAAGGC CACCGGCGCG GCGCTACTCG ACACACTGAC CCGCCCGGAC GGCGCACGGC CCATCCCGCT GGAAGCCTTC GTGCTGATCT CGTCGATCTC GGCGACGTGG GGCAGCGGCG GCCAGCCGGC CTACTCCGCC GGCAACGCCT ACCTGGACGC CCTCGCCTCG CACCGGGCCG GCCACGGTCT GCCCGCCACC TCCGTCGCGT TCGGCCCCTG GGCCGAGGCC GGTCTCGGCG CCGAGCCGGG CCTGCGCGAC TACCTCCGCG AACGCGGTCT CGCGCCGCTG CCGGTGGAGC CGGCCGTCAC CGTCCTCACC GAAGCGGTCG CTAGCAGCGA GGCGGCGACC ACCGTCGTCG ACGTCGACTG GGGACGCTTC CTGCCGCCGT TCACCGCGCT CCGTCCCAGC CGCTTCTTCG ACGGCCTGCC CACCCGGGCC GAGCACGGTG CCGGACCGGC GCCGACCGCC GGCTCCGCTC CGGACGGGCC AGCGGACGGC AGGCGGATGG ACCTCGCCGG CCTGACCAGC GAGGAACGTG TCAGCGCGCT GGCCCGGCTG GTTCGCGAGG AGGCCGCGGT CATCCTGGAG CACGAGTCAC CCGACAACGT GGACCCGCGG CGCCGGTTCC TCGACCTCGG TTTCGACTCG CTGGCCTCCG TTCAGCTCAG CCGCCGGCTC ACCGCCGCCA CCGGGCTGGC GCTGACACCG CCGGTCGTCT TCGAACACCC GACCGTGACC GAGCTGGCCG AGTACCTCGC GACCCTCGTC GGTGCCGGGC GGCCGGCCGC GCCGACCACC CACGCCGCGC CGGCCGGGGT CCGCGACCTG TACCGGCAGG CGTGCTCGGA CGGGAAGTTC GTCGAGGGCG TCGAGATCCT GCAGGCCGTC GCCAAGCTGC GGCCCGTCTT CCACGACGCG GCCGACTTCG GCCCGGTGCC GCCACCGGTC CGGCTCTCCG CCGGCCCGGC CCCCTGCACG CTCGTCTGCG TGCCCTCAAT GGTCGCCCCG TCCGGGCCGC ACAGCTTCGC CCGGCTCGCC CTCCACCTGC ACGGCCAACG AGACGTCTAC GGACTGTCCC TCCCCGGATT CGGGGAAGGT GAAAAGCTAC CCGCCTCGTC CGTCCTCGTC GTCGAGATAC TGACGGACCT GGTCGCGGCC CATTTCATCG GCGTACCCAT CGCCATCGCC GGATACTCGT CCGGCGGCTG GCTCGCGCAC GCGGTCGCCG CCGGTCTGGA GGAACGCGGC ATCCACCCAA AGGCGGTCCT GCTGCTCGAC ACCTGGCTCC CCGGCGACCG GATTCCCGCC GAGGAGATCC AGGAGGAACT GCGCGGGATC GCCGTGAACG ACCAGGCGTT CGCGCTCATG ACCGAGGCAC AAGTCACCGC CCAGGGCGCC TACCTGACCC TGTTCGAGAA ATGGAAGCCG AACCCGGTCT ACGCACCGAT CGTGCTCGTC CGGGCCGAGG AACGCATGCC CCAGCTCTCC CCCGACGACC AGTCCACGAT CGAGGAACAC GGCTGGACGA CCGACTGGGA GATCGACCAT CTCACGCTGG ACGTCAGCGG CAACCACCAG ACGATGATGA ACGAGCACGC CATCTCCACC GCGCGAAATC TCCACCACTG GCTCAACAAC CTCGACCAGT CGCCCTGA
|
Protein sequence | MHATPRTGEF PAERARGSVG GLRDRLAGQP DSDQVRALLD LIAELVGALT DREPADIAAD PITPWRRLGI YRQVARQLQS ELETATGLRL PATLFFDLPS PDALAGYLRS RLLGGAENPQ VTAVAPAGRP REDDPVAIVG MACRLPGGAD SPEALWELVR DGRDAVVGLP TDRGWDVDAL YHPDREHPGT IYTREGGFLP DIGMFDPGFF GIGPREANAM DPQQRLMLEI SWEALERAGV DPGSLRGTRT GVFTGVSLQD YGPPWHRAPA KAQGQLLTGN ALGVIAGRVS YTFGLQGPSL CVDTQCSSSM VAIHLAGQAL LSGECDLALA GGVTVMTTPG MLLEFSRKQG LAPDGRCKAF SSDADGTGWA DGAGVLLLER LSDARRHGHP VLALIRGTAV NQDGASNGLA APNGLSQQQL IHQTLANAGL APGDVDVLEA HGTGTALGDP IEAQAVIATY GRHRPAGRPL HLGSLKSNIG HTQAASGVAG VIKMVQAIRH GVLPRTLHVR EPSPHVDWAD GGVALLTEER GWERDGVAPR RAAVSAFGVS GTNAHTVLEE APAVADPVPT APVPIVLSGR TEAALRAQAA RLGERLARDP DLDPVDIAFT LARRTRFESR AVAVVPAGPA GRARLAGALA ALAENRPAAG LVAGAARPAI AHGRTAVLFT GQGSQHPGMG RDLYQAYPVF ARALDEICGR FASLLERPLR EVMFAPAGSP DAALLDQTAY TQCALFAFET ALFRLAESWG FVPDTLAGHS IGELTAAHVA GVWSLDDACR LVAARGRLMQ ECRPGGAMAA IGASEAEVRA SIADLVGKVE IATVNSPSAT VVAGDAELVE RVAAEWSARG RRTKRLTVSH AFHSPHMEDM LTAFRTAAGQ VTYHAPAIPV MSNLTGEPAT ATELTSAEYW VRHVREPVRF LDGVRRLHRD GVTAFVELGP DAVLTALVPA CLPANAAGGG AETVQVAACR TGRPEPETLL AALAELDAAG IPVAWRALPP VRGGKHLDLP TYAFQRTHHW MDGTPVTTAT ATDRGAAERA TDPEHWRYHV RWEPFSDEQA TATVADHVAN GAPMTGLWLL VTPPAGIDDD TLSRLSWVVE RLGGTPVQVP LSGTDAERGL VAAALSKHIS GREKDIGGVL SLLAFDSRFN PVHPELTNGL ALTCALVQAL DDLGQYAPYW CATRGAVSTG PADPVSAPAQ AMLWGLGRTL ALERPRGWGG LVDLPADLDD ETLALFGAAL TGPGGEDQLA LRDGALFVAR LAPAGPRPGV GTGAGVSSAG GADGVGTPAW RPNGTVLITG GTGALGAQVA RRLARNGTSR LVLASRRGPA APGAAELVAE LSALGTEASV VACDLADREQ VIALLDEAAG GGEPLTAVVH AAGVIGRTAP LRELTLSEFA QVVTGKATGA ALLDTLTRPD GARPIPLEAF VLISSISATW GSGGQPAYSA GNAYLDALAS HRAGHGLPAT SVAFGPWAEA GLGAEPGLRD YLRERGLAPL PVEPAVTVLT EAVASSEAAT TVVDVDWGRF LPPFTALRPS RFFDGLPTRA EHGAGPAPTA GSAPDGPADG RRMDLAGLTS EERVSALARL VREEAAVILE HESPDNVDPR RRFLDLGFDS LASVQLSRRL TAATGLALTP PVVFEHPTVT ELAEYLATLV GAGRPAAPTT HAAPAGVRDL YRQACSDGKF VEGVEILQAV AKLRPVFHDA ADFGPVPPPV RLSAGPAPCT LVCVPSMVAP SGPHSFARLA LHLHGQRDVY GLSLPGFGEG EKLPASSVLV VEILTDLVAA HFIGVPIAIA GYSSGGWLAH AVAAGLEERG IHPKAVLLLD TWLPGDRIPA EEIQEELRGI AVNDQAFALM TEAQVTAQGA YLTLFEKWKP NPVYAPIVLV RAEERMPQLS PDDQSTIEEH GWTTDWEIDH LTLDVSGNHQ TMMNEHAIST ARNLHHWLNN LDQSP
|
| |