Gene Franean1_3086 details


Gene Information

Locus tag: Franean1_3086
Symbol:
ID: 5671465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3640327
End bp: 3646194
Gene Length: 5868 bp
Protein Length: 1955 aa
Translation table: 11
GC content: 73%
IMG OID: 641241984
Product: Beta-ketoacyl synthase
Protein accession: YP_001507404
Protein GI: 158314896
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
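
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 3640327
end_bp = 3646194
gene_length_bp = 5868
protein_length_aa = 1955

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codons = gene_length_bp // 3
expected_aa = codons - 1

print(span == gene_length_bp)            # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)           # length is a whole number of codons
print(expected_aa == protein_length_aa)  # codons minus stop agree with the protein length
```

Under these assumptions all three checks hold for this record: 3646194 - 3640327 + 1 = 5868 bp, and 5868 / 3 - 1 = 1955 aa.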


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.930582
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGCTA CTCCCAGAAC CGGCGAATTT CCCGCCGAAC GTGCCAGGGG TTCCGTGGGC 
GGGTTGCGTG ACCGCCTCGC CGGCCAACCG GATTCCGATC AGGTTCGAGC CCTGCTGGAC
CTGATAGCGG AACTCGTCGG CGCCCTGACT GATCGAGAGC CCGCGGACAT AGCCGCGGAT
CCGATCACGC CCTGGCGCCG CCTCGGGATC TATCGGCAGG TCGCCCGGCA GTTGCAGTCC
GAGCTGGAGA CCGCGACCGG CCTGCGGCTG CCGGCGACGC TCTTCTTCGA TCTCCCGTCA
CCGGACGCGC TGGCCGGCTA TCTGCGCTCG CGTCTCCTCG GCGGCGCGGA GAACCCACAG
GTCACGGCGG TCGCGCCCGC AGGAAGGCCC CGGGAGGACG ACCCGGTTGC CATCGTCGGC
ATGGCCTGCC GACTACCAGG TGGCGCGGAT TCCCCGGAGG CACTGTGGGA GCTGGTCCGT
GACGGCCGGG ATGCCGTCGT CGGCCTTCCC ACCGACCGGG GGTGGGACGT TGACGCGCTT
TATCATCCGG ATCGCGAGCA TCCTGGCACC ATTTACACCC GCGAGGGTGG CTTCCTGCCT
GACATCGGCA TGTTCGACCC GGGCTTCTTC GGTATAGGCC CGCGCGAGGC GAACGCCATG
GATCCCCAGC AGCGGCTGAT GCTGGAGATC TCCTGGGAGG CGTTGGAACG CGCCGGCGTG
GATCCCGGAT CCCTGCGCGG CACCCGGACC GGCGTGTTCA CCGGGGTTTC GCTGCAGGAC
TACGGTCCGC CCTGGCACCG CGCTCCGGCC AAGGCCCAGG GCCAGCTGCT CACCGGCAAC
GCCCTCGGTG TGATCGCCGG CAGGGTCTCC TACACGTTCG GTCTCCAGGG GCCGTCCCTG
TGCGTGGACA CCCAGTGCTC CAGCTCCATG GTGGCCATCC ACCTCGCGGG CCAGGCGCTG
TTGTCGGGTG AGTGCGACCT GGCGCTGGCC GGCGGCGTCA CGGTCATGAC GACACCGGGC
ATGCTCCTGG AGTTCAGTCG GAAGCAGGGT CTCGCTCCCG ACGGGCGGTG CAAGGCCTTC
TCCTCGGACG CCGACGGCAC CGGGTGGGCG GACGGCGCCG GGGTTCTCCT GCTGGAGCGG
CTCTCCGACG CCCGGCGCCA CGGCCACCCG GTGCTCGCAC TGATCCGCGG CACGGCCGTC
AACCAGGACG GCGCCAGCAA CGGCCTGGCC GCACCCAACG GCCTCTCCCA GCAGCAACTG
ATCCACCAGA CCCTGGCAAA CGCCGGTCTG GCGCCCGGCG ACGTGGACGT CCTCGAGGCA
CACGGGACCG GCACCGCCCT CGGCGACCCG ATCGAGGCCC AGGCCGTCAT CGCCACCTAC
GGCCGGCACC GCCCGGCCGG CCGGCCGCTC CACCTCGGGT CGCTGAAGTC GAACATCGGG
CACACGCAGG CTGCCAGCGG AGTGGCCGGC GTCATCAAGA TGGTCCAGGC CATCCGGCAC
GGAGTGCTGC CGCGCACCCT CCACGTGCGG GAGCCCTCGC CCCACGTGGA CTGGGCGGAC
GGAGGCGTCG CGCTGCTCAC CGAGGAGCGT GGCTGGGAAC GCGACGGCGT CGCGCCGCGC
CGGGCGGCGG TATCCGCGTT CGGTGTGAGC GGCACCAACG CCCACACCGT GCTGGAAGAG
GCCCCGGCGG TGGCCGACCC AGTGCCGACG GCCCCGGTGC CGATCGTCCT GTCGGGCCGG
ACCGAGGCGG CGCTGCGGGC CCAGGCGGCA CGGCTCGGCG AACGCCTGGC CCGGGACCCG
GACCTCGATC CGGTTGACAT CGCCTTCACC CTCGCCCGGC GCACCCGGTT CGAGTCGCGG
GCAGTCGCCG TCGTCCCGGC CGGACCCGCC GGGCGCGCAC GCCTGGCCGG CGCCCTCGCC
GCGCTCGCCG AGAACCGCCC CGCGGCGGGG CTCGTCGCGG GCGCCGCCCG GCCGGCGATC
GCCCACGGCC GGACCGCCGT CCTGTTCACC GGGCAGGGCA GCCAGCACCC GGGGATGGGC
CGTGACCTCT ACCAGGCCTA CCCGGTGTTC GCGCGGGCGC TGGACGAGAT CTGCGGCCGG
TTCGCGTCGC TGCTCGAGCG TCCGCTGCGC GAGGTCATGT TCGCTCCCGC CGGCTCGCCG
GACGCGGCGC TGCTCGACCA GACCGCCTAC ACCCAGTGCG CGCTGTTCGC GTTCGAGACC
GCGCTGTTCC GCCTCGCGGA GTCCTGGGGA TTCGTTCCCG ACACCCTCGC CGGGCACTCC
ATCGGCGAAC TGACGGCCGC CCACGTCGCC GGCGTCTGGT CGCTCGACGA CGCGTGCCGG
CTGGTCGCCG CCCGCGGCCG GCTGATGCAG GAGTGCCGAC CGGGTGGCGC GATGGCCGCC
ATCGGCGCGA GCGAGGCGGA GGTCCGGGCG TCGATCGCCG ACCTCGTCGG CAAGGTCGAG
ATCGCCACGG TCAACAGCCC GTCCGCCACG GTCGTCGCCG GCGACGCCGA GCTCGTCGAG
CGGGTGGCCG CTGAGTGGTC GGCCCGGGGG CGGCGCACGA AACGGCTGAC CGTCAGCCAT
GCTTTTCACT CCCCGCACAT GGAGGACATG CTCACGGCGT TCCGGACGGC GGCCGGGCAG
GTCACCTACC ACGCCCCGGC CATACCGGTC ATGTCCAACC TCACCGGTGA ACCGGCCACC
GCGACGGAAC TGACGTCCGC CGAGTACTGG GTGCGGCACG TGCGTGAGCC GGTCCGCTTC
CTCGACGGGG TTCGCCGGCT GCACCGGGAC GGCGTGACGG CGTTCGTCGA ACTCGGTCCC
GACGCGGTCC TGACCGCGCT CGTGCCGGCC TGCCTGCCGG CGAACGCCGC AGGCGGCGGC
GCGGAGACGG TGCAGGTAGC GGCCTGCCGG ACCGGACGTC CCGAGCCCGA GACGCTGCTC
GCCGCCCTGG CCGAGCTGGA CGCCGCGGGC ATCCCCGTCG CCTGGCGGGC GCTGCCGCCC
GTACGCGGCG GAAAACACCT GGACCTGCCG ACCTACGCCT TCCAGCGCAC GCACCACTGG
ATGGACGGGA CGCCCGTCAC AACCGCCACC GCGACGGACC GGGGAGCCGC CGAGCGGGCA
ACGGATCCGG AACACTGGCG CTACCACGTG CGGTGGGAGC CGTTCTCCGA CGAGCAGGCC
ACCGCGACCG TCGCCGACCA CGTCGCCAAC GGCGCGCCGA TGACCGGCCT GTGGCTGCTG
GTGACCCCGC CGGCTGGGAT CGACGACGAC ACGCTGTCCC GGCTGTCCTG GGTCGTCGAA
CGCCTCGGCG GAACACCGGT CCAGGTCCCC CTCTCCGGGA CGGACGCGGA GCGCGGTCTT
GTCGCCGCGG CCCTGTCGAA GCACATCTCC GGCCGCGAGA AGGACATCGG CGGCGTCCTC
TCGCTGCTCG CGTTCGATAG CCGGTTCAAC CCCGTGCACC CGGAACTGAC CAACGGACTC
GCGCTGACCT GCGCACTCGT GCAGGCACTC GACGACCTCG GCCAGTACGC GCCGTACTGG
TGCGCCACGC GCGGCGCCGT CAGCACCGGC CCCGCCGATC CGGTCAGCGC TCCGGCCCAG
GCGATGCTCT GGGGACTGGG CCGGACGCTG GCCCTCGAAC GACCCCGTGG CTGGGGCGGT
CTCGTCGACC TTCCGGCCGA TCTCGACGAC GAGACGCTGG CCCTGTTCGG CGCCGCGCTG
ACCGGCCCCG GCGGCGAGGA CCAGCTGGCC CTGCGGGACG GCGCGCTGTT CGTCGCCCGG
CTCGCCCCCG CGGGCCCCCG CCCCGGCGTC GGCACGGGAG CGGGCGTCAG CTCCGCCGGC
GGGGCCGACG GGGTGGGGAC GCCCGCGTGG CGCCCGAACG GAACCGTGCT GATCACCGGC
GGTACCGGAG CCCTAGGCGC GCAGGTCGCC CGCCGGCTCG CCCGCAACGG GACGTCCCGA
CTCGTCCTGG CGAGCCGACG CGGCCCGGCC GCTCCCGGGG CCGCGGAACT GGTCGCCGAG
CTCTCCGCGC TGGGCACCGA GGCATCCGTC GTGGCGTGCG ACCTGGCCGA CCGCGAACAG
GTCATCGCTC TGCTCGACGA GGCGGCCGGC GGCGGCGAAC CGCTGACCGC GGTCGTCCAC
GCCGCCGGCG TCATCGGCCG GACCGCGCCG CTGCGCGAAC TCACCCTCAG CGAGTTCGCG
CAGGTGGTGA CGGGCAAGGC CACCGGCGCG GCGCTACTCG ACACACTGAC CCGCCCGGAC
GGCGCACGGC CCATCCCGCT GGAAGCCTTC GTGCTGATCT CGTCGATCTC GGCGACGTGG
GGCAGCGGCG GCCAGCCGGC CTACTCCGCC GGCAACGCCT ACCTGGACGC CCTCGCCTCG
CACCGGGCCG GCCACGGTCT GCCCGCCACC TCCGTCGCGT TCGGCCCCTG GGCCGAGGCC
GGTCTCGGCG CCGAGCCGGG CCTGCGCGAC TACCTCCGCG AACGCGGTCT CGCGCCGCTG
CCGGTGGAGC CGGCCGTCAC CGTCCTCACC GAAGCGGTCG CTAGCAGCGA GGCGGCGACC
ACCGTCGTCG ACGTCGACTG GGGACGCTTC CTGCCGCCGT TCACCGCGCT CCGTCCCAGC
CGCTTCTTCG ACGGCCTGCC CACCCGGGCC GAGCACGGTG CCGGACCGGC GCCGACCGCC
GGCTCCGCTC CGGACGGGCC AGCGGACGGC AGGCGGATGG ACCTCGCCGG CCTGACCAGC
GAGGAACGTG TCAGCGCGCT GGCCCGGCTG GTTCGCGAGG AGGCCGCGGT CATCCTGGAG
CACGAGTCAC CCGACAACGT GGACCCGCGG CGCCGGTTCC TCGACCTCGG TTTCGACTCG
CTGGCCTCCG TTCAGCTCAG CCGCCGGCTC ACCGCCGCCA CCGGGCTGGC GCTGACACCG
CCGGTCGTCT TCGAACACCC GACCGTGACC GAGCTGGCCG AGTACCTCGC GACCCTCGTC
GGTGCCGGGC GGCCGGCCGC GCCGACCACC CACGCCGCGC CGGCCGGGGT CCGCGACCTG
TACCGGCAGG CGTGCTCGGA CGGGAAGTTC GTCGAGGGCG TCGAGATCCT GCAGGCCGTC
GCCAAGCTGC GGCCCGTCTT CCACGACGCG GCCGACTTCG GCCCGGTGCC GCCACCGGTC
CGGCTCTCCG CCGGCCCGGC CCCCTGCACG CTCGTCTGCG TGCCCTCAAT GGTCGCCCCG
TCCGGGCCGC ACAGCTTCGC CCGGCTCGCC CTCCACCTGC ACGGCCAACG AGACGTCTAC
GGACTGTCCC TCCCCGGATT CGGGGAAGGT GAAAAGCTAC CCGCCTCGTC CGTCCTCGTC
GTCGAGATAC TGACGGACCT GGTCGCGGCC CATTTCATCG GCGTACCCAT CGCCATCGCC
GGATACTCGT CCGGCGGCTG GCTCGCGCAC GCGGTCGCCG CCGGTCTGGA GGAACGCGGC
ATCCACCCAA AGGCGGTCCT GCTGCTCGAC ACCTGGCTCC CCGGCGACCG GATTCCCGCC
GAGGAGATCC AGGAGGAACT GCGCGGGATC GCCGTGAACG ACCAGGCGTT CGCGCTCATG
ACCGAGGCAC AAGTCACCGC CCAGGGCGCC TACCTGACCC TGTTCGAGAA ATGGAAGCCG
AACCCGGTCT ACGCACCGAT CGTGCTCGTC CGGGCCGAGG AACGCATGCC CCAGCTCTCC
CCCGACGACC AGTCCACGAT CGAGGAACAC GGCTGGACGA CCGACTGGGA GATCGACCAT
CTCACGCTGG ACGTCAGCGG CAACCACCAG ACGATGATGA ACGAGCACGC CATCTCCACC
GCGCGAAATC TCCACCACTG GCTCAACAAC CTCGACCAGT CGCCCTGA
 
Protein sequence
MHATPRTGEF PAERARGSVG GLRDRLAGQP DSDQVRALLD LIAELVGALT DREPADIAAD 
PITPWRRLGI YRQVARQLQS ELETATGLRL PATLFFDLPS PDALAGYLRS RLLGGAENPQ
VTAVAPAGRP REDDPVAIVG MACRLPGGAD SPEALWELVR DGRDAVVGLP TDRGWDVDAL
YHPDREHPGT IYTREGGFLP DIGMFDPGFF GIGPREANAM DPQQRLMLEI SWEALERAGV
DPGSLRGTRT GVFTGVSLQD YGPPWHRAPA KAQGQLLTGN ALGVIAGRVS YTFGLQGPSL
CVDTQCSSSM VAIHLAGQAL LSGECDLALA GGVTVMTTPG MLLEFSRKQG LAPDGRCKAF
SSDADGTGWA DGAGVLLLER LSDARRHGHP VLALIRGTAV NQDGASNGLA APNGLSQQQL
IHQTLANAGL APGDVDVLEA HGTGTALGDP IEAQAVIATY GRHRPAGRPL HLGSLKSNIG
HTQAASGVAG VIKMVQAIRH GVLPRTLHVR EPSPHVDWAD GGVALLTEER GWERDGVAPR
RAAVSAFGVS GTNAHTVLEE APAVADPVPT APVPIVLSGR TEAALRAQAA RLGERLARDP
DLDPVDIAFT LARRTRFESR AVAVVPAGPA GRARLAGALA ALAENRPAAG LVAGAARPAI
AHGRTAVLFT GQGSQHPGMG RDLYQAYPVF ARALDEICGR FASLLERPLR EVMFAPAGSP
DAALLDQTAY TQCALFAFET ALFRLAESWG FVPDTLAGHS IGELTAAHVA GVWSLDDACR
LVAARGRLMQ ECRPGGAMAA IGASEAEVRA SIADLVGKVE IATVNSPSAT VVAGDAELVE
RVAAEWSARG RRTKRLTVSH AFHSPHMEDM LTAFRTAAGQ VTYHAPAIPV MSNLTGEPAT
ATELTSAEYW VRHVREPVRF LDGVRRLHRD GVTAFVELGP DAVLTALVPA CLPANAAGGG
AETVQVAACR TGRPEPETLL AALAELDAAG IPVAWRALPP VRGGKHLDLP TYAFQRTHHW
MDGTPVTTAT ATDRGAAERA TDPEHWRYHV RWEPFSDEQA TATVADHVAN GAPMTGLWLL
VTPPAGIDDD TLSRLSWVVE RLGGTPVQVP LSGTDAERGL VAAALSKHIS GREKDIGGVL
SLLAFDSRFN PVHPELTNGL ALTCALVQAL DDLGQYAPYW CATRGAVSTG PADPVSAPAQ
AMLWGLGRTL ALERPRGWGG LVDLPADLDD ETLALFGAAL TGPGGEDQLA LRDGALFVAR
LAPAGPRPGV GTGAGVSSAG GADGVGTPAW RPNGTVLITG GTGALGAQVA RRLARNGTSR
LVLASRRGPA APGAAELVAE LSALGTEASV VACDLADREQ VIALLDEAAG GGEPLTAVVH
AAGVIGRTAP LRELTLSEFA QVVTGKATGA ALLDTLTRPD GARPIPLEAF VLISSISATW
GSGGQPAYSA GNAYLDALAS HRAGHGLPAT SVAFGPWAEA GLGAEPGLRD YLRERGLAPL
PVEPAVTVLT EAVASSEAAT TVVDVDWGRF LPPFTALRPS RFFDGLPTRA EHGAGPAPTA
GSAPDGPADG RRMDLAGLTS EERVSALARL VREEAAVILE HESPDNVDPR RRFLDLGFDS
LASVQLSRRL TAATGLALTP PVVFEHPTVT ELAEYLATLV GAGRPAAPTT HAAPAGVRDL
YRQACSDGKF VEGVEILQAV AKLRPVFHDA ADFGPVPPPV RLSAGPAPCT LVCVPSMVAP
SGPHSFARLA LHLHGQRDVY GLSLPGFGEG EKLPASSVLV VEILTDLVAA HFIGVPIAIA
GYSSGGWLAH AVAAGLEERG IHPKAVLLLD TWLPGDRIPA EEIQEELRGI AVNDQAFALM
TEAQVTAQGA YLTLFEKWKP NPVYAPIVLV RAEERMPQLS PDDQSTIEEH GWTTDWEIDH
LTLDVSGNHQ TMMNEHAIST ARNLHHWLNN LDQSP
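
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the Gene Information section. The following is a minimal sketch of that translation, not the tool the database used. It assumes the amino-acid assignments of NCBI table 11, which match the standard code, and it ignores alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and any base other than A, C, G, or T raises a `KeyError`.

```python
# Amino-acid assignments for NCBI translation table 11, with codons ordered
# by base T, C, A, G at each position (same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces and line breaks between the 10-base blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGCACGCTA CTCCCAGAAC CGGCGAATTT"))  # MHATPRTGEF
print(translate("CTCGACCAGT CGCCCTGA"))               # LDQSP
```

The two outputs match the first ten and last five residues of the protein sequence listed above.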