Gene Franean1_5779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5779 
Symbol 
ID5674104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7018216 
End bp7024359 
Gene Length6144 bp 
Protein Length2047 aa 
Translation table11 
GC content73% 
IMG OID641244630 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001510033 
Protein GI158317525 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCGTG AGTCGGTGAC AGTCGACCAT CTTCGCCCTA CCCCCGGTTC GCCCGAGTCG 
GACGGCGTAC TTCTCACGCG ACTGCTGCGT GAGAAGTACG AGCCGGTCGC CGTCGTCGGC
ATGGGCCTGC GGCTGCCGGG CGGTTCGGAG TCACCGGACG AGTTCGAGGA TTTCCTGCGG
GCCGGCCGGT CCGGTGTCGG TCCGTTGCCG AAGGACCGCT GGGACCCCGA CCTGTTCGTC
CCCGACGACC CGTCGGAGAA GGGGAAGATC CAAACAACCG GCGGCGGTTT CCTCGACCGG
ATCGACCTGT TCGACGCGCC GTTCTTCAAC ATCTCGCCGA AAGAAGCGCA GTACGTCGAC
CCGCAGCAGC GGCTGCTTCT CGAAACCGCC TGGCACGCGC TCGAACACGC CAACATCGAC
CCGACGCCGC TGCGTCGAGG CAACGGCGGC GTCTACGTCG GTGCCAGTTC CATCGACTAC
GCCCTCGAAC TCGACGGGCT GCCGTACGAG GCGCTCGACG GCCTGCTCGC GTCGGGCATC
TCGATGTTCC CGCTGTCCGG CCGGTTGTCG TACTTTCTGG GCTGGCGCGG CCCGAGCGTC
AGCGTCGACA CCGCCTGCTC GTCGTCGTTG AGTGCGCTGC ACCTCGCGGT GGAAGGGCTG
CGCCGGCGCG AATGCGACCT GGCGCTGTGC GGCGGGGTGA ACGCGCTGCA CCACCCGCGC
ATCATGGTCA TGTTCTCCCA CGGCCAGATG CTCGCACCGG ACGGTCAGTG CAAGACCTTC
GACGACGCGG CGGACGGCTA CGTCCGGGCC GAGGGTTGCG GCGTCCTCGT CCTCAAACGG
CTCTCCGACG CCGAACGTGA CGGGGACACG GTCCTCGCGC TGATCCGTGG CACCGCCATC
GGCCAGGACG GTGACAGCGC CGGGCTGACC GTGCCCAACG GTCCGGCCCA GGAACTGGTG
ATCCGCCGAG CGATCGCCGC AGCCCGGCTG GAACCGCGCG ACATCCAGTA CGTCGAGGCG
CACGGGACCG GGACGCCGAT CGGCGACCCG ATCGAACTCG GCGCTGTCAA CGACGTCTTC
TCGGCCTCGC ACACCCACGA CGACCCGCTG CTGGTCGGTT CGGTGAAGAC GAACATCGGC
CACACCGAAC CGCTGTCCGG CCTCGTCGGC GTCATCAAGA CCGTCCTGCA GCTGCGCGCT
GGCACGATCT TCCCGCACCT CAACTTCCAC CAGCCGTCGT CCCGGATCCC GTGGGACGTG
TACCCGGTGC GGGTGCCGAC CGAGCTCGAA CCGTGGCCGG GGCCGGTTCG CCGCGCGGTG
GTCAACAGTT TCGGTTTCGC CGGGACGATC GCGGCGGCTG TCGTCGAGCA GGCACCGCGA
CCGGACGCGC CGGCCGCCGG CGAGCTTGCT GGTGACAAGC TTGCTGGTGA CGAGCCTGCC
GGTGACGAGG CCGCGCCGCT GGCGGCGGTG TTCACGCTGT CGGGCAAGAA CGAGGCGGCG
TTGCGCCGGC AGGCCGAAAG CTACCGGCAG TTCGTCGAGG CCAGCCCGGA GCTGGACGTC
GAACGGTTGT GTTACACGAG CAACGTCGGC CGGGCTCATT TCAGTCACCG CCAGGCCGGC
GTGGTCACGA GCCGAGCAGA CCTGGAAAAG CTGCTCGGCG CCGAGTCGCT ACAGCAGCAG
CCGGCGCTGG CGAAGATCCG GAAGGTCGGG TTTCTGTTCA GCGGCCAGGG GAGCCAGTAC
CCCGGTATGG GCGCCTACCT CTACCGGCGG TTCGCCGAGT TCCGTCGTCA GGTCGACGAG
TGCGACCGGC TGTTCGCGCC GCACCTCGAC ACCTCGGTCC GCGCGCTGCT GCTGGGGGAG
AGCGGGCACG ACGAGCTGAT CGACCAGACC CGCTACACCC AACCGGCACT GTTCACGTTG
GAGTACGCGC TCGCCCGGCT CTGGACGTCC TGGAACGTCC GGCCGAACGT GCTGATCGGT
CACAGCATCG GAGAGGTGGC CGCCGCGGCG GTCGCCGAGT TGTTCGACCT CCCCGACGCG
GTCCTGCTGG TGGCCAGCCG TGCCAGGTTG ATGCAGTCGG TCCGGACGCC GGGCGGGATG
GCCGCGGTGG GCGCCGCCCC CGAGCTGGTG GCGCCGATGC TCGACGAGTT CCCCGGCCTG
GCGCTGGCCG CGGTCAACGC CCCCGGTCAG TGCGTCGTCT CCGGCGCCCG TGACGAGCTC
GCGGCGTTGG GCGAGCGGCT TCGGTTGCGC GGGTTGTCGG TGGAGCCGCT CGCCGTGTCG
CACGCCTTCC ACTCGCCGCT GATGGCCGAG GTCGCCGCCG AGCTCCGCGC GGCCGTCGCC
GGCGTCACGT TTCGCGAACC GTCGATCCCG CTCGTCTCGA ACGTCACCGG CCAGCTCGCC
CGGTTCGCGG AGATCGGTAC GCCGGACTAC TGGGTGCGCC ACGTCCGCGA ACCCGTGCTG
TTCATGGCCG GCCTGCGAGC GGTCGAGAAG CGCGGCCAGC ACGCCTTCGT CGAGATCGGC
CCGTCGACCT CGCTGACCGC CCTGGCGCGG CAGTGCCTGC CGGCCGACGA CCACCGCTGG
ATCGCCAGCC TCCGTCGCCG CGACCCGTCC GCCCACACCG TCCTGCACGG GCTCGCCGAG
TTGTACGCCG GCGGCGTGGC CGTCTCCTGG GCGGGAGTGC ACGCGGGCCG GACGTTGTCG
AAGATCGAAC TGCCCGGCTA CGCCTTCGCC CGGAAAAGGT ACTGGCTTCC CGTCGACGGC
GGGCCGAGCA GTCCTGGGTC GGCGGCCGGC TACCACCCGC TGCTCGGGCA GGAGCAGTCT
CCCGGCGAGC GCCGGCCCGG CGAGGCGCGC GAGTTCGTCG CCGAGTACTC GCCCGAACGG
CCCGCCTACC TTGCCGACCA CCCAGGGCCG GACGGCGAGG TGGTGGTGCC CGTCGCGGCC
TACGTCGAAC TGCTGCTCGC CCTCCAGGAC GCGGCCTTCG GGCATACCCG GGGCACGATC
AGTGACCTAC GGGTCCAGGA TCCGCTTCGG CTTGTCGGAG AGGGCCGCGT GCAGGTGCGG
ACCCGGCTGT CCGCCCGCGC CGACGGCCGC TTCGACGTGG TGGTGTCCAG CGGTCGGCCG
GACCAGCCGA CACCGCACGC GAGCGCGGTG CTCGCCGAGG AAGACGTCGA GTCCGCCGTC
CTGTCCGGCG TCGGTGCGGC GCTGCGGGAC CGGGCGCGGT CCCCGGGCGC CGTCGAAGAC
CGAGTGAGCC ACGAGGACCT CTACACCGAC CTCGCTGCCG TCGGGCGGGA GTACGGCGAG
CGCTTCCGGC TGGTCGCCGA GGTGCGCGGG CACAGCGGCG GCTTGCTGAC CGGCACCGTC
GCGGGGCGGC CGGCCACCGT GGTCGAACAC CTCCCGCCCG AGCTGCTCGA GTGCGCGCTG
CAGGCGGTCG CGGCGCTGCA CCCGGACGGC CCGGTGCTCG TATCGAGCGG CGTCGGCCGG
CTGCGGCTGT TCCGCAAGCC GCGAGCCGAG CACCTGCGCG TCGTGGCGCG GCTGCGCAGG
GCGGCGGTTG ACCGGTGGGT GGCCGACGTC GTGCTGTTCG AGGACGAGGT AGTGGTCGCG
GAGCTGCGCG AGGTGCGGTT GGGCGCTGCC GGGGTGTCGT ACCCGTTCCT GCACCGGTTG
GCCTGGCTGC GCCGTACGGC TCCGCCCCAG TTGGCGTCGC AGCCGCGACA CGTGCTGGTG
CTGGGCCGTG AGCCTGGCGA CTCGGCGGCC GGGCTCGCCG AACCCGCCGC GGCGGACGGC
GTGCGGACGA CGTTCCTCGC CGCTGCCGAG GGGGCGGGCG GTGTGGCAGG CGCTCTCGGC
GACCCGACGG TGACCGACCT GTGCTGGTTC TGGCGGTCTG CCCAAGGCGA GACCACGGCG
GCCAGCCTGC GCAACGAGTG CGAGCTGAAC TACCGGGCGC TGCTCGAGCT GGTCGCGGTG
CTGAACGCGG CGGACCTACC CCGGCCGCCT CGGCTGTGGC TGGTCACGAA GCGAGCCCAG
TGGCTGCCCG GCGACGAGGT AGGCACCGGT GAGCAGCTAG CCGCGGCGAC CTTGTGGGGG
TTTGGCCACG CGCTGCTGAA CGAGTACCCC CGCTACCGAG CCTGCCTCGT CGATGTGGCG
GGGGACGCGG ACCTGGCGGG CCTGGTCGAG CAGTGGCAGG CGCCCGACAC CGGTGAGTTC
CAGCTCGCCT ACCGGCGCGG GCGGCGTTAC GTCCGCCGGC TGCTGGCCGG CGAGCGCACC
CCGGACTGGG ACGGCGGGTT CGTGGTGCGG CCGGCCGCAG CCGGCGTCGT CGACGAGATG
ACGGTCGAGC CCGTGGACGA GCCGGCGCCG GTCGGCGACC AGGTCCAGGT CCGAGTCCGG
CTGGCGGCCC TGGACGGCTC GCCGCCGGCA GTGGCACCGC CGGCCGCGAC AGACACCTCG
GGCGAGACGA CGGACACCTC GGCCGCGACG GACACCCCGG GCGCGACGAC GGACACCCCG
GCCGTGGCGG AGCCGCCGGA CGGGCGGGAC GCGCTGCCGG CCGTCGCTGC CGTCGGCTGG
GCGGGCACCG TGGTCGCGGC CGGCGACCAG GCCGCCTTCG CCGTCGGGGA CCACGTCGCG
GTGACCGGCC CGGGAACCGT GAGTCGAACG GTGACCGTCC CGTCGAGCGC CGCGACCGCC
CTCTCGCACG GGAACGACCT CGCGGACGCA CTCGCGGCCC TGGTCCGAGC AGCGACGCCG
GGGGCGCGCG CGGCCGGGCC GGCCGGTACG CGCCAGATCG ACCGGGTCGA ACTGTACGAC
CTCGACGAGG TGCCGGAAGC GCTGGCCGCG GCCCGCCGGG ACACCGGGCC GGTGGTGGCC
TTGGTTCAGG TCGGTCCGGA GCCCGTTGCG GCTGTCGAGC CCGCCGTAGC CGTCCGGGTC
AGCGAGGCGG ACGCCCCACC GGCTCCGGTG CGGCCAGATC GCGTCTACCT GGTGACCGGC
GGGTTGGGCG GGCTCGGCCT GGTCACCGCG CAGAAGCTGG TCGACCTGGG CGCCAAGCGA
TTGGTCCTGA CGAGTCGCAG CGGCCGGCCG ACGCCCGAGG CGACCGACGT ACTGGCGGCG
TTGTCCGACC GGGCCGTGGT GTCCGTCGAG CGCGCCGACG TGGGTTCGGC ACCGGACGTC
GAACGCCTGG TGGAACTGGT GCGTCAGACC GGCCTCCCGC TGGGCGGGAT CGTGCACGCC
GCCGGGGTCG CCGGCAAGTC GCTGATCGGG AACCTGACCT GGGAGGCCGT CGACGAGCAG
CTTCGGGCCC AGGTCTACGG GGGGTGGCTG CTGCACGAGG CCAGCCTCGG CTTCCCGGAG
CTGGACTTCT TCCTGGCCCA CTCCTCCGTC GCCGCGGTGG TGGGCGGTGC CACCCAAGCT
CACTACGCCG CGGCGTTCGC CTTCCTGGAC GGGCTGGTGG CCTGGCGGGC GCGGCAGGGC
CTGCCTGCTC TCGCGGTGAA CTGGGGAGCC TGGGGGCGGG TCGGGATGTC GGCCCGGCTC
GACGAGAACC TGGGTCGGGA GTTGCGGCGC AGCGGCATCC GGCTGTTCTC GCCGGGGCGG
GCGCTGCGAA CCCTGCCCTC GCTGTTGACG GGGGCGGCGC CCAACCTGGT CGCCGGCGTG
TTCGACTGGG ACCGGTACAT CGCGCCGAGC CTGCTCGATA ACGCGCTGTA CTCGCGGGTC
GCCCGTGGCC GGGTGGACCT CGGCGGTGGC TTCGACGTGG CCGCGCTGCT GGCCAAGTCG
CCCGCCGACC GTTCGGCCGC GCTCGCCGAA CTCGTGCTCG ACCTCGTCGG TGCCGCGCTG
CACCTCGACG ACGGCGAACG GGTGGATCCC GCGGCCGAGT TCGTCGCGCT CGGCCTGGAC
TCGCTCATGG CGCTCGAGGT GAAAACCAGC CTGGAGGGCA GCCTGCGGCT TCCGCTGCCG
GCCACCTTGA CCTTCGACCA TCCGTCGCCG CGACAACTGG CCGAGTTCCT CGACGGCCAG
CTCGCTGCCA CCACCACTGC CACTTCCGCC ACTGCCACTT CCGCCACCGC CACTTCCGCC
ACCGCCACTT CCGCCACCGC CACTACCCCT ACCACTACCG CCACGGACCC GGACGCCCGG
GGCCAAGCCG CCCAACGGAG TTGA
 
Protein sequence
MSRESVTVDH LRPTPGSPES DGVLLTRLLR EKYEPVAVVG MGLRLPGGSE SPDEFEDFLR 
AGRSGVGPLP KDRWDPDLFV PDDPSEKGKI QTTGGGFLDR IDLFDAPFFN ISPKEAQYVD
PQQRLLLETA WHALEHANID PTPLRRGNGG VYVGASSIDY ALELDGLPYE ALDGLLASGI
SMFPLSGRLS YFLGWRGPSV SVDTACSSSL SALHLAVEGL RRRECDLALC GGVNALHHPR
IMVMFSHGQM LAPDGQCKTF DDAADGYVRA EGCGVLVLKR LSDAERDGDT VLALIRGTAI
GQDGDSAGLT VPNGPAQELV IRRAIAAARL EPRDIQYVEA HGTGTPIGDP IELGAVNDVF
SASHTHDDPL LVGSVKTNIG HTEPLSGLVG VIKTVLQLRA GTIFPHLNFH QPSSRIPWDV
YPVRVPTELE PWPGPVRRAV VNSFGFAGTI AAAVVEQAPR PDAPAAGELA GDKLAGDEPA
GDEAAPLAAV FTLSGKNEAA LRRQAESYRQ FVEASPELDV ERLCYTSNVG RAHFSHRQAG
VVTSRADLEK LLGAESLQQQ PALAKIRKVG FLFSGQGSQY PGMGAYLYRR FAEFRRQVDE
CDRLFAPHLD TSVRALLLGE SGHDELIDQT RYTQPALFTL EYALARLWTS WNVRPNVLIG
HSIGEVAAAA VAELFDLPDA VLLVASRARL MQSVRTPGGM AAVGAAPELV APMLDEFPGL
ALAAVNAPGQ CVVSGARDEL AALGERLRLR GLSVEPLAVS HAFHSPLMAE VAAELRAAVA
GVTFREPSIP LVSNVTGQLA RFAEIGTPDY WVRHVREPVL FMAGLRAVEK RGQHAFVEIG
PSTSLTALAR QCLPADDHRW IASLRRRDPS AHTVLHGLAE LYAGGVAVSW AGVHAGRTLS
KIELPGYAFA RKRYWLPVDG GPSSPGSAAG YHPLLGQEQS PGERRPGEAR EFVAEYSPER
PAYLADHPGP DGEVVVPVAA YVELLLALQD AAFGHTRGTI SDLRVQDPLR LVGEGRVQVR
TRLSARADGR FDVVVSSGRP DQPTPHASAV LAEEDVESAV LSGVGAALRD RARSPGAVED
RVSHEDLYTD LAAVGREYGE RFRLVAEVRG HSGGLLTGTV AGRPATVVEH LPPELLECAL
QAVAALHPDG PVLVSSGVGR LRLFRKPRAE HLRVVARLRR AAVDRWVADV VLFEDEVVVA
ELREVRLGAA GVSYPFLHRL AWLRRTAPPQ LASQPRHVLV LGREPGDSAA GLAEPAAADG
VRTTFLAAAE GAGGVAGALG DPTVTDLCWF WRSAQGETTA ASLRNECELN YRALLELVAV
LNAADLPRPP RLWLVTKRAQ WLPGDEVGTG EQLAAATLWG FGHALLNEYP RYRACLVDVA
GDADLAGLVE QWQAPDTGEF QLAYRRGRRY VRRLLAGERT PDWDGGFVVR PAAAGVVDEM
TVEPVDEPAP VGDQVQVRVR LAALDGSPPA VAPPAATDTS GETTDTSAAT DTPGATTDTP
AVAEPPDGRD ALPAVAAVGW AGTVVAAGDQ AAFAVGDHVA VTGPGTVSRT VTVPSSAATA
LSHGNDLADA LAALVRAATP GARAAGPAGT RQIDRVELYD LDEVPEALAA ARRDTGPVVA
LVQVGPEPVA AVEPAVAVRV SEADAPPAPV RPDRVYLVTG GLGGLGLVTA QKLVDLGAKR
LVLTSRSGRP TPEATDVLAA LSDRAVVSVE RADVGSAPDV ERLVELVRQT GLPLGGIVHA
AGVAGKSLIG NLTWEAVDEQ LRAQVYGGWL LHEASLGFPE LDFFLAHSSV AAVVGGATQA
HYAAAFAFLD GLVAWRARQG LPALAVNWGA WGRVGMSARL DENLGRELRR SGIRLFSPGR
ALRTLPSLLT GAAPNLVAGV FDWDRYIAPS LLDNALYSRV ARGRVDLGGG FDVAALLAKS
PADRSAALAE LVLDLVGAAL HLDDGERVDP AAEFVALGLD SLMALEVKTS LEGSLRLPLP
ATLTFDHPSP RQLAEFLDGQ LAATTTATSA TATSATATSA TATSATATTP TTTATDPDAR
GQAAQRS