Gene Franean1_3954 details

Gene Information

Locus tag: Franean1_3954
Symbol:
ID: 5672315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4728974
End bp: 4735990
Gene Length: 7017 bp
Protein Length: 2338 aa
Translation table: 11
GC content: 76%
IMG OID: 641242833
Product: erythronolide synthase
Protein accession: YP_001508250
Protein GI: 158315742
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR02813] polyketide-type polyunsaturated fatty acid synthase PfaA
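
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4728974
end_bp = 4735990

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")     # compare with the Gene Length field
print(protein_length_aa, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (7017 bp) and Protein Length (2338 aa).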


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0557912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGAGA GCGGACCGCC TCGGCGACTG GCCGACACTC CCATCGCCAT AGTCGGAATG 
GCGGGCCTCT TCCCGCACGC CCACGACTTC CGGGAGTTCT GGCAGAACAT CGTCGACGCC
CGGGACTGCA TCGAGGACAT CCCGGCCAGT CGCTGGAACA TCGACGACTA CTACGACGCC
GACCCGACCG TGCCGGACCG CACGTACTCC CGGCGCGGCG GGTTCGTGCC CGACGTCGCC
TTCGACCCGC TGGAGTTCGG GCTGCCCCCG AACCAGCTCG AGGTCACCAG CACCCTGCAG
ACGCTGAGCC TCGTCGTCGC GCGCGACCTG CTGCTCGACG CCGGCGCCCA CTCGGCACAC
TCCGCTGACA GGGCGGACTG GTACGACGCC GACTCGACCG GCGTCGTGCT CGGTGTGACC
GGGCCCGTCC CGCTGATGCA CCCGCTCGCC GCCCGGCTGA CGACGCCGGT GCTGCGCGAG
GTCGTGCGGG CCTGCGGGCT GACCGAGGAC GACGCGCAGG CCATCGCCAC CCGCTACGCC
GAGGCGTTCG CGCCCTGGGA GGAGAACTCC TTCCCCGGTC TGCTGGCGAA CGTCACCGCC
GGGCGGATCG CGAACCGGCT CGGCCTCGGC GGAATGAACT CCACCGTCGA CGCCGCCTGC
GCGGCCTCGC TCTCCGCGGT GCGGATGGCG ATCGCCGAGC TGGTCGATGG CCGCGCAGAC
ATGATGATCG CCGGTGGGGC CGACACCGAG AACTCGATCT TCGGATACAT GTGCTTCAGC
AAGACCCAGG CGCTGTCGAA GTCCGACCGG ATCCGCCCGT TCGACGAGGG CGCCGACGGC
ACGCTCATCG GCGAGGGCAT CGGCATGCTC GCCCTGCGCC GCCTCGCCGA CGCCGAGCGA
GACGGCAACC GGATCTACGC GGTCATCCGC GGCATCGGCT CGGCCAGCGA CGGGCGCTCG
AAGAGCATCT ACGCGCCGCG GGCCGAGGGC CAGGAGCGCG CGCTGCGCCG CGCCTACGCC
GACGCCGACT GCTCACCCGC CTCGGTGGAG CTGTTCGAGG CGCACGCGAC CGGCACCGCG
GTCGGTGACC GCACCGAGCT GACCGCGCTG GACGCCGTCC TGCGCGAGGC CGCCGGGGAC
GAGGCCCGCT TCGCCGCGAT CGGCAGTGTC AAGTCGCAGA TCGGGCACAC CAAGGGCGCG
GCCGGCACCG CCAGCCTGAT GAAGCTGGCG CTCAGCCTGT ACCAGAAGAC GCTGCCACCG
ACGATCAACG TCGAACGGCC CAGCGGCCCG CTCGCCGACG ACAACACCCC GCTGTACGTC
AACACGCACA CCCGGCCGTG GGTCCGCGAC CCGCGGCGCC CGGTGCGGCG CGCGGCGGCG
TCCGCGATGG GCTTCGGCGG GACGAACTTC CATGTCGTGT TGGAGGAGCA CCGGGCTGAG
CGGCCCGCGA CCGGCCTGCT GCACCGCACC GCCCGCGCCT GGCTGTGGCA CGCGCCCGAC
CCGGCGGCGC TGCGCGCGGC CCTCGCCGCC GGCGAGCCGC CCGCCGACGG ACTGATCTCC
ACCGACGGAC TGATCCCCGC CGACCACGCC AGGGTCGGGT TCGTCACCCC GGCCGCCAAC
CCCGACGGCG GGGCCACCGA CGGCGCGGAC CTGCGCCGCA TCGCGCTCGA GCAGCTGGCC
GCGGCGCCCG ATGCCGAACA GTGGACGCAC CCGGCCGGCG TGTACTACCG CCGCCGGGCG
CTGCCCGACC CGCGGGTGGG CGCGCTGTTC GCCGGTCAGG GCAGCCAGTA CCTGGAGATG
GGTCTGGACG CCGCCCTCGG CGTCCCGACC GTCGCGCAGG CGCTCGACGA CGCGAACGCC
GTGTTCGCCG ACGACGACGC GCGGCTGGCC GCCGTGATGT ACCCGCCGCC CGTCCTGGAC
CCGCAGGTTC GCCAGGAGCA GGAGTCACGG CTGCGGGCCA CCCGGTACGC GCAGCCGGCG
ATCGGCGCGC TGTCCGCCGG GCAGTTCCGC TACCTGCGGG AGCTCGGCCT GGACTGCGCC
GGCTACCTCG GCCACAGCTT CGGCGAGCTG ACCGCCCTGT GGGCGGCCGG CGCCCTCGAC
GACACCGAGT TCTTCCGGCT GGCCCGGGCC CGCGGCGCCG CGATGGCGCC CCCGGACCCG
GCCACCGGCG GGAACGCCGG CGACGGGAAC GCCGCCGGCG GCGACCCGGG CACGATGGCC
GCCGTCCAGG CGAGCCGGGA GCAGATCACC GAGATCCTCA CCGACTTCCC CGACGTCGTC
GTGTGCAACA ACAACGCCCC TGACCAGGTG GTCGTCGGCG GGGGGACGCA GGCCGTGGAG
GCGGTGGTCG AGGAGCTGGC CCGGCGCGGG ATGACCGGCC GGCTGCTGCC CGTCTCGGCC
GCGTTCCACA CCCGCTACGT CGCCCACGCG GTCGAGCGGT TCGCCCAGGA CCTGGCGCAG
GCCCGCATCG GCGCCCCGGC CGCTCCGGTG TACGCGAACA CGGCGGGCGC CGTCTACGGC
CCCGACGCCG ACGCGAACCG GGCCGTGCTC GCCGGCCAGC TGCTCGCCCC GGTCGACTTC
GTCGCCGGGG TCACGGCGAT GCGCGCCGCC GGGTGCAACA TCTTCGTCGA GTTCGGCCCC
AAACAGGTGC TGGCCCAGCT CACCCGGCGC ATCCTCGACG ACGCGCCGGT CACGGTCGTC
TCCACCGACG GCGGCCCGCT GCGTGACGGC GACGTCGTAC TCAAGCAGGC CGCGGTGCAG
CTCGCCGTGC TGGGCCTGCC GCTGGCGGAC ATCAACCGCC ACGCCGCGAC GGCCGGCGCC
GAGGGCGACG CCGGGCAGCC GCGGCGCGCG GCGATGACGG TCACGCTCAC CGGCGCCGAG
TACGTGCCGC AGTCCCGCCG GGCCCTGTAC CAGGCGGGTC TGACCGACGG CTTCCAGGTC
TCCGCGGTCG CGGCGGCCAT GAACGGCGGG CCGGCGTCCG TGGCCACGCC GCCCGCCGCC
CCGGCCGAGC CGGTTCCGGC CCCGCAGCCC GTACTCACCG CGGCTCCGCA GCCCGCACTC
GCCGCGGCCG CGCCGGCGGC GGCCGAGCTG CCGGCGGTTC CCGCGGTGGC CGGGTCACCC
GCGGCCGCCG GCCGGCCCGC AGGCTCCGCG CCGGTTCCGG CGACCGCGGC GCTCGGTGCG
GGTCCGGCCG GCGCCGGGGT CAGTGAGGCC GTCGCCCTGC ACCTCGACCT GCACGGGCGC
TACCTGGACG GCCAGCTCCG GGTCACCGAG GAGCTGGTGG GCCTGCTGCG CGACGGCGCC
CAGAACGGTG ACCAGGCCGG CTGGGTGCCC GCCGCGGTCG AGCAGGTCCG GGACCAGAGC
CTCGCGGCGG GTCGGGCACA CGTCCACGCC AACGAGGTCC TCGCCGCGCT GGCCGGCCTC
GAGCTCGCGG CAGGCGGGCA TGCCCCGGCA GCGCCGGCCG CGGGTGCGCC CGCCCGGGTG
GGGCTCGCCC CCCGTTCCAC CAGCGCCGCC GGGCCGGCCG CCCTCGCACC GCCGCCCGCC
GCGCCCACTG TGCCCGCCCC GGTGATCGCC GACGCGCCCG CGATGGTGCC CGTCGGAGCC
GCCGACCTCG TCGCAGTGCC GGCGCAGCCG ACGGGCAACG GCCACCACCC GCCCGTGCCC
GGCGGCCACC CCGCCGGTTC ACCCGCCACC GGCACTCCCG CCACTGGGGC ACCCGCGGCG
GCCGGCAGTG GGCCCGACGC GGACTCGGTG CGCACGGCCC TGCTCGATGT CGTCGCCGAC
CGCACCGGCT ACCCCGCCGA AATGATCGAC ACCGGCATGG ACCTCGAAGC CGACCTCGGC
GTCGACTCCA TCAAACGAGT CCAAATCCTC GGCGCCCTCC AGGAGCACTT CCCCACCCTC
CCCAGCGCCG GCCCCGAAAC CCTCGCCGAA ATGCGCACCC TCAACCACAT CACCGACTAC
GTCCTCACCT CGCTCGCCGC CGGCGCACCC CCAGCCGCAT CGACGCCACC GAACGGCACG
CTCAACGGCA GCCCCGCCGC AGCCCCCGGG GTCGACCCGG ACACCGTCCG CACCGCGCTG
CTCGGCGTCG TCGCCGACCG CACCGGCTAC CCCGCCGAAA TGATCGACAC CGGCATGGAC
CTCGAAGCCG ACCTCGGCGT CGACTCCATC AAACGAGTCC AAATCCTCGG CGCCCTCCAG
GAGCACTTCC CCACCCTCCC CAGCGCCGGC CCCGAAACCC TCGCCGAAAT GCGCACCCTC
AACCACATCA CCGACTACGT CCTCACCTCC CTGGGTCCGG TGAACCCGGC CCCGGACGGC
GCCCCGAACG GCACCACCCC GAAAGGAGAT CAGCCGAAAG GAGGTGAGCC GACCGGAACG
GCCGGCCTGA ACGGTCACCA CGCCGCCGGC CACGGCGATG ACGCCGTACT GCCCCACCGG
CCGGTCGAGC TGGTGCCGGC GGCGCCGGTC GACATCCTGG ACGCCACCCC GTTCGGCGCC
GACCCGGTCG CCGTGCTCAT CGACGCGACC GGCCACGACG CGCCGGCGCC TGAGCTGACG
GCACTCGCCG ACGGGCTGAC CGCCCGCGGG TTCGCCGTCC GCACCGTGCG GCTGCCCGGC
CACGGCGGGA CGGCCGGCGA GAGCGGGACG GCCGGCGACG GCGCCGACCA GGCCGACCCG
CTCGACGGGT GGGACGCGGC CGAGGTCGAG CAGGCCCTGG CCGGCGCGCT CGGTGGCGAC
GGTGCCGTCG ACCTGTGCGC CCTGCTGATC GCCGCGAACG GCGGCGACGG CGCCTGGGCG
GCGGGGATCC GGCGGCTCGC CGACACCGTC CTGGTCGCCA AGCACGCCGC CGGGCCGCTG
TCGCGAGCCG CGGCGCGCGG CGGGCGGGCG GCGTTCTGCG CCGTCACCCG GCTGGACGGC
GGCCTGGGAC TGCGCGGCGA CGTGCCAGCG GTGGAGCGCC TCGTCGGCGG TGCGGCCGGG
GTGGTCAAGA CGCTGCTGCG CGAGGAGCCG GCGCTGTTCT GCCGCGCGTT GGACCTCCAC
CCCGCCCACG CCCCGGCCGC GGTCGCGGAG CTGGTTCTCA CCGAGCTGTG GGACGCGGCG
ACCGACCTGG CCGAGGTAGG CCTGGACGCG ACGGGAGCGC GCTGGACCGT CCGGCCCGGG
CCCTTCGGCG ACCGGGCCGC ACAGGACCAC CGGGCCGCAC AGGCCGAGGA CCGGGAGGCC
GGCACCGGCG CCGCCCTCCC GGTCCTCGGC CCCGACGACA TCATCGTCGT GACCGGCGGG
GCACGCGGGG TCACCGCGGA CTGCGTGCGT GCCCTCGCGG CGCGGACCCG GGCCCGGTTC
GTCCTGCTCG GCCGCACGGC GGCCGACAGC GACCCCGAGT GGGCGACCGG CGTCGCCGAC
GGCGGTCTGC TCGCCGCCGC CGCGGCGGCC CTGGCCGCGC AGGCCGGTCC TGGCGCGCCG
CGCCCGACGC CCCGCCAGGC CGAGGCGGCC CGCCGCGACA TCCTGGCCCG CAGGGAGATC
CGGGACACCC TGGCCGCGCT CACCGCGGCC GGCTCCGACG CCGAGTACCT GGCCGTGGAC
ATCGCCGACC GGGACGCGGT GCGCGCCGCG CTGGAGCCCT ACCGGGGCCG CGCCGCCGCC
CTGGTGCACG GCGCGGGCGC GCTGGCCGAC AGCGCGCTGA CCGCCAAGAC GCCCGACGCG
GTCCGCCGCG TGCTCACGCC CAAGCTGACC GGGCTGCGCA ACGTCCTCGA CGCGCTCGGC
GACGGGCCGG GCTCGCCCCT GCGCCACCTG GTGCTGTTCG CCTCGGTGGC CGGGCTGATG
GGCAACCCCG GCCAGGCCGA CTACGCCGCG GCGAACGAGG CGCTGGGCCG CCTCGCCGCG
GGTTGGAAGC ACGCCGAGCG CGGCCGGCAC GTCACCGCGA TCGACTGGGG TGCCTGGGAC
GGCGGCATGG TCGACGCCGA CCTGCGGGAG CTCTTCCGCT CCCGCGGCGT CGCCCTGATC
GACCGGACGA CGGGCGCCGA GGCGTTCGCC GAGCAGTTCG AGCCGGCCCG GGTGGACGAC
GTGTGCGTCC TGGTCGGCTC CGCCGAGGCG CTGACCGGTG GGTCGGACGT CCGCCCCGCC
CCGGCGCTGG TCGCCCGCCG CGAGCTGCGC GAGATCGCCG AGCACCCGGT CATCCGGGAC
CACCAGGTAG GCGGCTTCCC CGTCCTGCCG GCGACGTTCG GGCTCGGCTG GCTGGTCAAC
GTGGTCGAGC GGGCCCACCC GGGGCTGACG GTGGTCGAGG CCCGCGGGTT CGACGTCCAC
AAGGGCGTCG TGTTCGACGG GACGGGCGAG CTGGCGTTCC AGGTGCACAC CGCGGCGGCC
GAGCGCGAGG GCGCCGGCAC GAGTGGACGG GTCGTCGTGA AGGCGAGCGT GCGCAGCCCC
GGCGCGTTGC CGGCCGGGCT GTCGCGCTAC GCCGCCACGC TGGTGCTCGC CGCCGCCCCG
CCGGCTCCCG CCGAGGTCGA GGACTGGCCC CGGCTCGAAG CCGAGTGGAC GGCGCCCGGC
CCCGAGGACG GCCTGGAGAT CTACACCGGC GCGACGCTGT TCCACGGCCC GCTGCTGCAG
GGCATCCGAC GGATCGTGGC GCGCGGCGAC GACCGGCTGG TCGTCGAATG CCGCCTGGAC
GGCGCGGAGG TGGGCCGGGG CGCGTTCGCC GGCGCCCTGC ACGACCCGGT GCTGGCCGAC
CTGCTGCTGC ACGGCCCGTC CGTGCTCGGC CGGTGGCTGA CCGGCCAGGC GTGCCTGCCG
CTGGCCGTCG GCCGGATCGA CTACCGCGCG CCGCTGCCCG CCGGCGAGCC GTTCGCCGTG
GTGATCGACG GCGCGGCGGT GCGCGACACC GGGGTCACCA GCCGGGTGAC CGCGGTCGGG
CGCGACGGGC GGATCCTCGT CCGCCTGCAC GACGTGGCGA TGGTCGGGAC GCCCGACATG
GCCGCGAAGT TCGCCGAGGG CACCGCGAGC TGGCACAAGA AGGAGGTGTC CGCATGA
 
Protein sequence
MSESGPPRRL ADTPIAIVGM AGLFPHAHDF REFWQNIVDA RDCIEDIPAS RWNIDDYYDA 
DPTVPDRTYS RRGGFVPDVA FDPLEFGLPP NQLEVTSTLQ TLSLVVARDL LLDAGAHSAH
SADRADWYDA DSTGVVLGVT GPVPLMHPLA ARLTTPVLRE VVRACGLTED DAQAIATRYA
EAFAPWEENS FPGLLANVTA GRIANRLGLG GMNSTVDAAC AASLSAVRMA IAELVDGRAD
MMIAGGADTE NSIFGYMCFS KTQALSKSDR IRPFDEGADG TLIGEGIGML ALRRLADAER
DGNRIYAVIR GIGSASDGRS KSIYAPRAEG QERALRRAYA DADCSPASVE LFEAHATGTA
VGDRTELTAL DAVLREAAGD EARFAAIGSV KSQIGHTKGA AGTASLMKLA LSLYQKTLPP
TINVERPSGP LADDNTPLYV NTHTRPWVRD PRRPVRRAAA SAMGFGGTNF HVVLEEHRAE
RPATGLLHRT ARAWLWHAPD PAALRAALAA GEPPADGLIS TDGLIPADHA RVGFVTPAAN
PDGGATDGAD LRRIALEQLA AAPDAEQWTH PAGVYYRRRA LPDPRVGALF AGQGSQYLEM
GLDAALGVPT VAQALDDANA VFADDDARLA AVMYPPPVLD PQVRQEQESR LRATRYAQPA
IGALSAGQFR YLRELGLDCA GYLGHSFGEL TALWAAGALD DTEFFRLARA RGAAMAPPDP
ATGGNAGDGN AAGGDPGTMA AVQASREQIT EILTDFPDVV VCNNNAPDQV VVGGGTQAVE
AVVEELARRG MTGRLLPVSA AFHTRYVAHA VERFAQDLAQ ARIGAPAAPV YANTAGAVYG
PDADANRAVL AGQLLAPVDF VAGVTAMRAA GCNIFVEFGP KQVLAQLTRR ILDDAPVTVV
STDGGPLRDG DVVLKQAAVQ LAVLGLPLAD INRHAATAGA EGDAGQPRRA AMTVTLTGAE
YVPQSRRALY QAGLTDGFQV SAVAAAMNGG PASVATPPAA PAEPVPAPQP VLTAAPQPAL
AAAAPAAAEL PAVPAVAGSP AAAGRPAGSA PVPATAALGA GPAGAGVSEA VALHLDLHGR
YLDGQLRVTE ELVGLLRDGA QNGDQAGWVP AAVEQVRDQS LAAGRAHVHA NEVLAALAGL
ELAAGGHAPA APAAGAPARV GLAPRSTSAA GPAALAPPPA APTVPAPVIA DAPAMVPVGA
ADLVAVPAQP TGNGHHPPVP GGHPAGSPAT GTPATGAPAA AGSGPDADSV RTALLDVVAD
RTGYPAEMID TGMDLEADLG VDSIKRVQIL GALQEHFPTL PSAGPETLAE MRTLNHITDY
VLTSLAAGAP PAASTPPNGT LNGSPAAAPG VDPDTVRTAL LGVVADRTGY PAEMIDTGMD
LEADLGVDSI KRVQILGALQ EHFPTLPSAG PETLAEMRTL NHITDYVLTS LGPVNPAPDG
APNGTTPKGD QPKGGEPTGT AGLNGHHAAG HGDDAVLPHR PVELVPAAPV DILDATPFGA
DPVAVLIDAT GHDAPAPELT ALADGLTARG FAVRTVRLPG HGGTAGESGT AGDGADQADP
LDGWDAAEVE QALAGALGGD GAVDLCALLI AANGGDGAWA AGIRRLADTV LVAKHAAGPL
SRAAARGGRA AFCAVTRLDG GLGLRGDVPA VERLVGGAAG VVKTLLREEP ALFCRALDLH
PAHAPAAVAE LVLTELWDAA TDLAEVGLDA TGARWTVRPG PFGDRAAQDH RAAQAEDREA
GTGAALPVLG PDDIIVVTGG ARGVTADCVR ALAARTRARF VLLGRTAADS DPEWATGVAD
GGLLAAAAAA LAAQAGPGAP RPTPRQAEAA RRDILARREI RDTLAALTAA GSDAEYLAVD
IADRDAVRAA LEPYRGRAAA LVHGAGALAD SALTAKTPDA VRRVLTPKLT GLRNVLDALG
DGPGSPLRHL VLFASVAGLM GNPGQADYAA ANEALGRLAA GWKHAERGRH VTAIDWGAWD
GGMVDADLRE LFRSRGVALI DRTTGAEAFA EQFEPARVDD VCVLVGSAEA LTGGSDVRPA
PALVARRELR EIAEHPVIRD HQVGGFPVLP ATFGLGWLVN VVERAHPGLT VVEARGFDVH
KGVVFDGTGE LAFQVHTAAA EREGAGTSGR VVVKASVRSP GALPAGLSRY AATLVLAAAP
PAPAEVEDWP RLEAEWTAPG PEDGLEIYTG ATLFHGPLLQ GIRRIVARGD DRLVVECRLD
GAEVGRGAFA GALHDPVLAD LLLHGPSVLG RWLTGQACLP LAVGRIDYRA PLPAGEPFAV
VIDGAAVRDT GVTSRVTAVG RDGRILVRLH DVAMVGTPDM AAKFAEGTAS WHKKEVSA
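
The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first line of each sequence only. It builds the codon table by hand, and the helper names (`strip_blocks`, `translate`) and the reduced set of start codons are assumptions made for illustration, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (amino acid
# assignments of translation table 11 match the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def strip_blocks(text):
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    seq = strip_blocks(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence shown above.
first_gene_line = "GTGAGCGAGA GCGGACCGCC TCGGCGACTG GCCGACACTC CCATCGCCAT AGTCGGAATG"
first_protein_line = "MSESGPPRRL ADTPIAIVGM"

print(translate(first_gene_line) == strip_blocks(first_protein_line))
```

Passing the full gene sequence to `translate` in the same way should allow the complete protein sequence to be compared against the record.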