Gene Franean1_5614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5614 
Symbol 
ID5673941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6811590 
End bp6816791 
Gene Length5202 bp 
Protein Length1733 aa 
Translation table11 
GC content76% 
IMG OID641244467 
ProductBeta-ketoacyl synthase 
Protein accessionYP_001509871 
Protein GI158317363 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.30627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0396665 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACA CATCCCGCCA CAGCGGCGGC GGCCCCCGCG GCGCGGGCGG GCAGGAGATC 
GCGATCGTCG GGATGTCCGC GCTGTTCCCC GGCGCGGGCG ACCTGGACAC CTACTGGCAC
AACATCGTCG GTGGCGTGGA CGCGATCAGC GACGTCCCAC CGGGCCGCTG GGACCTTGAC
GAGTACTACG CCGGGCCGGC CACGGACGGC GACCGGCAGC CCGGCGGCGA CCGGCGCGAC
GGCGGCCGGC GGCCGGACGG CGCGGGCTTC TACTGCCGGC GCGGCGGGTT TGTCGACGAC
CTGGCGACGT TCGACCCGGC CCGGTTCGGC ATCGTCCCGG TCTCGGTCGA CTGGGCGGAG
CCCGACCAGC TGCTCGCGCT GCGGCTCGCC GCCGAGGCCA TGGACGACGC CGGCGGGGCC
GAGACGCTCG GTGACCGCGG CCGCGTCGGC GTCGTGGTGG GCCGCGGCGG CTACGTCGGG
CCGGGTGTCG CCCGGCTCGA GCAGCGGGTC CGCACCTCGC ACCAACTGGC GACCACGCTG
CGCGAGGTGC TGCCGGACGT CCCGGACGCG GCCGTCGACC AGGTCGTCGA CGCGTTCCTG
GCCCGGCTGG GGCCGCAGCG GCCGGAGGCG TCCATCGGCC TGGTGCCGAA CCTGGCGGCC
TCGCGGATCG CGAACCGTCT CGACCTGCAC GGCCCGGCGT ACACGATCGA CGCGGCCTGC
GCGTCGTCCC TGGTCGCGGT GGACATCGCG GTGCGGGAGC TGGCCGGCGG CCGGGCCGGC
GCGATGATCG TCGGCGGCGT GCACATCGTG CACGAGGTCT CGTTCTGGAG CCTGTTCACG
CTGCTGCGGG CGCTCTCGCC CACCCAGCGG ATCCGGCCGT TCGACCGCCG TGCCGACGGC
CTGCTCATGG GCGAGGGCGT CGGGATGATC GTCCTGAAGC GGCTGGCCGA CGCCCGCCGC
GACGGCGACC GGGTCTACGC CGTGCTGCGT GGCGCCGGGA CGTCCAGCGA CGGGCGCACC
GCGAGCCTGA TGGCACCCGC GTCCTCCGGG CAGATCCTGG CGATCGAGCG GGCCTGGGCG
GACGCCGGCC TCGACCCGGC GGCGCCCGGC GCGGTCGGGC TGATCGAGGC GCACGGAACC
GCGACTCCGG CCGGCGACGG GACCGAGCTC TCGACGCTGG CGAAGGTCTT CGGCGGCGAG
GTCGGGGCGA GCCCCGTCGG CCTCGGGTCG GTGAAGTCGA TGATCGGGCA CGCGATGCCC
GCCGCCGGCA TGGCCGGGCT GATCAAGGCC GCGCTGGCGC TGCACCACCG CACCCTGCCG
CCGACGCTGC ACTGCGACGA GCCGCACCCG CTGTTCGACG GGAGCCGTTT CGCCCCGGTC
CGGACGGCGG TCGAGTGGGA GCCGTCCGGT TCCGCGCCGC GGCGGGCGGG GGTGAACGCC
TTCGGGTTCG GCGGCGTGAA TGCGCACATC GTCCTGGAGG AGTGCCTCGA CGACCACGCC
GGCGCCGTCA ACCCGGTGAG CTCCGTCGGT TCCGTCGGTT CCGCCGTCGC CGCCGCCGGG
ACGGGTGCCG TGCTCGGGCC CGACCCGGAG GCCGAGCGGG TCCTGCTGTT CGCGGGCTCG
TCGGCGGCCG AGATCGTCGC CCAGCTCGAC GTCCCGGACG CCGACCTGCT GGGGCGCGAC
GACGCCGCGT CCCCGCCGGC GGGTGGGCCG TTCCGGCTGG CGCTGGTGGC GCCCACCCCG
CGTCGGCTCG CGCTGGCCCG TACGGTGGCG GCCCGCGGCA CCGCCTGGCG CGGGCGGAAC
GATCTCTGGT TCGTCCCCGA GCCCCTGCTC GGCGACCGCC CGGCGTCCGA CGCCGCGGGC
GGACCGCGGC TGGCGTTCCT GTTCTGCGGT CTGGAGGACA AGTTCGCCCC GCGGATCGAC
GACGTGTGCG ACCACTTCGG CCTGCCGCTT CCGGAGGTGG GCGACACCGC CGAGCTCGGC
AGCCACGGCG TCGCCAGCGT CCAGGTGGGA CGGGTCCTCG ACACGGCGCT GCATCGCCTC
GGGGTGGTCC CCGACCTGGT CGCCGGGCAC AGCGTGGGCG AGTGGAACGC GATGCTCTCG
GCCGGGCTGA TGAGCGACGA CTACGCCGAC GAGTTCATCG AGGTCTTCGA CCCGGCCTCG
CTCGAGGTGC CCGGTGTCGT GTTCGGTGCG CTGGGCTGCG GCGCGGAGGT CGCCGCCGAG
GTGATCGCCG GACTGCCCGA GATCGTGGTC TCGCACGACA ACTGCCCGCA CCAGTCGATC
ATCTGCGGGC GGGTGGACAG CGTCGAGACC GCGCTCGCCC GCCTGCGCGC CCGCGGGGTG
CTGGGGCAGA CGCTGCCGTT CCGCTCGGGC TTCCACTCCC CGTTCCTGCG GCCGCACCTC
AAGCGCCTGA AGGACGGCCT TTACAACACC CCTTTGCACG CGGCGCGGGT GCCTGTGTGG
TCCGCCACCA CGGTGGCTCC GTATCCATCC GGACACGACG ACATCCGGGA GCTGGCCGTC
CGGCACCTGC TCGAGCCGGT GCGTTTCCGG GAGCTGACGC GGCGCCTGCA CTCCGCGGGC
GTCCGGGTGT TCGTACAGGT CGGGATGGGC AGCGTCGCCG GATTCGTCGA CGACACCCTC
AACGGGCCGG GGGACGAGCA CGCGTCGCTG GTCACGAACA CCGTCAAGCG GTCCGGCCTC
GACCAGCTCC GGCGGGTCGC GGTCGCGCTG TGGGCCGAGG GCGCCGCGCC GCGCCTGGAC
GTCCTGCCCT GCCACACCAG ACCAGAGCCC GCCACCGTCC CGGCCAGGGA GCCGCGGGCG
CCCCGGGTGC CTCAGACGGC CGCGCCGGCT CGGCCCGGGC GCGCGGTGCG GCTGCGCCTG
GGCACGCCGC TCGTCCGGCT CGGGGCGGAC GCGCCGGACC TCGCGGCGTC GCTCCCGTCC
CGGATCAGCG GCGAGGCCGC GGGACGTCCC GTCGGCGAGA CGCCGGCCCA GGTGGCCGAC
GGGCTCGGCG GGCCCGCGCG GCATGCCGTC CTCGCCGAGC TGGGCGCGGC GGTCGCGGAC
GCGAGAGCGG TGATGTCCAC CGTCACCGAC CGCTGGGTCG CGACGCGGGC GTCCGGTCCC
CCCTCGACGG CGCCGGCAAC GGCCTCCGCG CCCGTTGCGC CGCCCGCCCC GGCGGCGAGT
GGGCCGGGTG GCGGGCGGAC CGTGCGGCGG GAGCTCTCGC TGGCGACGAT GCCCGAGGTC
GCCGACCACT GCTTCTACCG GCAGCCCCCG GGGTGGTCCG ACCCGTCCGA CCTGTTCCCG
GTCGTGCCGC TCACCGGCGT GCTTGAAATG ATCATAACTG AGGCGAGCGC GCTGATGCCC
GGGCGTGCGG TGGTGGCCGT CCGGGACGTG CGGGCCACCC GCTGGCTGGC GATCGAGCCG
CCGGTGCAGG TCACGATCAC CTGCGCGCCG CTGGGCCCGG ACGAGGTCCG GGTGGACGTC
GTCGGCTACA CGCGGGCGAC GGTCGTCTTC GCCGACGCCT ACCCGGCGCC GCCGCCGGTG
ACCGACACGG CCACCGGGTT CGTCCCGGCG CCGGGCGCGG CGGGGGAGGT CGTGATCCCC
GGCGAGTCGC CGTCCCCGCA CACCGGCCGC GCGATCTACG ACGACCGGCT GCTCTTCCAC
GGGCCCGGCT ACCAGGGCGT CGAGTCCGTC GACGGGATGG CCCCGACAGG GCTGCGCGGC
TGGCTGCGGG TGACGTCCGC CAGCGGCGCG CTGCTCGACA ACGCCGGCCA GTTCTTCGGC
TACTGGGGCA TGCAGTACCT GCCGTCGGAC TGGCTGCTCT TCCCGGCCTC GGTCACCTCG
ATGCAGTTCT TCGGCCCACC GCCGCCGGTC GGCGCGCGGA TGTCCTACCT GGGCCGGATC
CGGGACGTGA CAGACCGCAC CGCCACCGCG GACATGGAGA TCCGGGACGC CGGCGGACGG
CTGTGGGGGC ACATCCAGGG CTGGACCGAC CGACGGTTCA CCGAGGACGA CGTCCTGTGG
TCGATGGCAC TGGCGCCGGA GGCGAACACG CAGAGCCACC TGACGCCGGA CGGCTGGGTG
GTGGTCACCG AGCACTGGCG TGACCCGGCG TCCCGGGAGC TGTCGCTGCG CCACTACCTC
GACGCGGACG AGCGGACGCG GCTGGCCCGC CACAATCCGC TGGCTGCCCG GTCCTGGTTG
CTGGGGCGGA TTGCGGCCAA GGACGCCGTG CGGCGCCGCT GGTGGGAGCG CGGGGCCGGC
GCGGTGTGGC CGATCGAGGT CGGCGTGACG GATCTGGTGT CCGGGCGGCT GGTCGTCGGC
ACGGTGCCCA CCCGACCGGG CCTGCCCGAG CTCACCCGCC CCGAGGTGAG CATCGCCCAT
CGCCCGGAGA TCGCCGTCGC CCTCGCGGTG GACGACCCGC GGGACGTCGG CGGCGTGGGG
ATCGCTCTGG AGCGCATCGA ACGGCGCGGG CCCGACGCGG AGGTGGCCGT GCTCGCCGAA
TCCGAGCTGA GCCTGCTCGA CGAACTGGCC GGCGCGGATC CGGATGTCCG CGCGGTCTGG
CTCGCGCGGT TCGCCGCGGC GAAGCAGGCC GTCGCGAAGG CGTACGGGAC CGGGCCGGCC
GGCGATCAGC GGCGGTTCGT CGTCGGCCGG CCGCGACCGG GCGGATCCGC CGTCCTGGCG
GCGGTGCCGC GCGGGCCGGG CGGGGTGGTC GCCCACCCGG TCGGCGCGCC ACCCGAGGCG
ACCGGCCTGC CGGCCCCGAC GCTGGTTCCA GTCACCGTCT CCGCCCCGGA CGGCGGGCCG
GGGGCGGCAC CGGCCGCGGC CGGGCACGCC ACCCCGGCGG AAGTGGGAGC GGGGGCTAGC
GAGACGTGGT GGGTGGCACT GCGCACCATG GACACCACCG GCCGGCCCCG TCCCGACACC
GAACCGAATG ATCACACTGC CGCATCCGGC ACCGCCACAC CCGGCGGTCA CACCGCCGGA
ACCGACGATC GCGTCGCCGC ACCCGGCGGC GCACCCGCCG GCGGCTACAT CGTGGCCTGG
ACCTCGCCAC GGATCGAGCG ATCCGCGAGG CCTCCGGGCG GCGCGGCCGG GAGCCCGCCC
ACGCGGCGGC CCGGCGGTCA ACCCACCAGC CCGGCCGGTG ACCGGCCGGG CGCACGGACC
GGTACGCAGC CCGATCAGCA GAAGGAGCAG ACCCAGCGAT GA
 
Protein sequence
MGDTSRHSGG GPRGAGGQEI AIVGMSALFP GAGDLDTYWH NIVGGVDAIS DVPPGRWDLD 
EYYAGPATDG DRQPGGDRRD GGRRPDGAGF YCRRGGFVDD LATFDPARFG IVPVSVDWAE
PDQLLALRLA AEAMDDAGGA ETLGDRGRVG VVVGRGGYVG PGVARLEQRV RTSHQLATTL
REVLPDVPDA AVDQVVDAFL ARLGPQRPEA SIGLVPNLAA SRIANRLDLH GPAYTIDAAC
ASSLVAVDIA VRELAGGRAG AMIVGGVHIV HEVSFWSLFT LLRALSPTQR IRPFDRRADG
LLMGEGVGMI VLKRLADARR DGDRVYAVLR GAGTSSDGRT ASLMAPASSG QILAIERAWA
DAGLDPAAPG AVGLIEAHGT ATPAGDGTEL STLAKVFGGE VGASPVGLGS VKSMIGHAMP
AAGMAGLIKA ALALHHRTLP PTLHCDEPHP LFDGSRFAPV RTAVEWEPSG SAPRRAGVNA
FGFGGVNAHI VLEECLDDHA GAVNPVSSVG SVGSAVAAAG TGAVLGPDPE AERVLLFAGS
SAAEIVAQLD VPDADLLGRD DAASPPAGGP FRLALVAPTP RRLALARTVA ARGTAWRGRN
DLWFVPEPLL GDRPASDAAG GPRLAFLFCG LEDKFAPRID DVCDHFGLPL PEVGDTAELG
SHGVASVQVG RVLDTALHRL GVVPDLVAGH SVGEWNAMLS AGLMSDDYAD EFIEVFDPAS
LEVPGVVFGA LGCGAEVAAE VIAGLPEIVV SHDNCPHQSI ICGRVDSVET ALARLRARGV
LGQTLPFRSG FHSPFLRPHL KRLKDGLYNT PLHAARVPVW SATTVAPYPS GHDDIRELAV
RHLLEPVRFR ELTRRLHSAG VRVFVQVGMG SVAGFVDDTL NGPGDEHASL VTNTVKRSGL
DQLRRVAVAL WAEGAAPRLD VLPCHTRPEP ATVPAREPRA PRVPQTAAPA RPGRAVRLRL
GTPLVRLGAD APDLAASLPS RISGEAAGRP VGETPAQVAD GLGGPARHAV LAELGAAVAD
ARAVMSTVTD RWVATRASGP PSTAPATASA PVAPPAPAAS GPGGGRTVRR ELSLATMPEV
ADHCFYRQPP GWSDPSDLFP VVPLTGVLEM IITEASALMP GRAVVAVRDV RATRWLAIEP
PVQVTITCAP LGPDEVRVDV VGYTRATVVF ADAYPAPPPV TDTATGFVPA PGAAGEVVIP
GESPSPHTGR AIYDDRLLFH GPGYQGVESV DGMAPTGLRG WLRVTSASGA LLDNAGQFFG
YWGMQYLPSD WLLFPASVTS MQFFGPPPPV GARMSYLGRI RDVTDRTATA DMEIRDAGGR
LWGHIQGWTD RRFTEDDVLW SMALAPEANT QSHLTPDGWV VVTEHWRDPA SRELSLRHYL
DADERTRLAR HNPLAARSWL LGRIAAKDAV RRRWWERGAG AVWPIEVGVT DLVSGRLVVG
TVPTRPGLPE LTRPEVSIAH RPEIAVALAV DDPRDVGGVG IALERIERRG PDAEVAVLAE
SELSLLDELA GADPDVRAVW LARFAAAKQA VAKAYGTGPA GDQRRFVVGR PRPGGSAVLA
AVPRGPGGVV AHPVGAPPEA TGLPAPTLVP VTVSAPDGGP GAAPAAAGHA TPAEVGAGAS
ETWWVALRTM DTTGRPRPDT EPNDHTAASG TATPGGHTAG TDDRVAAPGG APAGGYIVAW
TSPRIERSAR PPGGAAGSPP TRRPGGQPTS PAGDRPGART GTQPDQQKEQ TQR