Gene Franean1_4268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4268 
Symbol 
ID5672623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5095447 
End bp5101521 
Gene Length6075 bp 
Protein Length2024 aa 
Translation table11 
GC content77% 
IMG OID641243141 
Producterythronolide synthase 
Protein accessionYP_001508558 
Protein GI158316050 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.35732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGCA CCCGCCCAGA GCAGGGCCAC TCCGGGGCGA CCGCGGACCA GGCCCGCGAC 
GCGTGGCGGG CCCGGCTCGC CGAGTTGTCG CCACCCGCCC AGATCGAGGC GCTGCTCGAC
CTCGTCGTCG ACCACGTGGT CGTGGTCACC GGCAGCCACG GCGGCTCGGC GGGCGTCGAC
CGCGGTGCCC CCTGGCGCGC GCTGGGGGTC TACCGGCGCA TCGCGGACGT CCTGCGAGCG
AAGCTCACCG CGGCGACCGG GGTGCGGCTG CCGGCGACGG TGCTCTTCGA GCGCCCCACA
CCGAAGGCGG TGGCCTCGTT CCTGCACGCC GAGATCCTGG GCGTGCGCGA GGAGGCAGCC
GACACCGCGC CCGCGCCCAC GCCGGCCGCA CCGGCAGAGG GGGGCGGGCG CGGCGCCGAT
CCCGTGGTCG TGGTGGGCAT GGGCTGCCGG CTGCCCGGCG CCGACTCGCC GGAGGCCCTG
TGGGAGCTGG TGGCGCAGGG CCGTGACGTC GTCGCCGGCC TGCCGACGGA CCGCGGCTGG
GACCTGGACG GCCTGTACCA CCCGGACCCC GAGCACCCGG GCACCGCCTA CACCCGCCAG
GGCGGCTTCC TCCCCGGGGT CAGCCTGTTC GACGCCGGCT TCTTCGGGAT CGGACCCCGC
GAGGCCACCG CGATGGACCC GCAGCAGCGG CTCATGCTCG AGGTCTCCTG GGAGGCGTTC
GAGCACGCGG GCATCGACCC GCACCGGCTG CGCGGCAGCC GCACCGGGGT GTTCACCGGT
GTCTCCCTGC AGGACTACGG CCCGCCCTGG CACGACGCCC CCGCCGAACT GCAGGGTCAC
CTGCTGACCG GCAACGCGCT GGGCGTGGTC GCCGGCCGGG TCTCCTACAC CTTCGGCTTC
GAGGGCCCCG CGCTGACGGT GGACACCCAG TGCTCCGCCT CGCTCGTCGC CGTCCACCTG
GCGACCGCGG CCCTGTACGC CGGGGAGTGC GACCTCGCCC TCGCCGGCGG CGTCACCGTG
ATGAGCACGC CGTCCATGCT GATGGAGTTC AGCCGCAAGC GCGGGCTGGC GCCCGACGGC
CGGTGCAAGG CGTTCTCCGC CGACGCCGAC GGCACCGGAT GGGCCGAGGG CGCCACCGTG
GTCGTCCTCA CCCGGCTCTC CCACGCCCGC GCGCGGCAGC TGCCGGTGCT GGCCGTCGTC
GCGGGCAGCG CCGTGAACCA GGACGGCGCC AGCAACGGCC TGACCGCCCC CAACGGCCTG
TCCCAGCAGC GGCTCATCCA GCAGGCCCTG GCCAACGCCG GCCTGAGCGC CGGCGACGTC
GACGCGGTCG AGGCGCACGG CACCGGCACC CCGCTCGGCG ACCCCATCGA GGCCCGCGCG
CTGCTCGCTA CCTACGGCGC CGGCCGGTCC CCGGACCGGC CGCTGTACCT GGGCTCCCTC
AAGTCCAACA TCGGCCATGC CCAGGCCGCC GCGGGCACCG CCGGCATGAT CAAGATGATC
GAGGCACTGC GCCACGAGAC GCTGCCCGCC AGCCTGCACA TCACCGTCCC CACCCCCCAC
GTGGACTGGG CGACCGGCGC CGTCACCCTC CTGACACAGC CCACGGCGTG GCCCGCCACC
GCGCGGCCGC GCCGCGCCGC CGTCTCCGCC TTCGGCGTCT CGGGCACCAA CGCCCACCTC
ATCATCGAGG AACCCCCCGG CCCGGCCCGC ATCCCCGCGC AGGCCACGCC TGCCGCCGCG
CCCGCCGGCG CCACCGAGGA CCGGGACGCC GTCCGCGGCG GGGACACTCC CGAGTGGGCG
GCCAACCTGC CCACAACGGT GATCCTCTCC GCCCGCGGTG ACGAGGCGCT GCGCGCGCAC
GCCGCCCGCC TCGCCGACCA CCTCGCCGGC CATCCGGACG TCACCGTCGA CGACGCCGCC
CACACCCTGG GCGCGCGGGC CCAGTTCCCC ACGCGCGCCG CCGTCGTCCT GGACGGGCCG
CCGACGGAAC GGGCGTCCCA CCTGGTTCGG GCGCTGACCA CGCTGGCCGC CGGGCGGGCG
GCGCCCGCTC TCATCCGCGG CAGCACCCCC GTGGACCTAG CCGGCGCCCG GATCGCGATG
GTGTTCACCG GCCAGGGCAG CCAGCTCCAC GGCATGGGAC GGCGGCTGCA CGCCGCCTAC
CCGGCCTTCG CCCGCGCGCT CGACGCCGCG TGCGACCGCT TCGAGCCGCA TCTCGACGAG
CCGCTGCGCG ACGTCATGTT CGCCGCCCCG GACACCCCGC CCGCCGTCCT GCTGGACCAG
ACGCTGTACA CCCAGTGCGC GCTCTTCGCC TACCAGAGCG CCCTGTTCCG GCTGTTGGAG
AGCTTCGGGG TCACCCCGCA CCTGCTGCTC GGCCACTCCG TGGGCGAGCT CACCGCGGCG
CACATCGCCG GCGTCTGGAC CCTGTCTGAC ACGGCCGCGC TGGTCGCCGC CCGCGGCCGT
CTCATGCAGA GCTGTGTGCC CGGCGCCATG GCCGCCGTCG GCGCGACCGA GGACGAGGTC
CGTCGGTGCC TCGACGAGTA CGGGGGCCGG GTCGACATCG CCGCCGTCAA CGGCCCCACC
GCCGTCGTCG TGGCCGGTGA CTCCGACGCC GTCGGCGCGC TCGCGGCGCA CTGGGCCGAG
CAGGGGCGCC CGACCCGCGG CCTCGCCGTC AGTCACGCCT TCCACTCCGC GCACATGGAT
CCCGTCCTCG ACCGCTTCGC CGCCGTCGCC CGCACGCTCG CCTACCGCCC GCCCGCCGTC
CCGATCGCCT CCGCCCTCAC CGGACGTCTG AGCACCGACC CGGATGCCCC GGTCGACCTC
ACCGACCCGG CGTACTGGGT CCGCCACATC CGTGAAACCG TCCGGTTCCA CCAGGCTGTC
ACCCGCGCCG ACGCCGAGAA CATCACCTCC TACCTGGAGA TCGGGCCGAA GCCCACGCTG
ACGCCGTACC TCGGCGACGC CCGCGTGGTC ACCGCGGCAC GCCGCGGGCA GCCCGAGACC
ACCGCTCTAC ACATCGGCCT CGCCGAGCTG TACACGCACG GCACCCCGGT AGCGTGGCGG
GCGGCGAACA ACCCGCGGGC CGCCAGCCGC AGGCCCGCCC ACCGGCACCT GCCCACCTAC
CCGTTCCAGC GCCGCCACCA CTGGACCGAC CCGACCCGCC GATCCGGCGC GCCCGGCGCG
CCCGGCCCGG AGGGGTGGCG CTACCGCATC AGCTGGCGCT CCCGGCCCGT CCCCAGCACC
GGCGCCCTGC CCGGGATGGA CGGCACCCCG CCGCCGCTGA CGGGCCGCTG GCTCCTGCTC
CTCCCCCCCG CGGGCGTCAC CGACGAGGAG GTCTCCCGCG TCTCGTGGAT CGTCGAGCGC
CTCGGCGGCA CCGTGCTCCG CGTGCCGCTG CGCCACGAGG ACACCGACCG GGCCCGCCTC
GCCGCCCGCC TGCGCGCCCT CGGCCAGCCC GGTGCCACCA CGGGCCGAGG CGGCGTGCTG
TCACTGCTCG GCCTCGACAC GACGCGCCAC CCCGACCACC CCGCGCTCAC GACCGGTTTC
GCCCTCACCT GCGCGCTCGC GCAGGCCCTG CACGACATCG AGCTGGACGC GCCGCTGTGG
ACGCTCACCC GTGGCGCCGT CCGGACCGGG GCCGCAGCCG CGACGGCGGC CAGCGACCGG
ATTCTGGATC CGGCCGGGGC GCTGCTCTGG GGGCTGGGGC GGTCCCTCGC CCTCGAGCGG
CCAGGTGCCT GGGGCGGTCT CGTCGACATC CCCGCGGAGC TGGACGACTC CACGCTCGGC
TGGCTCGGCG CCGCACTGAC CGCACCGGAC GGCGAGGACC AGTTCGCCGC CCGCCCGGCG
GGGCTGCTGG TACCCCGCCT GGTCCGGGAG ACCACGCCTG TCACCACCGC ACCGTGGGAC
GTCCGCGGAG CGGTTGTGCT CATCACCGGA GGCACCGGCG CCCTCGGCAC CCGGATCGCC
CGCCGCCTCG CCCGCGACGG AGCCCGCCGG CTGATCCTGG CCAGCCGCCG CGGCCCGGAC
GCGCCCGGGG CGGCCGAGCT GTGCGCCGAG CTGACCGCCC TCGGCGTCCA GGCCGTCGTC
CGTCGCTGCG ACGCCTCCGC CGCCGCCGCC GCGGGCACGG GCACGGGCAC GGGCACGGGC
CAGCTGGCCG CGCTCGTCCG CGAGCTCGCC GCCGATCCCG AGGCCCCGCT CACCGCGATC
GTCCACGCCG CGGGCGTCGA CGGCCCCATG GCCGCGCTGC CCGAGCTCGA CCTGGGCCAG
ATCGCGGCCG TGCTCGCGCC CAAGGCCGGC GCCGCCGAGC TCCTGCATGA GGCCGCTGGG
ACCATCCCGC TCGTGTTCCT GTCGTCGGTC TCGGCGACCT GGGGCAGCGG CGGCCAGGCC
CCGTACAGCG CGGCGAACGC ATACCTCGAC ACGCTGGCCG CGTTCCGGCA CGCCAGCGGG
CGCCCGGCCA CCTCGGTCGC GTTCGGCCCG TGGGCCGAGG CCGGCATGGG CGCCGAACCG
GCCCGCCGTG ACTACCTGCA CCGCCGTGGC CTGAACCCGC TCTCCCCCGG CCGGGCCATC
GATGCCCTGG CCCAGGCGGT CGGCACCGGA TGTCCCGGCC CGGTCACCGT CGCCGACGTC
GACTGGGCAC GGTTCCTGCC CGCCTACACC GCCGCCCGGC CCTCACGCCT GTTCGACGAG
CTACGCGCCA CCGCCGACGA AGCCGCCTCC CCGGCCGCCG ACGGAACGCA GCCCGCGGAT
CTCACCGGGG CGCCGCGCGG TGATGGCGCG GAGCCGGGTG ATGATCTCGC GGCGCTGCCC
GCGGCGGAGC AGGGCCGGGC GCTGCGCGAG CTCGTCCGGG CCGAGGCCGC GGCGGTGCTG
GGCCACCGCG AGGCCGACGA GATCGAGACG GGACGCCGCT TCCTCGAGCT GGGCTTCGAC
TCGCTGGCCT CCGTGCAGCT CAGCCGCCGG CTGGCGGGTG CGACCGGGGT CGCGCTCGCC
ACCGCCGCGG TCTTCGAGCA CCCCACCGTC GCCGACCTCG CCGACCACCT CCACGCCCTG
CTCGCCGGCC GGCCCGCGCC CGCCGCCGCC ACCACCGGCG CCGACGGTGC GGTCGGCGCG
GTCGGCGCGG TCGGGAGCGG CACCCGAACC GGGACCGGCT GGACCTCGAC CAGTGCCAGT
GCCGCCGGCG TCCCCGCCGT AGGCGGCGCG GAGCCGGTCG GCGTGCGTGG GCTCTACCGG
CAGGCGTGCG CTGACGGCAA GTTCGCCGAA GGGGTCGGGG TGCTGCGCGC CGCCGCAAAG
CTGCGGTCGA CGTTCACCGC GCCGAGCGAG CTGGCCAGGC GGCCACGTCC GGTCACCCTC
GCGTCCGGCC CGGCCGGGCC GGCGCTGGTC TGCCTGCCGT CGATGGTCGC GCCGTCCGGG
CCGCACACCT TCGCCCGGCT CGCCCTGCAC CTGCACGGCC GCCGCGCCGT GCACGCGCTG
GCGCACCCCG GCTTCGGCGA CGGCGAGCTT CTGCCCGCCA CCGCCGATCT CGTGGTCGAC
CTGCACGCCG AGACGCTGGC CGCGCACTTC CCCGACACAC CGGTCGCGCT CGCGGGCTAC
TCGTCCGGCG GCTGGTTGGC ACACGCCGTC GCGGCGCGGC TGGAGGCACG CGGCATCCGC
CCGAGCGCCG TGGTCCTGCT CGACACCTGG CTGCCCGGCG ACCGGATCCC CGAGGCCGAC
ATCGCCGAGG AGCTGCGCGG CATCGCCGTC AACGACCAGG CGTTCGCGCT GATGACCGAG
TCCCAGGTGA CCGCGCAGGG CGCCTACCTC GACCTGTTCG AAGGATGGAA ACCCACGTCG
GTGCGTGCGC CGATCGTCCT CGTCCGCGCG GTCCAGCGCA TGCCGGGGCA GCGGGCGGAC
CCCGCGGGCC CGGTGACCGG CTGGGCGGAC GAATGGGACC TGGCCTTCGA CACAAGGGAC
ACCGCCGGCG ACCACCAGTC GATGATGAAC GAGCACGCCG GCTCCACAGC TCACACGGTC
CACGCCTGGC TCGGCGACCT CCACGCACGC CGGTCCCCGG CCCGTACTGT CGCCAGCGGG
ACGGCTATCC GCTGA
 
Protein sequence
MDSTRPEQGH SGATADQARD AWRARLAELS PPAQIEALLD LVVDHVVVVT GSHGGSAGVD 
RGAPWRALGV YRRIADVLRA KLTAATGVRL PATVLFERPT PKAVASFLHA EILGVREEAA
DTAPAPTPAA PAEGGGRGAD PVVVVGMGCR LPGADSPEAL WELVAQGRDV VAGLPTDRGW
DLDGLYHPDP EHPGTAYTRQ GGFLPGVSLF DAGFFGIGPR EATAMDPQQR LMLEVSWEAF
EHAGIDPHRL RGSRTGVFTG VSLQDYGPPW HDAPAELQGH LLTGNALGVV AGRVSYTFGF
EGPALTVDTQ CSASLVAVHL ATAALYAGEC DLALAGGVTV MSTPSMLMEF SRKRGLAPDG
RCKAFSADAD GTGWAEGATV VVLTRLSHAR ARQLPVLAVV AGSAVNQDGA SNGLTAPNGL
SQQRLIQQAL ANAGLSAGDV DAVEAHGTGT PLGDPIEARA LLATYGAGRS PDRPLYLGSL
KSNIGHAQAA AGTAGMIKMI EALRHETLPA SLHITVPTPH VDWATGAVTL LTQPTAWPAT
ARPRRAAVSA FGVSGTNAHL IIEEPPGPAR IPAQATPAAA PAGATEDRDA VRGGDTPEWA
ANLPTTVILS ARGDEALRAH AARLADHLAG HPDVTVDDAA HTLGARAQFP TRAAVVLDGP
PTERASHLVR ALTTLAAGRA APALIRGSTP VDLAGARIAM VFTGQGSQLH GMGRRLHAAY
PAFARALDAA CDRFEPHLDE PLRDVMFAAP DTPPAVLLDQ TLYTQCALFA YQSALFRLLE
SFGVTPHLLL GHSVGELTAA HIAGVWTLSD TAALVAARGR LMQSCVPGAM AAVGATEDEV
RRCLDEYGGR VDIAAVNGPT AVVVAGDSDA VGALAAHWAE QGRPTRGLAV SHAFHSAHMD
PVLDRFAAVA RTLAYRPPAV PIASALTGRL STDPDAPVDL TDPAYWVRHI RETVRFHQAV
TRADAENITS YLEIGPKPTL TPYLGDARVV TAARRGQPET TALHIGLAEL YTHGTPVAWR
AANNPRAASR RPAHRHLPTY PFQRRHHWTD PTRRSGAPGA PGPEGWRYRI SWRSRPVPST
GALPGMDGTP PPLTGRWLLL LPPAGVTDEE VSRVSWIVER LGGTVLRVPL RHEDTDRARL
AARLRALGQP GATTGRGGVL SLLGLDTTRH PDHPALTTGF ALTCALAQAL HDIELDAPLW
TLTRGAVRTG AAAATAASDR ILDPAGALLW GLGRSLALER PGAWGGLVDI PAELDDSTLG
WLGAALTAPD GEDQFAARPA GLLVPRLVRE TTPVTTAPWD VRGAVVLITG GTGALGTRIA
RRLARDGARR LILASRRGPD APGAAELCAE LTALGVQAVV RRCDASAAAA AGTGTGTGTG
QLAALVRELA ADPEAPLTAI VHAAGVDGPM AALPELDLGQ IAAVLAPKAG AAELLHEAAG
TIPLVFLSSV SATWGSGGQA PYSAANAYLD TLAAFRHASG RPATSVAFGP WAEAGMGAEP
ARRDYLHRRG LNPLSPGRAI DALAQAVGTG CPGPVTVADV DWARFLPAYT AARPSRLFDE
LRATADEAAS PAADGTQPAD LTGAPRGDGA EPGDDLAALP AAEQGRALRE LVRAEAAAVL
GHREADEIET GRRFLELGFD SLASVQLSRR LAGATGVALA TAAVFEHPTV ADLADHLHAL
LAGRPAPAAA TTGADGAVGA VGAVGSGTRT GTGWTSTSAS AAGVPAVGGA EPVGVRGLYR
QACADGKFAE GVGVLRAAAK LRSTFTAPSE LARRPRPVTL ASGPAGPALV CLPSMVAPSG
PHTFARLALH LHGRRAVHAL AHPGFGDGEL LPATADLVVD LHAETLAAHF PDTPVALAGY
SSGGWLAHAV AARLEARGIR PSAVVLLDTW LPGDRIPEAD IAEELRGIAV NDQAFALMTE
SQVTAQGAYL DLFEGWKPTS VRAPIVLVRA VQRMPGQRAD PAGPVTGWAD EWDLAFDTRD
TAGDHQSMMN EHAGSTAHTV HAWLGDLHAR RSPARTVASG TAIR