Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4268 |
Symbol | |
ID | 5672623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5095447 |
End bp | 5101521 |
Gene Length | 6075 bp |
Protein Length | 2024 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243141 |
Product | erythronolide synthase |
Protein accession | YP_001508558 |
Protein GI | 158316050 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.35732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGCA CCCGCCCAGA GCAGGGCCAC TCCGGGGCGA CCGCGGACCA GGCCCGCGAC GCGTGGCGGG CCCGGCTCGC CGAGTTGTCG CCACCCGCCC AGATCGAGGC GCTGCTCGAC CTCGTCGTCG ACCACGTGGT CGTGGTCACC GGCAGCCACG GCGGCTCGGC GGGCGTCGAC CGCGGTGCCC CCTGGCGCGC GCTGGGGGTC TACCGGCGCA TCGCGGACGT CCTGCGAGCG AAGCTCACCG CGGCGACCGG GGTGCGGCTG CCGGCGACGG TGCTCTTCGA GCGCCCCACA CCGAAGGCGG TGGCCTCGTT CCTGCACGCC GAGATCCTGG GCGTGCGCGA GGAGGCAGCC GACACCGCGC CCGCGCCCAC GCCGGCCGCA CCGGCAGAGG GGGGCGGGCG CGGCGCCGAT CCCGTGGTCG TGGTGGGCAT GGGCTGCCGG CTGCCCGGCG CCGACTCGCC GGAGGCCCTG TGGGAGCTGG TGGCGCAGGG CCGTGACGTC GTCGCCGGCC TGCCGACGGA CCGCGGCTGG GACCTGGACG GCCTGTACCA CCCGGACCCC GAGCACCCGG GCACCGCCTA CACCCGCCAG GGCGGCTTCC TCCCCGGGGT CAGCCTGTTC GACGCCGGCT TCTTCGGGAT CGGACCCCGC GAGGCCACCG CGATGGACCC GCAGCAGCGG CTCATGCTCG AGGTCTCCTG GGAGGCGTTC GAGCACGCGG GCATCGACCC GCACCGGCTG CGCGGCAGCC GCACCGGGGT GTTCACCGGT GTCTCCCTGC AGGACTACGG CCCGCCCTGG CACGACGCCC CCGCCGAACT GCAGGGTCAC CTGCTGACCG GCAACGCGCT GGGCGTGGTC GCCGGCCGGG TCTCCTACAC CTTCGGCTTC GAGGGCCCCG CGCTGACGGT GGACACCCAG TGCTCCGCCT CGCTCGTCGC CGTCCACCTG GCGACCGCGG CCCTGTACGC CGGGGAGTGC GACCTCGCCC TCGCCGGCGG CGTCACCGTG ATGAGCACGC CGTCCATGCT GATGGAGTTC AGCCGCAAGC GCGGGCTGGC GCCCGACGGC CGGTGCAAGG CGTTCTCCGC CGACGCCGAC GGCACCGGAT GGGCCGAGGG CGCCACCGTG GTCGTCCTCA CCCGGCTCTC CCACGCCCGC GCGCGGCAGC TGCCGGTGCT GGCCGTCGTC GCGGGCAGCG CCGTGAACCA GGACGGCGCC AGCAACGGCC TGACCGCCCC CAACGGCCTG TCCCAGCAGC GGCTCATCCA GCAGGCCCTG GCCAACGCCG GCCTGAGCGC CGGCGACGTC GACGCGGTCG AGGCGCACGG CACCGGCACC CCGCTCGGCG ACCCCATCGA GGCCCGCGCG CTGCTCGCTA CCTACGGCGC CGGCCGGTCC CCGGACCGGC CGCTGTACCT GGGCTCCCTC AAGTCCAACA TCGGCCATGC CCAGGCCGCC GCGGGCACCG CCGGCATGAT CAAGATGATC GAGGCACTGC GCCACGAGAC GCTGCCCGCC AGCCTGCACA TCACCGTCCC CACCCCCCAC GTGGACTGGG CGACCGGCGC CGTCACCCTC CTGACACAGC CCACGGCGTG GCCCGCCACC GCGCGGCCGC GCCGCGCCGC CGTCTCCGCC TTCGGCGTCT CGGGCACCAA CGCCCACCTC ATCATCGAGG AACCCCCCGG CCCGGCCCGC ATCCCCGCGC AGGCCACGCC TGCCGCCGCG CCCGCCGGCG CCACCGAGGA CCGGGACGCC GTCCGCGGCG GGGACACTCC CGAGTGGGCG GCCAACCTGC CCACAACGGT GATCCTCTCC GCCCGCGGTG ACGAGGCGCT GCGCGCGCAC GCCGCCCGCC TCGCCGACCA CCTCGCCGGC CATCCGGACG TCACCGTCGA CGACGCCGCC CACACCCTGG GCGCGCGGGC CCAGTTCCCC ACGCGCGCCG CCGTCGTCCT GGACGGGCCG CCGACGGAAC GGGCGTCCCA CCTGGTTCGG GCGCTGACCA CGCTGGCCGC CGGGCGGGCG GCGCCCGCTC TCATCCGCGG CAGCACCCCC GTGGACCTAG CCGGCGCCCG GATCGCGATG GTGTTCACCG GCCAGGGCAG CCAGCTCCAC GGCATGGGAC GGCGGCTGCA CGCCGCCTAC CCGGCCTTCG CCCGCGCGCT CGACGCCGCG TGCGACCGCT TCGAGCCGCA TCTCGACGAG CCGCTGCGCG ACGTCATGTT CGCCGCCCCG GACACCCCGC CCGCCGTCCT GCTGGACCAG ACGCTGTACA CCCAGTGCGC GCTCTTCGCC TACCAGAGCG CCCTGTTCCG GCTGTTGGAG AGCTTCGGGG TCACCCCGCA CCTGCTGCTC GGCCACTCCG TGGGCGAGCT CACCGCGGCG CACATCGCCG GCGTCTGGAC CCTGTCTGAC ACGGCCGCGC TGGTCGCCGC CCGCGGCCGT CTCATGCAGA GCTGTGTGCC CGGCGCCATG GCCGCCGTCG GCGCGACCGA GGACGAGGTC CGTCGGTGCC TCGACGAGTA CGGGGGCCGG GTCGACATCG CCGCCGTCAA CGGCCCCACC GCCGTCGTCG TGGCCGGTGA CTCCGACGCC GTCGGCGCGC TCGCGGCGCA CTGGGCCGAG CAGGGGCGCC CGACCCGCGG CCTCGCCGTC AGTCACGCCT TCCACTCCGC GCACATGGAT CCCGTCCTCG ACCGCTTCGC CGCCGTCGCC CGCACGCTCG CCTACCGCCC GCCCGCCGTC CCGATCGCCT CCGCCCTCAC CGGACGTCTG AGCACCGACC CGGATGCCCC GGTCGACCTC ACCGACCCGG CGTACTGGGT CCGCCACATC CGTGAAACCG TCCGGTTCCA CCAGGCTGTC ACCCGCGCCG ACGCCGAGAA CATCACCTCC TACCTGGAGA TCGGGCCGAA GCCCACGCTG ACGCCGTACC TCGGCGACGC CCGCGTGGTC ACCGCGGCAC GCCGCGGGCA GCCCGAGACC ACCGCTCTAC ACATCGGCCT CGCCGAGCTG TACACGCACG GCACCCCGGT AGCGTGGCGG GCGGCGAACA ACCCGCGGGC CGCCAGCCGC AGGCCCGCCC ACCGGCACCT GCCCACCTAC CCGTTCCAGC GCCGCCACCA CTGGACCGAC CCGACCCGCC GATCCGGCGC GCCCGGCGCG CCCGGCCCGG AGGGGTGGCG CTACCGCATC AGCTGGCGCT CCCGGCCCGT CCCCAGCACC GGCGCCCTGC CCGGGATGGA CGGCACCCCG CCGCCGCTGA CGGGCCGCTG GCTCCTGCTC CTCCCCCCCG CGGGCGTCAC CGACGAGGAG GTCTCCCGCG TCTCGTGGAT CGTCGAGCGC CTCGGCGGCA CCGTGCTCCG CGTGCCGCTG CGCCACGAGG ACACCGACCG GGCCCGCCTC GCCGCCCGCC TGCGCGCCCT CGGCCAGCCC GGTGCCACCA CGGGCCGAGG CGGCGTGCTG TCACTGCTCG GCCTCGACAC GACGCGCCAC CCCGACCACC CCGCGCTCAC GACCGGTTTC GCCCTCACCT GCGCGCTCGC GCAGGCCCTG CACGACATCG AGCTGGACGC GCCGCTGTGG ACGCTCACCC GTGGCGCCGT CCGGACCGGG GCCGCAGCCG CGACGGCGGC CAGCGACCGG ATTCTGGATC CGGCCGGGGC GCTGCTCTGG GGGCTGGGGC GGTCCCTCGC CCTCGAGCGG CCAGGTGCCT GGGGCGGTCT CGTCGACATC CCCGCGGAGC TGGACGACTC CACGCTCGGC TGGCTCGGCG CCGCACTGAC CGCACCGGAC GGCGAGGACC AGTTCGCCGC CCGCCCGGCG GGGCTGCTGG TACCCCGCCT GGTCCGGGAG ACCACGCCTG TCACCACCGC ACCGTGGGAC GTCCGCGGAG CGGTTGTGCT CATCACCGGA GGCACCGGCG CCCTCGGCAC CCGGATCGCC CGCCGCCTCG CCCGCGACGG AGCCCGCCGG CTGATCCTGG CCAGCCGCCG CGGCCCGGAC GCGCCCGGGG CGGCCGAGCT GTGCGCCGAG CTGACCGCCC TCGGCGTCCA GGCCGTCGTC CGTCGCTGCG ACGCCTCCGC CGCCGCCGCC GCGGGCACGG GCACGGGCAC GGGCACGGGC CAGCTGGCCG CGCTCGTCCG CGAGCTCGCC GCCGATCCCG AGGCCCCGCT CACCGCGATC GTCCACGCCG CGGGCGTCGA CGGCCCCATG GCCGCGCTGC CCGAGCTCGA CCTGGGCCAG ATCGCGGCCG TGCTCGCGCC CAAGGCCGGC GCCGCCGAGC TCCTGCATGA GGCCGCTGGG ACCATCCCGC TCGTGTTCCT GTCGTCGGTC TCGGCGACCT GGGGCAGCGG CGGCCAGGCC CCGTACAGCG CGGCGAACGC ATACCTCGAC ACGCTGGCCG CGTTCCGGCA CGCCAGCGGG CGCCCGGCCA CCTCGGTCGC GTTCGGCCCG TGGGCCGAGG CCGGCATGGG CGCCGAACCG GCCCGCCGTG ACTACCTGCA CCGCCGTGGC CTGAACCCGC TCTCCCCCGG CCGGGCCATC GATGCCCTGG CCCAGGCGGT CGGCACCGGA TGTCCCGGCC CGGTCACCGT CGCCGACGTC GACTGGGCAC GGTTCCTGCC CGCCTACACC GCCGCCCGGC CCTCACGCCT GTTCGACGAG CTACGCGCCA CCGCCGACGA AGCCGCCTCC CCGGCCGCCG ACGGAACGCA GCCCGCGGAT CTCACCGGGG CGCCGCGCGG TGATGGCGCG GAGCCGGGTG ATGATCTCGC GGCGCTGCCC GCGGCGGAGC AGGGCCGGGC GCTGCGCGAG CTCGTCCGGG CCGAGGCCGC GGCGGTGCTG GGCCACCGCG AGGCCGACGA GATCGAGACG GGACGCCGCT TCCTCGAGCT GGGCTTCGAC TCGCTGGCCT CCGTGCAGCT CAGCCGCCGG CTGGCGGGTG CGACCGGGGT CGCGCTCGCC ACCGCCGCGG TCTTCGAGCA CCCCACCGTC GCCGACCTCG CCGACCACCT CCACGCCCTG CTCGCCGGCC GGCCCGCGCC CGCCGCCGCC ACCACCGGCG CCGACGGTGC GGTCGGCGCG GTCGGCGCGG TCGGGAGCGG CACCCGAACC GGGACCGGCT GGACCTCGAC CAGTGCCAGT GCCGCCGGCG TCCCCGCCGT AGGCGGCGCG GAGCCGGTCG GCGTGCGTGG GCTCTACCGG CAGGCGTGCG CTGACGGCAA GTTCGCCGAA GGGGTCGGGG TGCTGCGCGC CGCCGCAAAG CTGCGGTCGA CGTTCACCGC GCCGAGCGAG CTGGCCAGGC GGCCACGTCC GGTCACCCTC GCGTCCGGCC CGGCCGGGCC GGCGCTGGTC TGCCTGCCGT CGATGGTCGC GCCGTCCGGG CCGCACACCT TCGCCCGGCT CGCCCTGCAC CTGCACGGCC GCCGCGCCGT GCACGCGCTG GCGCACCCCG GCTTCGGCGA CGGCGAGCTT CTGCCCGCCA CCGCCGATCT CGTGGTCGAC CTGCACGCCG AGACGCTGGC CGCGCACTTC CCCGACACAC CGGTCGCGCT CGCGGGCTAC TCGTCCGGCG GCTGGTTGGC ACACGCCGTC GCGGCGCGGC TGGAGGCACG CGGCATCCGC CCGAGCGCCG TGGTCCTGCT CGACACCTGG CTGCCCGGCG ACCGGATCCC CGAGGCCGAC ATCGCCGAGG AGCTGCGCGG CATCGCCGTC AACGACCAGG CGTTCGCGCT GATGACCGAG TCCCAGGTGA CCGCGCAGGG CGCCTACCTC GACCTGTTCG AAGGATGGAA ACCCACGTCG GTGCGTGCGC CGATCGTCCT CGTCCGCGCG GTCCAGCGCA TGCCGGGGCA GCGGGCGGAC CCCGCGGGCC CGGTGACCGG CTGGGCGGAC GAATGGGACC TGGCCTTCGA CACAAGGGAC ACCGCCGGCG ACCACCAGTC GATGATGAAC GAGCACGCCG GCTCCACAGC TCACACGGTC CACGCCTGGC TCGGCGACCT CCACGCACGC CGGTCCCCGG CCCGTACTGT CGCCAGCGGG ACGGCTATCC GCTGA
|
Protein sequence | MDSTRPEQGH SGATADQARD AWRARLAELS PPAQIEALLD LVVDHVVVVT GSHGGSAGVD RGAPWRALGV YRRIADVLRA KLTAATGVRL PATVLFERPT PKAVASFLHA EILGVREEAA DTAPAPTPAA PAEGGGRGAD PVVVVGMGCR LPGADSPEAL WELVAQGRDV VAGLPTDRGW DLDGLYHPDP EHPGTAYTRQ GGFLPGVSLF DAGFFGIGPR EATAMDPQQR LMLEVSWEAF EHAGIDPHRL RGSRTGVFTG VSLQDYGPPW HDAPAELQGH LLTGNALGVV AGRVSYTFGF EGPALTVDTQ CSASLVAVHL ATAALYAGEC DLALAGGVTV MSTPSMLMEF SRKRGLAPDG RCKAFSADAD GTGWAEGATV VVLTRLSHAR ARQLPVLAVV AGSAVNQDGA SNGLTAPNGL SQQRLIQQAL ANAGLSAGDV DAVEAHGTGT PLGDPIEARA LLATYGAGRS PDRPLYLGSL KSNIGHAQAA AGTAGMIKMI EALRHETLPA SLHITVPTPH VDWATGAVTL LTQPTAWPAT ARPRRAAVSA FGVSGTNAHL IIEEPPGPAR IPAQATPAAA PAGATEDRDA VRGGDTPEWA ANLPTTVILS ARGDEALRAH AARLADHLAG HPDVTVDDAA HTLGARAQFP TRAAVVLDGP PTERASHLVR ALTTLAAGRA APALIRGSTP VDLAGARIAM VFTGQGSQLH GMGRRLHAAY PAFARALDAA CDRFEPHLDE PLRDVMFAAP DTPPAVLLDQ TLYTQCALFA YQSALFRLLE SFGVTPHLLL GHSVGELTAA HIAGVWTLSD TAALVAARGR LMQSCVPGAM AAVGATEDEV RRCLDEYGGR VDIAAVNGPT AVVVAGDSDA VGALAAHWAE QGRPTRGLAV SHAFHSAHMD PVLDRFAAVA RTLAYRPPAV PIASALTGRL STDPDAPVDL TDPAYWVRHI RETVRFHQAV TRADAENITS YLEIGPKPTL TPYLGDARVV TAARRGQPET TALHIGLAEL YTHGTPVAWR AANNPRAASR RPAHRHLPTY PFQRRHHWTD PTRRSGAPGA PGPEGWRYRI SWRSRPVPST GALPGMDGTP PPLTGRWLLL LPPAGVTDEE VSRVSWIVER LGGTVLRVPL RHEDTDRARL AARLRALGQP GATTGRGGVL SLLGLDTTRH PDHPALTTGF ALTCALAQAL HDIELDAPLW TLTRGAVRTG AAAATAASDR ILDPAGALLW GLGRSLALER PGAWGGLVDI PAELDDSTLG WLGAALTAPD GEDQFAARPA GLLVPRLVRE TTPVTTAPWD VRGAVVLITG GTGALGTRIA RRLARDGARR LILASRRGPD APGAAELCAE LTALGVQAVV RRCDASAAAA AGTGTGTGTG QLAALVRELA ADPEAPLTAI VHAAGVDGPM AALPELDLGQ IAAVLAPKAG AAELLHEAAG TIPLVFLSSV SATWGSGGQA PYSAANAYLD TLAAFRHASG RPATSVAFGP WAEAGMGAEP ARRDYLHRRG LNPLSPGRAI DALAQAVGTG CPGPVTVADV DWARFLPAYT AARPSRLFDE LRATADEAAS PAADGTQPAD LTGAPRGDGA EPGDDLAALP AAEQGRALRE LVRAEAAAVL GHREADEIET GRRFLELGFD SLASVQLSRR LAGATGVALA TAAVFEHPTV ADLADHLHAL LAGRPAPAAA TTGADGAVGA VGAVGSGTRT GTGWTSTSAS AAGVPAVGGA EPVGVRGLYR QACADGKFAE GVGVLRAAAK LRSTFTAPSE LARRPRPVTL ASGPAGPALV CLPSMVAPSG PHTFARLALH LHGRRAVHAL AHPGFGDGEL LPATADLVVD LHAETLAAHF PDTPVALAGY SSGGWLAHAV AARLEARGIR PSAVVLLDTW LPGDRIPEAD IAEELRGIAV NDQAFALMTE SQVTAQGAYL DLFEGWKPTS VRAPIVLVRA VQRMPGQRAD PAGPVTGWAD EWDLAFDTRD TAGDHQSMMN EHAGSTAHTV HAWLGDLHAR RSPARTVASG TAIR
|
| |