Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3471 |
Symbol | |
ID | 5671842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4101039 |
End bp | 4104254 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242359 |
Product | erythronolide synthase |
Protein accession | YP_001507779 |
Protein GI | 158315271 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0801482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.865283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACC AGGCTGTGGC GAGCACCGAG GACCGGCTGC GCGACTATCT GCGGCGGGCC ACCGCCGACC TGCGCCAGGC CCGGCGCCGG ATCCGGGAGC TGGACGCGCG GGTGCACGAG CCGATCGCGA TCGTGGGCAT GGGCTGCCGC CTGCCGGGCG GTGTCGACTC CCCGGAGGCG CTGTGGCGGC TGCTCGACGA GGAGCGGGAC GCCATCGCGG ACCTGCCCAC CGACCGCGGC TGGGATCCGG AGCGGCTCTA CGACCCCGAT CCCGACCGGC CGGGCACCAC CTACGCCCGC GAGGGTGGGT TCCTCGCCGA CGCGGGCGGC TTCGACGCGC CGTTCTTCGG CATCGGCCCG CGCGAGGCGA CCGCCAGCGA CCCGCAGCAC CGGCAGCTGC TCGAGGTCGC CTACGAGACC CTCGAGTACG CCGGGATCGC CCCGGACTCG CTGCACGGCA CCCGGACGGG CGTGTACGCC GGGGTGGTGT CCCAGGCGTA CGTGCCGCCG CCGGAGCGGG TGCCAGCCTC GTTCGAGGGC CACCTGATGA CGGGGAACGC GACGAGCGTC GCGTCCGGGC GGGTGGCCTA CCACCTCGGG CTGGAGGGAC CGGCCCTCAC CCTGGACACC GCCTGCTCCT CGTCGCTGGT AGCGATGCAC CTGGCGGCGC GGGCGCTGCA GCGCGGCGAG TGCGACCTGG CGCTCGCCGG CGGGGTGACG GTGATGGCGA CCCCGGTGCT GCTGGTCGAG TTCAGCCGGC AGCGCGGGCT GTCACCGGAC GGCCGCTGCA AGGCCTTCGC TGCGGCGGCG GACGGCACCG GCTTCGCCGA GGGCGTCGGG CTGGTGGCGC TGGAGCGGCT CTCCGACGCC CGCCGGGCCG GCCGGCGCAT CCTGGCGGTG CTGCGCGGCT CGGCGGTGAA CTCCGACGGG GCCAGCAACG GGCTGACCGC GCCGAACGGC GTCAGCCAGG AGCGGGTGAT CCGCCAGGCG CTGGCCGACG CGCGGCTGGC CCCGCAGCAG GTCGACGTGG TGGAGGCGCA CGGGACGGGC ACCCGGCTCG GCGACCCGAT CGAGGCGCAG GCGCTGCTGG CCACCTACGG GCAGGGCCGG CCGGCCGGCG AGCCGCTGTG GCTGGGGAGC CTGAAGTCCA ACATCGGCCA CACCCAGGCC GCCGCCGGCA TCGCCGGGGT GATCAAGATG ATCCTGGCCA TGCGGCACGA CCGGCTGCCG CGCACCCTGC ACGTCGACGA GCCGACGCCG CACGTCGACT GGGCCGGCGG CGACGTCCAG CTGCTCACCG AGGCCCGCCG GTGGCAGCGC GGCGACCGGC CGCGCCGGGC GGCGGTCTCC TCGTTCGGGA TCAGCGGCAC CAACGCCCAC CTCGTCCTGG AGGAGCCGCC CGCGTCCGAC GAGCCCGCCG CGGAGACCGA GCCCGGGCTC GACTCCGACG AGCCGGCCAC GGGCGGCCAG GCCGGCCCGG CCGCCGGGAC GGGCGACACG CGGCCGCCGG CGGCCGACCC GGCGGCGGCT CCGTGGGTGC TGTCGGGACG GACCGAGGCG GCGCTGCGGG CCGCGGCCGG CCGGCTGCAC GAACGGCTGG ACGCCGACCC CGGTCTCGAC CCGGCCGAGG TCGGGCGGGT GCTGGCCGTC GGGCGCACCG CCCACCCGCA CCGGGCGGCC GTGCTGCCCG GGTCCAACCG CAGGCGGCTG GAGGCGCTCG CCGCGCTGGC GGGCGGCCGG CCCGGCGGGC TGGTCGCGCC CGGCGGCCCG GGCGGCCGGA CGGCGTTCCT GTTCACGGGC CAGGGCAGCC AGCGGCCAGG CATGGGCCGG CACCTCTACG CCAACCGGCC GGTGTTCGCC GCCGCGCTCG ACGAGGTCTT CGCGGCGTTC GAGCCGCTGC TCGACCGGCG GCTGCCCGAC CTGATGTTCA CCGAGCCGGG CACTGACGAG GCGGCGCTGC TCGACCGCAC CGAGTACGCC CAGCCGGCCC TGTTCGCGCT GGAGACGGCG CTGTACCGGC TGCTGGTCGA TCTCGGGCCG GAACCGGACC TGGTCGCCGG GCACTCGCTC GGCGGGCTGA CCGCCGGGCA CGTGGCCGGC CTGCTCTCGC TGGCCGACGC CGCCGCGCTC GTCGCCGCCC GCGGCCGCCT CATGCAGGCC CAGCCCACCG GCGGGCTGAT GGCCGCCTGC GAGGCGACCG CGGACGAGAT CGAGCCACTG CTCGCCGCGC AGGACGGGCA GGACGGCCAG GTCTCGCTGG CGGCCGTCAA CGGCCCGCGG GCCGTCGTCG TCTCCGGGGA CGCGGCGGCG GTCGCCCGGG TCACGGCGGC CGTCGCCGCC CGCGGTGGGC GGACCCGCAC GCTGCGGGTC AGCCACGCCT TCCACTCCGC GCACATGGAC GGGGCGCTGG CCCCGTTCCG CGAGGTCGTC GCCGGGCTGC GCTTCCAGCC GCCGGTCCTG GACGTCGTCT CCGAGCTGAC CGGACGGCCG GCGAGCTTCG AGGAGCTGCG CGACCCCGAC TACTGGGTGC GCCAGCTGCG CTCGCCGGTG CGCTTCGCCG ACGCCGTGGC CACGCTGCAC GCCGCCGGCG CCACCACCTT CCTCGAGGTC GGCCCGGACG CGGTGCTCAC CCCGATGGTG GCCGACTGCC TCCCGGTGGG CGCCGGCGCG GCGGGCGCGG GCCAGGTGGT GGCGGTGCCC GCCCTGGTCC GGGACCGACC GGAGGAGCAG GTGCTCGCCA CCGCCCTGGT CTGGCGGCAT GTGCGCGGCG CGGACGTCCG CTGGGATGCC TGGTTCGGCC CGGCGGCCGG CCCGCCGCCG CACCTGCCCA CCTATCCCTT CGAGCACCGC CGTTACTGGT GGCCCGCCGA GGTCGTCGCC GCCCCCGCGG GCCCGCGGCC CGTGACGGCC GAGGTGGCTG AGGTCCCCGC GGCCGACCAG GCCGACCAGG CCGACCCGCC GGTCGCCGCG GACCGTGGGG AGGTGCTGCT CGCCCTGGTC CGTGAGCACG CCGCCGAGGT GCTCGGCCAT CCCGATCCGG ACTCCATCGG GGCGGACGAC AACTTCCTGG AGATCGGGTT CTCGTCCTTC ACCGCCCTCG AGGTGCGCAA CCGGCTGTGC GAGGCGACCG GCCTGACGCT GCCGGCGGTG CTGCTCTACG ACCACCCGAC ACCGACGGCG GTCGTCCGGT TCCTCGAGGA GAGCCTGGTC GCCTGA
|
Protein sequence | MTDQAVASTE DRLRDYLRRA TADLRQARRR IRELDARVHE PIAIVGMGCR LPGGVDSPEA LWRLLDEERD AIADLPTDRG WDPERLYDPD PDRPGTTYAR EGGFLADAGG FDAPFFGIGP REATASDPQH RQLLEVAYET LEYAGIAPDS LHGTRTGVYA GVVSQAYVPP PERVPASFEG HLMTGNATSV ASGRVAYHLG LEGPALTLDT ACSSSLVAMH LAARALQRGE CDLALAGGVT VMATPVLLVE FSRQRGLSPD GRCKAFAAAA DGTGFAEGVG LVALERLSDA RRAGRRILAV LRGSAVNSDG ASNGLTAPNG VSQERVIRQA LADARLAPQQ VDVVEAHGTG TRLGDPIEAQ ALLATYGQGR PAGEPLWLGS LKSNIGHTQA AAGIAGVIKM ILAMRHDRLP RTLHVDEPTP HVDWAGGDVQ LLTEARRWQR GDRPRRAAVS SFGISGTNAH LVLEEPPASD EPAAETEPGL DSDEPATGGQ AGPAAGTGDT RPPAADPAAA PWVLSGRTEA ALRAAAGRLH ERLDADPGLD PAEVGRVLAV GRTAHPHRAA VLPGSNRRRL EALAALAGGR PGGLVAPGGP GGRTAFLFTG QGSQRPGMGR HLYANRPVFA AALDEVFAAF EPLLDRRLPD LMFTEPGTDE AALLDRTEYA QPALFALETA LYRLLVDLGP EPDLVAGHSL GGLTAGHVAG LLSLADAAAL VAARGRLMQA QPTGGLMAAC EATADEIEPL LAAQDGQDGQ VSLAAVNGPR AVVVSGDAAA VARVTAAVAA RGGRTRTLRV SHAFHSAHMD GALAPFREVV AGLRFQPPVL DVVSELTGRP ASFEELRDPD YWVRQLRSPV RFADAVATLH AAGATTFLEV GPDAVLTPMV ADCLPVGAGA AGAGQVVAVP ALVRDRPEEQ VLATALVWRH VRGADVRWDA WFGPAAGPPP HLPTYPFEHR RYWWPAEVVA APAGPRPVTA EVAEVPAADQ ADQADPPVAA DRGEVLLALV REHAAEVLGH PDPDSIGADD NFLEIGFSSF TALEVRNRLC EATGLTLPAV LLYDHPTPTA VVRFLEESLV A
|
| |