Gene Franean1_3471 details


Gene Information

Locus tag: Franean1_3471
Symbol:
ID: 5671842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4101039
End bp: 4104254
Gene Length: 3216 bp
Protein Length: 1071 aa
Translation table: 11
GC content: 77%
IMG OID: 641242359
Product: erythronolide synthase
Protein accession: YP_001507779
Protein GI: 158315271
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
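
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4101039
END_BP = 4104254
GENE_LENGTH_BP = 3216
PROTEIN_LENGTH_AA = 1071


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3216
print(expected_protein_length(GENE_LENGTH_BP))  # 1071
```

Both results agree with the reported gene length (3216 bp) and protein length (1071 aa).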


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0801482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.865283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACC AGGCTGTGGC GAGCACCGAG GACCGGCTGC GCGACTATCT GCGGCGGGCC 
ACCGCCGACC TGCGCCAGGC CCGGCGCCGG ATCCGGGAGC TGGACGCGCG GGTGCACGAG
CCGATCGCGA TCGTGGGCAT GGGCTGCCGC CTGCCGGGCG GTGTCGACTC CCCGGAGGCG
CTGTGGCGGC TGCTCGACGA GGAGCGGGAC GCCATCGCGG ACCTGCCCAC CGACCGCGGC
TGGGATCCGG AGCGGCTCTA CGACCCCGAT CCCGACCGGC CGGGCACCAC CTACGCCCGC
GAGGGTGGGT TCCTCGCCGA CGCGGGCGGC TTCGACGCGC CGTTCTTCGG CATCGGCCCG
CGCGAGGCGA CCGCCAGCGA CCCGCAGCAC CGGCAGCTGC TCGAGGTCGC CTACGAGACC
CTCGAGTACG CCGGGATCGC CCCGGACTCG CTGCACGGCA CCCGGACGGG CGTGTACGCC
GGGGTGGTGT CCCAGGCGTA CGTGCCGCCG CCGGAGCGGG TGCCAGCCTC GTTCGAGGGC
CACCTGATGA CGGGGAACGC GACGAGCGTC GCGTCCGGGC GGGTGGCCTA CCACCTCGGG
CTGGAGGGAC CGGCCCTCAC CCTGGACACC GCCTGCTCCT CGTCGCTGGT AGCGATGCAC
CTGGCGGCGC GGGCGCTGCA GCGCGGCGAG TGCGACCTGG CGCTCGCCGG CGGGGTGACG
GTGATGGCGA CCCCGGTGCT GCTGGTCGAG TTCAGCCGGC AGCGCGGGCT GTCACCGGAC
GGCCGCTGCA AGGCCTTCGC TGCGGCGGCG GACGGCACCG GCTTCGCCGA GGGCGTCGGG
CTGGTGGCGC TGGAGCGGCT CTCCGACGCC CGCCGGGCCG GCCGGCGCAT CCTGGCGGTG
CTGCGCGGCT CGGCGGTGAA CTCCGACGGG GCCAGCAACG GGCTGACCGC GCCGAACGGC
GTCAGCCAGG AGCGGGTGAT CCGCCAGGCG CTGGCCGACG CGCGGCTGGC CCCGCAGCAG
GTCGACGTGG TGGAGGCGCA CGGGACGGGC ACCCGGCTCG GCGACCCGAT CGAGGCGCAG
GCGCTGCTGG CCACCTACGG GCAGGGCCGG CCGGCCGGCG AGCCGCTGTG GCTGGGGAGC
CTGAAGTCCA ACATCGGCCA CACCCAGGCC GCCGCCGGCA TCGCCGGGGT GATCAAGATG
ATCCTGGCCA TGCGGCACGA CCGGCTGCCG CGCACCCTGC ACGTCGACGA GCCGACGCCG
CACGTCGACT GGGCCGGCGG CGACGTCCAG CTGCTCACCG AGGCCCGCCG GTGGCAGCGC
GGCGACCGGC CGCGCCGGGC GGCGGTCTCC TCGTTCGGGA TCAGCGGCAC CAACGCCCAC
CTCGTCCTGG AGGAGCCGCC CGCGTCCGAC GAGCCCGCCG CGGAGACCGA GCCCGGGCTC
GACTCCGACG AGCCGGCCAC GGGCGGCCAG GCCGGCCCGG CCGCCGGGAC GGGCGACACG
CGGCCGCCGG CGGCCGACCC GGCGGCGGCT CCGTGGGTGC TGTCGGGACG GACCGAGGCG
GCGCTGCGGG CCGCGGCCGG CCGGCTGCAC GAACGGCTGG ACGCCGACCC CGGTCTCGAC
CCGGCCGAGG TCGGGCGGGT GCTGGCCGTC GGGCGCACCG CCCACCCGCA CCGGGCGGCC
GTGCTGCCCG GGTCCAACCG CAGGCGGCTG GAGGCGCTCG CCGCGCTGGC GGGCGGCCGG
CCCGGCGGGC TGGTCGCGCC CGGCGGCCCG GGCGGCCGGA CGGCGTTCCT GTTCACGGGC
CAGGGCAGCC AGCGGCCAGG CATGGGCCGG CACCTCTACG CCAACCGGCC GGTGTTCGCC
GCCGCGCTCG ACGAGGTCTT CGCGGCGTTC GAGCCGCTGC TCGACCGGCG GCTGCCCGAC
CTGATGTTCA CCGAGCCGGG CACTGACGAG GCGGCGCTGC TCGACCGCAC CGAGTACGCC
CAGCCGGCCC TGTTCGCGCT GGAGACGGCG CTGTACCGGC TGCTGGTCGA TCTCGGGCCG
GAACCGGACC TGGTCGCCGG GCACTCGCTC GGCGGGCTGA CCGCCGGGCA CGTGGCCGGC
CTGCTCTCGC TGGCCGACGC CGCCGCGCTC GTCGCCGCCC GCGGCCGCCT CATGCAGGCC
CAGCCCACCG GCGGGCTGAT GGCCGCCTGC GAGGCGACCG CGGACGAGAT CGAGCCACTG
CTCGCCGCGC AGGACGGGCA GGACGGCCAG GTCTCGCTGG CGGCCGTCAA CGGCCCGCGG
GCCGTCGTCG TCTCCGGGGA CGCGGCGGCG GTCGCCCGGG TCACGGCGGC CGTCGCCGCC
CGCGGTGGGC GGACCCGCAC GCTGCGGGTC AGCCACGCCT TCCACTCCGC GCACATGGAC
GGGGCGCTGG CCCCGTTCCG CGAGGTCGTC GCCGGGCTGC GCTTCCAGCC GCCGGTCCTG
GACGTCGTCT CCGAGCTGAC CGGACGGCCG GCGAGCTTCG AGGAGCTGCG CGACCCCGAC
TACTGGGTGC GCCAGCTGCG CTCGCCGGTG CGCTTCGCCG ACGCCGTGGC CACGCTGCAC
GCCGCCGGCG CCACCACCTT CCTCGAGGTC GGCCCGGACG CGGTGCTCAC CCCGATGGTG
GCCGACTGCC TCCCGGTGGG CGCCGGCGCG GCGGGCGCGG GCCAGGTGGT GGCGGTGCCC
GCCCTGGTCC GGGACCGACC GGAGGAGCAG GTGCTCGCCA CCGCCCTGGT CTGGCGGCAT
GTGCGCGGCG CGGACGTCCG CTGGGATGCC TGGTTCGGCC CGGCGGCCGG CCCGCCGCCG
CACCTGCCCA CCTATCCCTT CGAGCACCGC CGTTACTGGT GGCCCGCCGA GGTCGTCGCC
GCCCCCGCGG GCCCGCGGCC CGTGACGGCC GAGGTGGCTG AGGTCCCCGC GGCCGACCAG
GCCGACCAGG CCGACCCGCC GGTCGCCGCG GACCGTGGGG AGGTGCTGCT CGCCCTGGTC
CGTGAGCACG CCGCCGAGGT GCTCGGCCAT CCCGATCCGG ACTCCATCGG GGCGGACGAC
AACTTCCTGG AGATCGGGTT CTCGTCCTTC ACCGCCCTCG AGGTGCGCAA CCGGCTGTGC
GAGGCGACCG GCCTGACGCT GCCGGCGGTG CTGCTCTACG ACCACCCGAC ACCGACGGCG
GTCGTCCGGT TCCTCGAGGA GAGCCTGGTC GCCTGA
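
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before recomputing values such as the GC content reported in the Gene Information section. The following is a minimal sketch of that cleanup with a basic sanity check on the reading frame. The helper names are hypothetical, and the short `sample` string is a toy input, not the full gene. The check accepts only ATG as a start codon, which matches this gene, although translation table 11 also allows alternative starts.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the listing above (not the real gene).
sample = "ATGGCC GCCTGA"
seq = clean_sequence(sample)
print(len(seq), looks_like_complete_cds(seq), gc_fraction(seq))
```

To check the record itself, paste the full gene sequence into `sample` and compare `len(seq)` and `gc_fraction(seq)` with the reported gene length and GC content.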
 
Protein sequence
MTDQAVASTE DRLRDYLRRA TADLRQARRR IRELDARVHE PIAIVGMGCR LPGGVDSPEA 
LWRLLDEERD AIADLPTDRG WDPERLYDPD PDRPGTTYAR EGGFLADAGG FDAPFFGIGP
REATASDPQH RQLLEVAYET LEYAGIAPDS LHGTRTGVYA GVVSQAYVPP PERVPASFEG
HLMTGNATSV ASGRVAYHLG LEGPALTLDT ACSSSLVAMH LAARALQRGE CDLALAGGVT
VMATPVLLVE FSRQRGLSPD GRCKAFAAAA DGTGFAEGVG LVALERLSDA RRAGRRILAV
LRGSAVNSDG ASNGLTAPNG VSQERVIRQA LADARLAPQQ VDVVEAHGTG TRLGDPIEAQ
ALLATYGQGR PAGEPLWLGS LKSNIGHTQA AAGIAGVIKM ILAMRHDRLP RTLHVDEPTP
HVDWAGGDVQ LLTEARRWQR GDRPRRAAVS SFGISGTNAH LVLEEPPASD EPAAETEPGL
DSDEPATGGQ AGPAAGTGDT RPPAADPAAA PWVLSGRTEA ALRAAAGRLH ERLDADPGLD
PAEVGRVLAV GRTAHPHRAA VLPGSNRRRL EALAALAGGR PGGLVAPGGP GGRTAFLFTG
QGSQRPGMGR HLYANRPVFA AALDEVFAAF EPLLDRRLPD LMFTEPGTDE AALLDRTEYA
QPALFALETA LYRLLVDLGP EPDLVAGHSL GGLTAGHVAG LLSLADAAAL VAARGRLMQA
QPTGGLMAAC EATADEIEPL LAAQDGQDGQ VSLAAVNGPR AVVVSGDAAA VARVTAAVAA
RGGRTRTLRV SHAFHSAHMD GALAPFREVV AGLRFQPPVL DVVSELTGRP ASFEELRDPD
YWVRQLRSPV RFADAVATLH AAGATTFLEV GPDAVLTPMV ADCLPVGAGA AGAGQVVAVP
ALVRDRPEEQ VLATALVWRH VRGADVRWDA WFGPAAGPPP HLPTYPFEHR RYWWPAEVVA
APAGPRPVTA EVAEVPAADQ ADQADPPVAA DRGEVLLALV REHAAEVLGH PDPDSIGADD
NFLEIGFSSF TALEVRNRLC EATGLTLPAV LLYDHPTPTA VVRFLEESLV A