Gene Franean1_4836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4836 
Symbol 
ID5673177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5778090 
End bp5781521 
Gene Length3432 bp 
Protein Length1143 aa 
Translation table11 
GC content75% 
IMG OID641243692 
Producterythronolide synthase 
Protein accessionYP_001509108 
Protein GI158316600 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0120543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGT CCCATGACAC GCTCGTCGAG GCACTGCGCG CCTCGCTGCT GGAGAACGAG 
CGGCTGCAAC GGGAGAACGA CGGACTGCGC AGGGAGAACG TCCAGCTCTC CGAGCCCACC
CGTCAGACGC CCGTCCGGGA GCCCGTCGCA ATCGTCGGTG CCGGCTGTCG GCTGCCGGGC
GGGGTCACCT CTCCCGACGA TCTGTGGCGA CTGGTGGACG AGGGCAGGGA GGGCCTCTCG
CCGTTCCCCA CCGACCGTGG CTGGGACCTC GAAAAGCTGT TCGACCCCGA CCCGACCCGC
CCCGGGACGT CCTACGTGCG GACCGGCGGG TTCCTGGACG CGGCGGCGTT CGACGCCGGG
TTCTTCGGCA TCGCCCCGCG TGAGGCCCTC GCGATGGACC CCCAGCAGCG GCTGATGCTC
GAAGTGTCCT GGGAGGCGGT CGAACGGGCC GGGATCGACC CGACATCCCT GCGCGGCCGG
CGGGTCGGCG TGTACACGGG CGTGATGTAC CACGACTACG CGACCGGCGT GACCGACGTC
CCGCCGGAGC TGGAGGGGCT GCTCGGGACG GGGAACTCCG GCAGCGTGGT CTCCGGGCGG
GTCTCCTATC TGCTCGGTCT CGAGGGACCG TCGGTCACCG TCGACACGGC CTGCTCGTCC
TCCCTGGTCG CCGTCCACCT GGCGCTGCGG GCGCTGCGGG CCGGTGAGAT CGAGCTTGCC
CTGGTCGGCG GGGTGGCGGT GATGGCCCAG CCGGGTCCGT TCGTCGAGTT CTCCCGCCAG
CGGGGGCTCG CGCCCGACGG CAGGTGCCGG TCGTTCGCGG CGGGCGCCGA CGGCACGGGA
TGGTCCGAGG GCGCCGTGGT GCTGGTCCTC GAGCGACTGG ACGACGCCCA CCGCGACGGA
CGTCGCGTGC GCGCGCTGGT GCGCGGCTCG GCGGTGAACT CCGACGGGGC GTCCAACGGC
CTGACCGCCC CGAACGGCCC GGCGCAGCAG CGGGTGATCC GCCAGGCGCT CGCCGACGCG
GACCTGACCA CCCAGGACGT GGACGTCGTC GAGGGCCACG GGACGGGTAC GCCGCTCGGC
GACCCGATCG AGGCCCAGGC GCTGCTGGCA ACCTACGGAC GCAGGCCGGC ACAGCATCCG
CTGTGGCTCG GGTCACTGAA GTCGAACATC GGCCACACCC AGGCGGCGGC CGGGGTGGCC
GGCATCCTCA AGATCGTGGC GGCGCTGGAG CACGCCGAAC TGCCCCGGAC ACTGCACGCC
GACGTCCCGT CGGACCAGGT CGACTGGGAG TCGGGCGCGG TCCGCCTGCT GAGCGAGGCA
CGGCCGTGGC CCGCGCGTGA CCGGCCGCGC CGCGCCGCGG TCTCGTCCTT CGGCGTGAGC
GGCACCAACG CGCACGTCAT CCTGGAGCAG GCACCTGAGC CACCGCCCGG CGCCGGTGAA
CCACCGCCCG GCGTGCCCTC GACCGCCGGC GAACTCCCTG ACGGGGACGG GAGACCGGCG
GGAGCCGGAG CCGCACCGGC CGCCTGGGTC GTCAGCGCCG CGAGCCGGAC GGCGCTGCGG
GTCCTCGCCA GGCGGGTGGA AACGGCTCTG CATGAACAGC CGCACGTGCC GGTCGACGCC
GCCGGCGCCG CGTTGCGCAC GAGCCGGGCG ACGCTGCGCC ACCGCGCCGT GGTGATCGCC
GAGGACCGGG AGTCCGGCCT CGCGGGCCTG GCGGCGGCGG CCGCCGGTGA GCCGGCGGCG
AACCTGGTGA CCGGTTCGGC GGACGTCGAC GGCCAGACCG TCTTCGTCTT CGCCGGCCAG
GGCGGCCAGT GGGCGGGGAT GGGCGCCGAG CTGCTCGACG CCTCGCCCGT CTTCGCCGAA
GAGGTGGCGC TGGCGGGCCG CGCGCTCGCC CGGCATGTCG GCTGGTCGGT GGAGGACGTG
CTGCGCCAGG TGCCGGGCGC ACCCTCGCTC GACCGGGTGG ACGTCGTCCA GCCGGCTTCC
TTCGCCGTGG CGGCCGGGCT GGTCCGGCTG TGGGAGTCGG TCGGGGTCCG CCCGGACGCG
GTGGTCGGCC ACTCGCAGGG CGAGATCGCC GCCGCCTACA CGGCGGGGGC GCTGAGCCTC
GCCGACGCCG CCGCGGTGGT CGCACTGCGC AGCCAGGCGA TCGCCGGCGG GCTCACCGGC
CGGGGCGGGA TGGCGGCCGT GGCGCTGCCG GTTCCCGCGG TCGCCGAACG GCTCGCGCCG
TTCGCTGACC AGGTCGAGCT CGCCGCCGTG AACGGCCCGG CGTCGGTCGT CCTCGCCGGC
GAGCCCGCCG CGCTCGACGA ACTGGTGGCC GCCTTCGAGA GCGAGGGGAT CCGGGCCCGC
CGGATCCCGG TGGACTACGC CTCGCACAGC CGGCAGGTCG AACGGATCTC CGGTGTCCTC
ATCCAGGCGC TCGCCGGTCT CGACCCGCGC CCGCCGCGGG TGCCGTTCTT CTCGACCGTC
GACGCGAAGT GGGTGGAGGA CGCCGAGCTC GACGGCGGGT ACTGGTACCG CAACCTGCGG
TGCCCGGTCC GGTTCGCGTC CGCCACGCGG AGCCTGCTCG ACGGCGGCTA CCGGGTGTTC
CTCGAGGTGA GCACCCACCC GGTGCTCGCG CCCGCCATCG CGGAGACGGT GGACCAGTGG
GACGGGCCGC CGGTCGCCGT GCTGGAGTCC CTGCGCCGCG ACGACGGCGG TCCGGACCGC
TTCACCCGCT CCGCCGCGGC GGCGTTCGTA CGCGGGGTCA CGGTCGACCT CGCGGGGGCG
TCGCACGCCG CCGCCCACGA CGGGCTGGCC GGCACCGCCG ACGGGCCGGC CGGCACCGGC
GACCGGCCCG TGCCGTCAGG CGGCGGCGCG GTCGAGCTAC CTACGTATCC GTTCCAGCGC
AGGCGCTACT GGCTCGCTTC GACCGGCGCG GGCCGGCCGT CCGGCACGCC TCGTCCGGGT
GGCCGCGGCG GTGGGTTGGA AGTCGACTCC GGAGATGACC ACATGACCGG TGACGTGACG
CGAGACATCT CGGCCCGGCT ACCGGCAGGT TCGGAAGCGG GCGCGGCGCA GGATGCGGAG
CCTGAGCCCG CGGGGTCCGA GAACGCTGGG CCTGGGGACT CAGGGCTTGA GAACGCGGAG
ACGGACGACG CCGGGCAGGA GTTTCGCGCC CGTCTCGCCG AGCTGCCCAC CGCCGAGCGG
CCTGCCCGGC TCATCGAGCT GGTCCGGGCG CAGGCCGCCG CGGTGCTAGG TCACGCCGAC
ACCGACGAGG TCGGCCCGGA CAGCGCGTTC TTCGACATCG GGTTCAGCTC GCTGACCGCC
GTCGAGCTGC GTAACCGTCT CGCGGCGGCG ACCGGCCTCA CGCTGCCCGC GATGTTGCTG
TTCGACCACG CGGTGCCCCG CGAGGTCGCC GCGTACCTGC TGGACCGGCT GGAGGTCGAG
ACCCGTGTTT GA
 
Protein sequence
MATSHDTLVE ALRASLLENE RLQRENDGLR RENVQLSEPT RQTPVREPVA IVGAGCRLPG 
GVTSPDDLWR LVDEGREGLS PFPTDRGWDL EKLFDPDPTR PGTSYVRTGG FLDAAAFDAG
FFGIAPREAL AMDPQQRLML EVSWEAVERA GIDPTSLRGR RVGVYTGVMY HDYATGVTDV
PPELEGLLGT GNSGSVVSGR VSYLLGLEGP SVTVDTACSS SLVAVHLALR ALRAGEIELA
LVGGVAVMAQ PGPFVEFSRQ RGLAPDGRCR SFAAGADGTG WSEGAVVLVL ERLDDAHRDG
RRVRALVRGS AVNSDGASNG LTAPNGPAQQ RVIRQALADA DLTTQDVDVV EGHGTGTPLG
DPIEAQALLA TYGRRPAQHP LWLGSLKSNI GHTQAAAGVA GILKIVAALE HAELPRTLHA
DVPSDQVDWE SGAVRLLSEA RPWPARDRPR RAAVSSFGVS GTNAHVILEQ APEPPPGAGE
PPPGVPSTAG ELPDGDGRPA GAGAAPAAWV VSAASRTALR VLARRVETAL HEQPHVPVDA
AGAALRTSRA TLRHRAVVIA EDRESGLAGL AAAAAGEPAA NLVTGSADVD GQTVFVFAGQ
GGQWAGMGAE LLDASPVFAE EVALAGRALA RHVGWSVEDV LRQVPGAPSL DRVDVVQPAS
FAVAAGLVRL WESVGVRPDA VVGHSQGEIA AAYTAGALSL ADAAAVVALR SQAIAGGLTG
RGGMAAVALP VPAVAERLAP FADQVELAAV NGPASVVLAG EPAALDELVA AFESEGIRAR
RIPVDYASHS RQVERISGVL IQALAGLDPR PPRVPFFSTV DAKWVEDAEL DGGYWYRNLR
CPVRFASATR SLLDGGYRVF LEVSTHPVLA PAIAETVDQW DGPPVAVLES LRRDDGGPDR
FTRSAAAAFV RGVTVDLAGA SHAAAHDGLA GTADGPAGTG DRPVPSGGGA VELPTYPFQR
RRYWLASTGA GRPSGTPRPG GRGGGLEVDS GDDHMTGDVT RDISARLPAG SEAGAAQDAE
PEPAGSENAG PGDSGLENAE TDDAGQEFRA RLAELPTAER PARLIELVRA QAAAVLGHAD
TDEVGPDSAF FDIGFSSLTA VELRNRLAAA TGLTLPAMLL FDHAVPREVA AYLLDRLEVE
TRV