Gene Franean1_3890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3890 
Symbol 
ID5672251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4652720 
End bp4655017 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content76% 
IMG OID641242769 
Producterythronolide synthase 
Protein accessionYP_001508186 
Protein GI158315678 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000750553 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.261243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGTTC CCCTCGACCG CCTCGTCGAG GCGCTGCGCG CGGCGATGAA GGAGAACGAG 
CGGCTGCGGG GGCTGATCGC GCAGGCCGGT GAGCCGATCG CGGTCATCGG GATGGCCTGC
CGTTTCCCGG GCGGGGTGCG CTCACCTGAC GATCTGTGGC GGTTCGTCGC CGGCGGCGGC
GACGCCATCA CCCCGTTCCC GACCGACCGG GGCTGGGACA CCTCGGCCCT CACCTGCTTC
GGGGGCGGGT TCCTGCACGA CGCGGCCGAC TTCGACCCGG AGTTCTTCGG GATGTCGGAG
AACGAGGCGC TGGCCACCGA CCCGCAGCAG CGGCTGCTGC TGGAGACCTC GTGGGAGGCC
TTCGAGCACG CCGGGATAGA CCCGCGGTCG GCGCGGGGCA GCCGCACCGG CGTGTACTTC
GGTGTGATCT TCCACGACTA CGGGTACCGG CTGCGCCCAC CGCCCCCGGG CATCGACGGC
TACGCCTACT TCGGCAGCGC GGGCAGCATC GCGTCCGGGC GGGTCTCCTA CACGCTCGGT
CTCGAGGGGC CGTGTGTGAC GCTGGACGCC GCGTGCGCGT CCTCGCTGGT CGGCCTGCAT
CTGGCCGGTC AGGCCCTGCG GCGCGGCGAG TGCTCGCTCG CCCTGGTCGG CGGGGTGTCG
GTGCTGGCCA CCCCCGAGCT GTGGGAGGAG ACGACCCGGC ACGGGCAGGG CCTGGCCCCG
GACAGCCGCT GCAAGTCCTT CGCCGCCGCG GCTGACGGCA CCGGCATCTC CGAGGGGGTG
GGTGTGCTGC TGGTGGAACG GCTCGGCGAC GCCCGGCGCA ACGGCCATCC GGTGCTGGCG
GTGGTGCGCG GCAGCGCCGT CGAGCAGAGC GGCGCGACCA ACGGCTTCAC CGCGCCCAGC
AGCGCCTCGC AGGAGCGGCT GATCAGCCGG GCGCTGGCTG ACGCCGGCCT CGCGCCGGCC
GACGTCGACG CGGTCGAGGG GCACGGCACC GGAACCCCGG TCGGTGACCC GATCGAGCTG
GCGGCCCTGC TGGCCACCTA CGGGCGCGGA CGGCCCGCCG GCCGGCCGCT GTGGCTGGGG
TCGCTGAAGT CGAACCTGGG CCACACCCAG GCCGCCGGCG GCGCCGGTGC CGTCATCAAG
ATGGCGATGG CCATGCGGCA CGGCGTCCTG CCGCGCACCG TGCACGTCGA CGCCCCGACC
CCGCACGCGG ACTGGTCCGA GGGCGCGGTG GTGCTGCTGA CCGAAGCGGT GCCCTGGCCG
GACACCGCCG GCCGGCCCCG CCGGGTGGGG GTCTCCTCGT TCGGCGCCAA CGGCACACTG
GCGCACGTGA TCCTCGAGCA GGATCAGCCG CCGCCCGCCG ACGCGGCAGC CGGGGATGGG
ACGGCCGGGG ACGCGGTGGA CCGGCCGGTG GTGCCGTGGC CGCTGTCGGC CCGGTCCGCG
CCCGCTCTGC GGGCCCAGGC CCGGCGCCTG CTCGACCTGG TGACCGAGCG GGCCGATCTC
CGCCCGGCCG ACGTCGGGTT CTCGCTCGCG ACCGGCCGCA GCGCCTTCGC GCACCGGGCG
GTGCTGGTCG CCGACGACCG CGCGGAGTTC CTGCGCGGCC TGACGGCGCT GGCCGCCGGC
CCGCCGGAGA GCACTGACGC CGCCCTGGAC GGCCGCACCG GAGCCGGGCT TGTGCTCGTG
CTGCCCGGCG GGGGTGAGTG GGGGCTCGAG GCAGGCCGCG CGCTGGCGGA CGCGTTCCCG
GCCTTCGCCG CGGAGTGGGC AACCCTCCGC GGGGCTCTCA CAGGGGCGGC TCTTCAGACG
GCGCCACAGG TGGCGCTACA GGTGGCGCTG TGCCGGCTGC TGGAGTCCTG GGATGTGCGG
CCCGACGCCG TCGTCGGGCA CGGTGCGGGC GAGGCCGCGG CCGCGTACGT GCGGGGCGCG
CTGTCGCTGG ACGATCTGCG GGCACTGCTG GCGGCGGGCC CCCGCAACGC GTCGCCCGGC
TGCCGTGTTC CCCCGGGCAC CGCGAACGCG CTGGTCGTGG GGCGCGACGC GGCCGGCGTC
GACATCCCGC GCGGCGTGAG GTCATGGGTG CTGCTGCCGC CCGGTCGCAC CGAGATCCGC
GCCGTCACCG AGGCGTCCGC CACGCTGTAC GAGTGGGGGC AGGCCGTCGA CTGGCGGGCG
TTCTTCGCGG GTACCGGCGC CCGGCGGATC GACCTGCCCA CCTATGCCTT CCAGCGCCGG
CGGTACTGGT TGGAGGCCGC GGATCCTGGC CCGACGCCCG CTCCGGGCCT ACCGGCCGCA
CCGGGCGTGC GGCGTTAG
 
Protein sequence
MTVPLDRLVE ALRAAMKENE RLRGLIAQAG EPIAVIGMAC RFPGGVRSPD DLWRFVAGGG 
DAITPFPTDR GWDTSALTCF GGGFLHDAAD FDPEFFGMSE NEALATDPQQ RLLLETSWEA
FEHAGIDPRS ARGSRTGVYF GVIFHDYGYR LRPPPPGIDG YAYFGSAGSI ASGRVSYTLG
LEGPCVTLDA ACASSLVGLH LAGQALRRGE CSLALVGGVS VLATPELWEE TTRHGQGLAP
DSRCKSFAAA ADGTGISEGV GVLLVERLGD ARRNGHPVLA VVRGSAVEQS GATNGFTAPS
SASQERLISR ALADAGLAPA DVDAVEGHGT GTPVGDPIEL AALLATYGRG RPAGRPLWLG
SLKSNLGHTQ AAGGAGAVIK MAMAMRHGVL PRTVHVDAPT PHADWSEGAV VLLTEAVPWP
DTAGRPRRVG VSSFGANGTL AHVILEQDQP PPADAAAGDG TAGDAVDRPV VPWPLSARSA
PALRAQARRL LDLVTERADL RPADVGFSLA TGRSAFAHRA VLVADDRAEF LRGLTALAAG
PPESTDAALD GRTGAGLVLV LPGGGEWGLE AGRALADAFP AFAAEWATLR GALTGAALQT
APQVALQVAL CRLLESWDVR PDAVVGHGAG EAAAAYVRGA LSLDDLRALL AAGPRNASPG
CRVPPGTANA LVVGRDAAGV DIPRGVRSWV LLPPGRTEIR AVTEASATLY EWGQAVDWRA
FFAGTGARRI DLPTYAFQRR RYWLEAADPG PTPAPGLPAA PGVRR