Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3890 |
Symbol | |
ID | 5672251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4652720 |
End bp | 4655017 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641242769 |
Product | erythronolide synthase |
Protein accession | YP_001508186 |
Protein GI | 158315678 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000750553 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.261243 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGTTC CCCTCGACCG CCTCGTCGAG GCGCTGCGCG CGGCGATGAA GGAGAACGAG CGGCTGCGGG GGCTGATCGC GCAGGCCGGT GAGCCGATCG CGGTCATCGG GATGGCCTGC CGTTTCCCGG GCGGGGTGCG CTCACCTGAC GATCTGTGGC GGTTCGTCGC CGGCGGCGGC GACGCCATCA CCCCGTTCCC GACCGACCGG GGCTGGGACA CCTCGGCCCT CACCTGCTTC GGGGGCGGGT TCCTGCACGA CGCGGCCGAC TTCGACCCGG AGTTCTTCGG GATGTCGGAG AACGAGGCGC TGGCCACCGA CCCGCAGCAG CGGCTGCTGC TGGAGACCTC GTGGGAGGCC TTCGAGCACG CCGGGATAGA CCCGCGGTCG GCGCGGGGCA GCCGCACCGG CGTGTACTTC GGTGTGATCT TCCACGACTA CGGGTACCGG CTGCGCCCAC CGCCCCCGGG CATCGACGGC TACGCCTACT TCGGCAGCGC GGGCAGCATC GCGTCCGGGC GGGTCTCCTA CACGCTCGGT CTCGAGGGGC CGTGTGTGAC GCTGGACGCC GCGTGCGCGT CCTCGCTGGT CGGCCTGCAT CTGGCCGGTC AGGCCCTGCG GCGCGGCGAG TGCTCGCTCG CCCTGGTCGG CGGGGTGTCG GTGCTGGCCA CCCCCGAGCT GTGGGAGGAG ACGACCCGGC ACGGGCAGGG CCTGGCCCCG GACAGCCGCT GCAAGTCCTT CGCCGCCGCG GCTGACGGCA CCGGCATCTC CGAGGGGGTG GGTGTGCTGC TGGTGGAACG GCTCGGCGAC GCCCGGCGCA ACGGCCATCC GGTGCTGGCG GTGGTGCGCG GCAGCGCCGT CGAGCAGAGC GGCGCGACCA ACGGCTTCAC CGCGCCCAGC AGCGCCTCGC AGGAGCGGCT GATCAGCCGG GCGCTGGCTG ACGCCGGCCT CGCGCCGGCC GACGTCGACG CGGTCGAGGG GCACGGCACC GGAACCCCGG TCGGTGACCC GATCGAGCTG GCGGCCCTGC TGGCCACCTA CGGGCGCGGA CGGCCCGCCG GCCGGCCGCT GTGGCTGGGG TCGCTGAAGT CGAACCTGGG CCACACCCAG GCCGCCGGCG GCGCCGGTGC CGTCATCAAG ATGGCGATGG CCATGCGGCA CGGCGTCCTG CCGCGCACCG TGCACGTCGA CGCCCCGACC CCGCACGCGG ACTGGTCCGA GGGCGCGGTG GTGCTGCTGA CCGAAGCGGT GCCCTGGCCG GACACCGCCG GCCGGCCCCG CCGGGTGGGG GTCTCCTCGT TCGGCGCCAA CGGCACACTG GCGCACGTGA TCCTCGAGCA GGATCAGCCG CCGCCCGCCG ACGCGGCAGC CGGGGATGGG ACGGCCGGGG ACGCGGTGGA CCGGCCGGTG GTGCCGTGGC CGCTGTCGGC CCGGTCCGCG CCCGCTCTGC GGGCCCAGGC CCGGCGCCTG CTCGACCTGG TGACCGAGCG GGCCGATCTC CGCCCGGCCG ACGTCGGGTT CTCGCTCGCG ACCGGCCGCA GCGCCTTCGC GCACCGGGCG GTGCTGGTCG CCGACGACCG CGCGGAGTTC CTGCGCGGCC TGACGGCGCT GGCCGCCGGC CCGCCGGAGA GCACTGACGC CGCCCTGGAC GGCCGCACCG GAGCCGGGCT TGTGCTCGTG CTGCCCGGCG GGGGTGAGTG GGGGCTCGAG GCAGGCCGCG CGCTGGCGGA CGCGTTCCCG GCCTTCGCCG CGGAGTGGGC AACCCTCCGC GGGGCTCTCA CAGGGGCGGC TCTTCAGACG GCGCCACAGG TGGCGCTACA GGTGGCGCTG TGCCGGCTGC TGGAGTCCTG GGATGTGCGG CCCGACGCCG TCGTCGGGCA CGGTGCGGGC GAGGCCGCGG CCGCGTACGT GCGGGGCGCG CTGTCGCTGG ACGATCTGCG GGCACTGCTG GCGGCGGGCC CCCGCAACGC GTCGCCCGGC TGCCGTGTTC CCCCGGGCAC CGCGAACGCG CTGGTCGTGG GGCGCGACGC GGCCGGCGTC GACATCCCGC GCGGCGTGAG GTCATGGGTG CTGCTGCCGC CCGGTCGCAC CGAGATCCGC GCCGTCACCG AGGCGTCCGC CACGCTGTAC GAGTGGGGGC AGGCCGTCGA CTGGCGGGCG TTCTTCGCGG GTACCGGCGC CCGGCGGATC GACCTGCCCA CCTATGCCTT CCAGCGCCGG CGGTACTGGT TGGAGGCCGC GGATCCTGGC CCGACGCCCG CTCCGGGCCT ACCGGCCGCA CCGGGCGTGC GGCGTTAG
|
Protein sequence | MTVPLDRLVE ALRAAMKENE RLRGLIAQAG EPIAVIGMAC RFPGGVRSPD DLWRFVAGGG DAITPFPTDR GWDTSALTCF GGGFLHDAAD FDPEFFGMSE NEALATDPQQ RLLLETSWEA FEHAGIDPRS ARGSRTGVYF GVIFHDYGYR LRPPPPGIDG YAYFGSAGSI ASGRVSYTLG LEGPCVTLDA ACASSLVGLH LAGQALRRGE CSLALVGGVS VLATPELWEE TTRHGQGLAP DSRCKSFAAA ADGTGISEGV GVLLVERLGD ARRNGHPVLA VVRGSAVEQS GATNGFTAPS SASQERLISR ALADAGLAPA DVDAVEGHGT GTPVGDPIEL AALLATYGRG RPAGRPLWLG SLKSNLGHTQ AAGGAGAVIK MAMAMRHGVL PRTVHVDAPT PHADWSEGAV VLLTEAVPWP DTAGRPRRVG VSSFGANGTL AHVILEQDQP PPADAAAGDG TAGDAVDRPV VPWPLSARSA PALRAQARRL LDLVTERADL RPADVGFSLA TGRSAFAHRA VLVADDRAEF LRGLTALAAG PPESTDAALD GRTGAGLVLV LPGGGEWGLE AGRALADAFP AFAAEWATLR GALTGAALQT APQVALQVAL CRLLESWDVR PDAVVGHGAG EAAAAYVRGA LSLDDLRALL AAGPRNASPG CRVPPGTANA LVVGRDAAGV DIPRGVRSWV LLPPGRTEIR AVTEASATLY EWGQAVDWRA FFAGTGARRI DLPTYAFQRR RYWLEAADPG PTPAPGLPAA PGVRR
|
| |