Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1665 |
Symbol | |
ID | 5670067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1990044 |
End bp | 1992857 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240583 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001506009 |
Protein GI | 158313501 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.175798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.265588 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGGAA CGCATCCGTT GGCCGTGCTG GCGGCGAATC TCGCGGAACC CGACCCGACA ATCGTGGGGC GGTTCTCTCG GATCGTCACA GCGCGTGCCA ACGACATTGC GGTACGCGAT GAGAAAGCTG CACTGAGCTA TGCCGAGCTG GACTCGCGTT CATCCGCGCT GGCTCGTTCG CTTGTCGTCG AGGGGGACGG CGGCAACATC GGGATCCTGC TGGGGCAGGG AGCGCCCGCG ATCGTCGCGA TGCTGGGCGC GCTGAAGGCC GGGCGGCCGT TCGTGCCGCT CGACCCGATG CTTCCCGCGG CGCGGCTGGG GCAGATCCTG CGGCTGGCGG GGGTCGCCAC CTGCGTGACC GACAGCGCGC ACACCGAGCT GCTCGCCGCC GCCCGGCTGG AAGCGGCCGA CACCGGTCCC GCGCCTGGTC CCGAGCCCGG AACGGTGCCG GGCTCGGAGC ACACCCTGAT CATTGACGAC GGGCCGCCGG CCGGTACCGC CGAGATCGAC GACGACCTGC TGCCGGGCCG TCGGGCGCTG CCCACCGACC CAGCCTTCCT GGTGTTCAGC TCCGGCTCCA CCGGCGTCCC GAAGGGCGTC GTCTGGCGGA ACCGGACCGT GATGAAGGAC CTGGACGCGG GCATCGAGCG GGTCGGGATG AACGCCGCCG ACCAGATCGC CCTCGTGCTG CCGACCGCGT TCGCCGCCGG CATCACCGTC ATGTTCTGGG GGCTGCTCTG CGGCGCCACG CTGCACCCGT TCGACCCGCG GGCCCGCGGG ATCGGCGCGA TGCCTGCCTG GCTCGTCGAC CGCGGCATCA CCACGCTGCA CCTGACCCCG TCGCTGATGC GCGCGCTGGC CGGCGCCGCC GAGCCCGGGC TGATCCTGGG CGACCTGCGC GCGGTCACCA GCTCCGGTGA GGCCGTGTAC GGCCGGGATG TCGCCGCTCT GCGCGAGCTG CTGCCTACAA CGTGCACGTT CTACAACTGG TCCGGCTCGA CCGAGACCGC CTCGCTCGCC TTCTTCCCGG TCGGATCAGG CGACGAGATT CCGGCCGGTC CACTGCCGGC GGGATGGGCC GTCGACGGCA AGGACATCGA GATCGTCGAC GAGCACGGCA AACCCGTACC GGATGGTGCC ACCGGCGAAA TATCAGTAAC GTCAAGGTAC CTTTCCGGCG GCTACTGGAA CGCCCCCGAG ATGACGGCGC AGCGATTTCG ACCGGTTCGC CTGAGTGTCG ATTCTCCGGC GGACGGCCTG CGGGATTCCG GTGCGCTGGT TTCCGGTGAC GAAGCCCTGG CGGGCCCGGC GAGCCCGGCG GACTCCGCGG GCCCGGACCA CGACCTGCCC GTCTCCGGCG TCACCGTCAC CGGTGAGATC GTCCTCGGTG ACCTCGGGTC CCACGAGGTC GTGTACCGCG GCGGGGATCT GGCCCGGCGA CGCCCGGACG GGTGCATCGA ACTACTCGGC CGCGCCGACG CCGCGGTGAA GATCCGTGGT TACCTGGTCG AGCCCGCCGA GATCGAGACC GTCCTGCTCA CCTCGCCGGA CGTCCTCGAG GCCGTCGTCG TCGCCGAGCG CGTCGAGGGC GAGCCTCCGC GGCTGGTCGC CTACGTCGTG TGCGCCACCG CGGTGAAGTC GGCGACCGTG GCGATCCGCG GCCTGCTGCG GGAGAAGCTG CCCGCCTACA TGGTTCCCTC CTCGGTGATC CTGCTCAACG AGCTGCCCCG CAACGAGCGC GGCAAGATCG ACCGGCCCGC GCTGCCGTCG CCGCCGCCCC GGCCTCCCGC CCGCCGTCCC ACCGGGCGCA GCCACTGGGA GTTCGCGCTG TGCGACCTGT TCGCGCAGAT CCTGAAGGTC GATGAGGTCG GCGTCGACGA CGACTTCCTC GAGCTCGGCG GGGACTCGCT GCTCGCCCAG CAGCTCCTCA GCGAGATCGC CACCCGTCTC GGCGTGACCC TGCCGACCTC GGTGCTGGTG GAGGCGCCGA CCGTCGCCGC CCTCGCGGCC CGGGTGGCAG GTTCCCAACG GGGCATCCCG ACCCATCCGA CCTGCGTGCC GCTCAACGTC GACGGCTCGC GTCCGCCGCT GTTCTGCGTC GCCGGCGCGG GCGGGCTCGC CATCAACTTC CTCGGGCTCT CCCGCCTGCT CGGCTCCGAC CAGCGGGTCT ACGGCCTGCA GGCGCAGGGA ATGGAAAGCC GGGCGCTGCC CGACTGGAGC GTCGAGCGCC ACGCCCGGCG CCACCTGGCC GTGCTCCGCG TGATCCAGCC GACCGGCCCT TACTACCTCG CCGGCTTCTC GTTCGGCGGG CTGGTCGCCC TGGAGATGGC GCACATGCTC GCCGCCGCCG GCGAGGAGGT GGGGATGCTG CTGCTGCTCG ACACCACGCT GCCGCGCTCG GCTCAGTCCG CCGGCGGCGC GCGTCCCAGC GGCGACCCCA GTGGCGGGGG ACGCATCACC CGGCTGCTGC CGGACCGGCT CCCGCTGCCG AACGCGTCCA AGCTACGCAA GGCGGCACGC CTGCCGCTGA CCGGGCTCGT CCGCTACCCC GGGCTCGTCC AGTTCGACGT GTTCTTCGAC CAGGCCCGGT TCATCACCCA GTCGTACCGG GTCCGGCCGT ACGCCGGGCG GACCGTCCTC TACCTCGCCG AGGACAACCC CAAGTCCGGT CGGGACGAGT GGCCCACGCA CCTGACCGGT GACTCCACCA TCGTCACCGT CCCCGGCGAG CACCACACGA TGCTCAACGA GCCGAACGTC TCGGTGCTCG CCGCGGACAT GCGCGCCCGC CTCGGAGAGT CGATGCAGCT CTAG
|
Protein sequence | MVGTHPLAVL AANLAEPDPT IVGRFSRIVT ARANDIAVRD EKAALSYAEL DSRSSALARS LVVEGDGGNI GILLGQGAPA IVAMLGALKA GRPFVPLDPM LPAARLGQIL RLAGVATCVT DSAHTELLAA ARLEAADTGP APGPEPGTVP GSEHTLIIDD GPPAGTAEID DDLLPGRRAL PTDPAFLVFS SGSTGVPKGV VWRNRTVMKD LDAGIERVGM NAADQIALVL PTAFAAGITV MFWGLLCGAT LHPFDPRARG IGAMPAWLVD RGITTLHLTP SLMRALAGAA EPGLILGDLR AVTSSGEAVY GRDVAALREL LPTTCTFYNW SGSTETASLA FFPVGSGDEI PAGPLPAGWA VDGKDIEIVD EHGKPVPDGA TGEISVTSRY LSGGYWNAPE MTAQRFRPVR LSVDSPADGL RDSGALVSGD EALAGPASPA DSAGPDHDLP VSGVTVTGEI VLGDLGSHEV VYRGGDLARR RPDGCIELLG RADAAVKIRG YLVEPAEIET VLLTSPDVLE AVVVAERVEG EPPRLVAYVV CATAVKSATV AIRGLLREKL PAYMVPSSVI LLNELPRNER GKIDRPALPS PPPRPPARRP TGRSHWEFAL CDLFAQILKV DEVGVDDDFL ELGGDSLLAQ QLLSEIATRL GVTLPTSVLV EAPTVAALAA RVAGSQRGIP THPTCVPLNV DGSRPPLFCV AGAGGLAINF LGLSRLLGSD QRVYGLQAQG MESRALPDWS VERHARRHLA VLRVIQPTGP YYLAGFSFGG LVALEMAHML AAAGEEVGML LLLDTTLPRS AQSAGGARPS GDPSGGGRIT RLLPDRLPLP NASKLRKAAR LPLTGLVRYP GLVQFDVFFD QARFITQSYR VRPYAGRTVL YLAEDNPKSG RDEWPTHLTG DSTIVTVPGE HHTMLNEPNV SVLAADMRAR LGESMQL
|
| |