Gene Information |
Locus tag | Francci3_1978 |
Symbol | |
ID | 3903686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2322036 |
End bp | 2324816 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637879314 |
Product | beta-ketoacyl synthase |
Protein accession | YP_481081 |
Protein GI | 86740681 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
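
The coordinate, gene length, and protein length fields above are mutually consistent, and the check is simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2322036
end_bp = 2324816
gene_length_bp = 2781
protein_length_aa = 926

# Assumption: coordinates are 1-based and inclusive, so both end points count.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.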
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.690285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGAC CAGACGACGT CCGGGATTCC GACATCGCTA TTATCGGGAT GGCGTGCCGT TTTCCCGGGG CCGATACCGT CGAGGAATTC TGGCAGAATC TGTGTGACGG GCGTGAGACC ACCACCTTCT TCACCGAGGC CGAACTGCTG GCAGCCGGCG TCGGCCCCGC ACTCGTGGCG GACCCGCGCT ATGTGCGGGC CGGGCAGATA CTGGCCGACA TCGAGATGTT CGACGCCGAG GTCTTCGGGA TCACGAGCGA GGAGGCGGAG CTCCTGGACC CACAGCAACG GCACTTCCTC GAATGCGCGC TCACGGCGCT TGAGAACGCC GGTTACAACC CGGACACCTG CCCCGGACCG ATCGGTATCT ATGCCGGCGC CGGGCTGAAC ACCTACCTTC TACGCAACCT CTCCGGCCGT TTCCGCCACG GGTCGACGCT GGAACGTTAC CGGCTGATGA TAGGCAACGA CAAGGACTTC CTGGCGACGC GCGTGGCCTA CAAGCTGAAC CTGCGGGGCC CGGCGGTGAA CGTGAACACC GCCTGCTCCA CTTCGCTGGT GGCCGTCAAT CTGGCCTGCC TGAGCCTGCT GGGCGGAGAG TGCGATCTTG CGTTGGCCGG AGCGGCCCAT ATCAAGGTGC CACAGGCCGA GGGCTACCTT TTCCAGGAGG GAATGATCCT GTCGCCGGAC GGCCGGTGCC GGGCTTTCGA CGCCCGAGCG CAGGGCACGA TCCTCGGCAG CGGTGTCGGC ATCGTGGTCC TGAAGCGGTT GGCCGACGCG CTCGCCGACG GCGACTGGAT TCACGCGGTC ATCAAGGGAA GCGCGGTCAA CAACGACGGC GGCGCGAAGA CCGGCTACAC GGCTCCCAGC GTGCAGGGCC AGGCCGCCGT GATCGCCGAG GCTCAGGCTC TTGCCGGTTG CGCCGCCGAG ACGATCACCT ATGTCGAGGC CCACGGTACC GGCACCCCGC TCGGGGACCC GGTCGAGCTC GCCGCCCTGA CCGACGTGTT CCTGCGGCAG GACGTGTTCC TGCGGCAGAC AGGGGCAGGC GCCCGGTGCG CCATCGGTTC GGTCAAGACC AACATCGGCC ACCTGGACGC GGCGTCCGGG ATCGCCGGAC TGATCAAAAC CTCGCTGATG CTGCGACACC GGCGGATCGT GCCGAGCCTG CACTTCGAAA AACCCAATCC CGACATCGAC TTCGACGCCA ATCCCTTCCG CGTCGCCACG GAGTCCCAGG AGTGGCCGGC ATCGGGCACA GCCCCGCGAC GTGCCGGCGT GAGCTCGTTC GGCATTGGCG GCACGAACGC CCATGTGATT CTTGAGGAGC CGCCGCCCCG CGAACGCGAG CCCGGGGTGC CGCCTTCGGA CGGCTGGCAG TTGCTGATGA TCTCTGCCGG CTCGGTCGCG GCGCTCGAGG AGGCAACGGA GAATCTGGCG CGTCATCTGC GCGTGCACCG CGAGACGCTC GACCTCGCCG ACGTGGCGTA CACGCTCGCG GTGGGACGCC GGGCACGTGC GTACCGACGG GCGCTGGTGT GCCGGGACGT CCACGACGCC GCACTGACGC TCGCGCTCGG GGAGCCCGAC CGCGTGTCCA CCGGGCAGGT TGCCGACGGC GAGGCCGACG TAGTCTTCGT TTTCGCGCAC CGTCTTATGG AGAACGGCGA CGACACCCCT CTCGCCGGCC TGTACCGCGA CGCCGCCGTA TACCGCGACG TCGCTGTGTT CCGCGACGCG GTCGACCGGT GCGCGGTGGC CCTCAAGACT CTCGGGGTGA GGCCGCGGAC ACCCGACCCG 
GCGTCGTTGT TGCGTGCGCG GCATGGCGAC GCCGTCGCCG CCTTCGTCGG GCAGTACGCT CTCGCCGAGC TCTGGACGGC ATGTGGTGTG CGTCCGAGCG CGCTGGTGGG GTTCGGCTCG GGCGATCTGG TCGCCGCCTG CCTCGCCGGG ATCTTCCCCC TCGAGTCCGC GCTCGGGCTG GCCTGGGCGC AGGCGCACGG CGAGCCGGCC GGCAACCTCA CGCCAGCGGC CCCCCGGTAC CCCGTGTGGT CCGCGGCAGT CAACGGTTGG CTGGACGTGG GTGCGCCGGT GCCGTCGGCG GGCTGGGTCG GGCCGCGTGA CGAGACTCCC GACGCCGAGG AACGTCTCGC CCTCCTGCTC GACGGTCGGC GCGAGACATC ATCCGGGCGG AAGGTTGTGC CATTGGAGAT GGACCCGCGC GGCACCCGTC CCGACGACGG CACCCGTCCC GACGACGACG CCCGACACGT GTGGCTGGCG ACCGCGGGGC GCCTGTGGAC AAGCGGCGTC AGCCTCGACT GGTCGGCGTT GCATGCTGGC CAGGGACATC GGCGCGTGCC CCTGCCGACC TACCCGTTCC AGCGCAGGCG CTACTGGGTC GAGGCGGACG AGCACGCGTT TCCGGCGACT GCCGGAGGAA CCGATGCCGG CCCCGCGCCG GCGGAGGGGA CGCTGCGCGA CCGAGTGGAG ACGGCACGGG ACGCCGAGAG ACCCGCACTT GTCATCGATT TCATCCAACA TCAGGTCGCG GAAATGCTCG GACTGGACGA TGCGGCACAG GTGGACCCGG ATCAGAACCT CTTCACGCTG GGGCTGGACT CGCTGAATCT GATCGAGATC GCGGCACGGC TGGGTGCCGA GCTGGAGCAG GACGTCCGGG CCTCGGTCTT CACCGATCAT CCGACTATCC GCGCCTACGT GGAAAAGCTG GCGGCATCGC AGGACCTGCC CGGCACCGGC CCAGGCACCC TGACGGGGTA G
|
Protein sequence | MERPDDVRDS DIAIIGMACR FPGADTVEEF WQNLCDGRET TTFFTEAELL AAGVGPALVA DPRYVRAGQI LADIEMFDAE VFGITSEEAE LLDPQQRHFL ECALTALENA GYNPDTCPGP IGIYAGAGLN TYLLRNLSGR FRHGSTLERY RLMIGNDKDF LATRVAYKLN LRGPAVNVNT ACSTSLVAVN LACLSLLGGE CDLALAGAAH IKVPQAEGYL FQEGMILSPD GRCRAFDARA QGTILGSGVG IVVLKRLADA LADGDWIHAV IKGSAVNNDG GAKTGYTAPS VQGQAAVIAE AQALAGCAAE TITYVEAHGT GTPLGDPVEL AALTDVFLRQ DVFLRQTGAG ARCAIGSVKT NIGHLDAASG IAGLIKTSLM LRHRRIVPSL HFEKPNPDID FDANPFRVAT ESQEWPASGT APRRAGVSSF GIGGTNAHVI LEEPPPRERE PGVPPSDGWQ LLMISAGSVA ALEEATENLA RHLRVHRETL DLADVAYTLA VGRRARAYRR ALVCRDVHDA ALTLALGEPD RVSTGQVADG EADVVFVFAH RLMENGDDTP LAGLYRDAAV YRDVAVFRDA VDRCAVALKT LGVRPRTPDP ASLLRARHGD AVAAFVGQYA LAELWTACGV RPSALVGFGS GDLVAACLAG IFPLESALGL AWAQAHGEPA GNLTPAAPRY PVWSAAVNGW LDVGAPVPSA GWVGPRDETP DAEERLALLL DGRRETSSGR KVVPLEMDPR GTRPDDGTRP DDDARHVWLA TAGRLWTSGV SLDWSALHAG QGHRRVPLPT YPFQRRRYWV EADEHAFPAT AGGTDAGPAP AEGTLRDRVE TARDAERPAL VIDFIQHQVA EMLGLDDAAQ VDPDQNLFTL GLDSLNLIEI AARLGAELEQ DVRASVFTDH PTIRAYVEKL AASQDLPGTG PGTLTG
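
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard genetic code, which translation table 11 shares for amino acid assignments (table 11 differs in the start codons it permits). Only the first 30 and last 21 nucleotides of the gene are used, so the GC helper is shown for illustration and will not reproduce the 69% figure unless it is given the full sequence.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 21 nucleotides, copied from the gene sequence above.
first_30 = "ATGGAACGAC CAGACGACGT CCGGGATTCC"
last_21 = "CCAGGCACCC TGACGGGGTA G"

print(translate(first_30))  # MERPDDVRDS
print(translate(last_21))   # PGTLTG*
```

The two translated fragments match the start and end of the listed protein sequence, with the final TAG codon acting as the stop.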