Gene Francci3_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1978 
Symbol 
ID3903686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2322036 
End bp2324816 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content69% 
IMG OID637879314 
Productbeta-ketoacyl synthase 
Protein accessionYP_481081 
Protein GI86740681 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.690285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGAC CAGACGACGT CCGGGATTCC GACATCGCTA TTATCGGGAT GGCGTGCCGT 
TTTCCCGGGG CCGATACCGT CGAGGAATTC TGGCAGAATC TGTGTGACGG GCGTGAGACC
ACCACCTTCT TCACCGAGGC CGAACTGCTG GCAGCCGGCG TCGGCCCCGC ACTCGTGGCG
GACCCGCGCT ATGTGCGGGC CGGGCAGATA CTGGCCGACA TCGAGATGTT CGACGCCGAG
GTCTTCGGGA TCACGAGCGA GGAGGCGGAG CTCCTGGACC CACAGCAACG GCACTTCCTC
GAATGCGCGC TCACGGCGCT TGAGAACGCC GGTTACAACC CGGACACCTG CCCCGGACCG
ATCGGTATCT ATGCCGGCGC CGGGCTGAAC ACCTACCTTC TACGCAACCT CTCCGGCCGT
TTCCGCCACG GGTCGACGCT GGAACGTTAC CGGCTGATGA TAGGCAACGA CAAGGACTTC
CTGGCGACGC GCGTGGCCTA CAAGCTGAAC CTGCGGGGCC CGGCGGTGAA CGTGAACACC
GCCTGCTCCA CTTCGCTGGT GGCCGTCAAT CTGGCCTGCC TGAGCCTGCT GGGCGGAGAG
TGCGATCTTG CGTTGGCCGG AGCGGCCCAT ATCAAGGTGC CACAGGCCGA GGGCTACCTT
TTCCAGGAGG GAATGATCCT GTCGCCGGAC GGCCGGTGCC GGGCTTTCGA CGCCCGAGCG
CAGGGCACGA TCCTCGGCAG CGGTGTCGGC ATCGTGGTCC TGAAGCGGTT GGCCGACGCG
CTCGCCGACG GCGACTGGAT TCACGCGGTC ATCAAGGGAA GCGCGGTCAA CAACGACGGC
GGCGCGAAGA CCGGCTACAC GGCTCCCAGC GTGCAGGGCC AGGCCGCCGT GATCGCCGAG
GCTCAGGCTC TTGCCGGTTG CGCCGCCGAG ACGATCACCT ATGTCGAGGC CCACGGTACC
GGCACCCCGC TCGGGGACCC GGTCGAGCTC GCCGCCCTGA CCGACGTGTT CCTGCGGCAG
GACGTGTTCC TGCGGCAGAC AGGGGCAGGC GCCCGGTGCG CCATCGGTTC GGTCAAGACC
AACATCGGCC ACCTGGACGC GGCGTCCGGG ATCGCCGGAC TGATCAAAAC CTCGCTGATG
CTGCGACACC GGCGGATCGT GCCGAGCCTG CACTTCGAAA AACCCAATCC CGACATCGAC
TTCGACGCCA ATCCCTTCCG CGTCGCCACG GAGTCCCAGG AGTGGCCGGC ATCGGGCACA
GCCCCGCGAC GTGCCGGCGT GAGCTCGTTC GGCATTGGCG GCACGAACGC CCATGTGATT
CTTGAGGAGC CGCCGCCCCG CGAACGCGAG CCCGGGGTGC CGCCTTCGGA CGGCTGGCAG
TTGCTGATGA TCTCTGCCGG CTCGGTCGCG GCGCTCGAGG AGGCAACGGA GAATCTGGCG
CGTCATCTGC GCGTGCACCG CGAGACGCTC GACCTCGCCG ACGTGGCGTA CACGCTCGCG
GTGGGACGCC GGGCACGTGC GTACCGACGG GCGCTGGTGT GCCGGGACGT CCACGACGCC
GCACTGACGC TCGCGCTCGG GGAGCCCGAC CGCGTGTCCA CCGGGCAGGT TGCCGACGGC
GAGGCCGACG TAGTCTTCGT TTTCGCGCAC CGTCTTATGG AGAACGGCGA CGACACCCCT
CTCGCCGGCC TGTACCGCGA CGCCGCCGTA TACCGCGACG TCGCTGTGTT CCGCGACGCG
GTCGACCGGT GCGCGGTGGC CCTCAAGACT CTCGGGGTGA GGCCGCGGAC ACCCGACCCG
GCGTCGTTGT TGCGTGCGCG GCATGGCGAC GCCGTCGCCG CCTTCGTCGG GCAGTACGCT
CTCGCCGAGC TCTGGACGGC ATGTGGTGTG CGTCCGAGCG CGCTGGTGGG GTTCGGCTCG
GGCGATCTGG TCGCCGCCTG CCTCGCCGGG ATCTTCCCCC TCGAGTCCGC GCTCGGGCTG
GCCTGGGCGC AGGCGCACGG CGAGCCGGCC GGCAACCTCA CGCCAGCGGC CCCCCGGTAC
CCCGTGTGGT CCGCGGCAGT CAACGGTTGG CTGGACGTGG GTGCGCCGGT GCCGTCGGCG
GGCTGGGTCG GGCCGCGTGA CGAGACTCCC GACGCCGAGG AACGTCTCGC CCTCCTGCTC
GACGGTCGGC GCGAGACATC ATCCGGGCGG AAGGTTGTGC CATTGGAGAT GGACCCGCGC
GGCACCCGTC CCGACGACGG CACCCGTCCC GACGACGACG CCCGACACGT GTGGCTGGCG
ACCGCGGGGC GCCTGTGGAC AAGCGGCGTC AGCCTCGACT GGTCGGCGTT GCATGCTGGC
CAGGGACATC GGCGCGTGCC CCTGCCGACC TACCCGTTCC AGCGCAGGCG CTACTGGGTC
GAGGCGGACG AGCACGCGTT TCCGGCGACT GCCGGAGGAA CCGATGCCGG CCCCGCGCCG
GCGGAGGGGA CGCTGCGCGA CCGAGTGGAG ACGGCACGGG ACGCCGAGAG ACCCGCACTT
GTCATCGATT TCATCCAACA TCAGGTCGCG GAAATGCTCG GACTGGACGA TGCGGCACAG
GTGGACCCGG ATCAGAACCT CTTCACGCTG GGGCTGGACT CGCTGAATCT GATCGAGATC
GCGGCACGGC TGGGTGCCGA GCTGGAGCAG GACGTCCGGG CCTCGGTCTT CACCGATCAT
CCGACTATCC GCGCCTACGT GGAAAAGCTG GCGGCATCGC AGGACCTGCC CGGCACCGGC
CCAGGCACCC TGACGGGGTA G
 
Protein sequence
MERPDDVRDS DIAIIGMACR FPGADTVEEF WQNLCDGRET TTFFTEAELL AAGVGPALVA 
DPRYVRAGQI LADIEMFDAE VFGITSEEAE LLDPQQRHFL ECALTALENA GYNPDTCPGP
IGIYAGAGLN TYLLRNLSGR FRHGSTLERY RLMIGNDKDF LATRVAYKLN LRGPAVNVNT
ACSTSLVAVN LACLSLLGGE CDLALAGAAH IKVPQAEGYL FQEGMILSPD GRCRAFDARA
QGTILGSGVG IVVLKRLADA LADGDWIHAV IKGSAVNNDG GAKTGYTAPS VQGQAAVIAE
AQALAGCAAE TITYVEAHGT GTPLGDPVEL AALTDVFLRQ DVFLRQTGAG ARCAIGSVKT
NIGHLDAASG IAGLIKTSLM LRHRRIVPSL HFEKPNPDID FDANPFRVAT ESQEWPASGT
APRRAGVSSF GIGGTNAHVI LEEPPPRERE PGVPPSDGWQ LLMISAGSVA ALEEATENLA
RHLRVHRETL DLADVAYTLA VGRRARAYRR ALVCRDVHDA ALTLALGEPD RVSTGQVADG
EADVVFVFAH RLMENGDDTP LAGLYRDAAV YRDVAVFRDA VDRCAVALKT LGVRPRTPDP
ASLLRARHGD AVAAFVGQYA LAELWTACGV RPSALVGFGS GDLVAACLAG IFPLESALGL
AWAQAHGEPA GNLTPAAPRY PVWSAAVNGW LDVGAPVPSA GWVGPRDETP DAEERLALLL
DGRRETSSGR KVVPLEMDPR GTRPDDGTRP DDDARHVWLA TAGRLWTSGV SLDWSALHAG
QGHRRVPLPT YPFQRRRYWV EADEHAFPAT AGGTDAGPAP AEGTLRDRVE TARDAERPAL
VIDFIQHQVA EMLGLDDAAQ VDPDQNLFTL GLDSLNLIEI AARLGAELEQ DVRASVFTDH
PTIRAYVEKL AASQDLPGTG PGTLTG