Gene Francci3_2596 details


Gene Information

Locus tag: Francci3_2596
Symbol:
ID: 3906502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3063225
End bp: 3064439
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 73%
IMG OID: 637879921
Product: beta-ketoacyl synthase
Protein accession: YP_481687
Protein GI: 86741287
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0304] 3-oxoacyl-(acyl-carrier-protein) synthase
TIGRFAM ID: [TIGR03150] beta-ketoacyl-acyl-carrier-protein synthase II
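
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3063225
end_bp = 3064439

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1215 bp
print(protein_length, "aa")  # 404 aa
```

Under these assumptions the computed values agree with the reported Gene Length (1215 bp) and Protein Length (404 aa).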


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0887109
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTTACG ACAAGATCGT GGTAACGGGT CTCGGTGCGA TGACTCCACT CGGCGGCGAC 
CTCGCCGCCA CCTGGGACGG GCTGGTCGCG GGTGCCTCGG GTGTCGCGGT GATCGAAGAC
GAATGGGCGC GGGAGCTGCC GGTCCGACTC GCCGCACAGC TGCCCGTCGA CCCCGCGGCA
ACACTGCCGC GCAAGGACAG CCGCAAGCTC GACCGCGGTG AACAGATGGC GATCGTCACC
GCCCGGCAGG CGTGGGCGGA CGCGGGATCC CCCGAGGTCG CTCCCGAGCG GCTCGCGGTC
GTCATCGGCA CCGGTTACGG CGGGGTGCAG TCCACCCTGG GCCAGCACCG CATCCTGGAG
CAGGCCGGTG CGCGCCGGAT GTCCCCTCAC ACGGTGATCA TGCTAATGCC CAACGGCCCG
GCGGCGTGGG TGAGCATCGA CCTCGGCGCG AAGGCCGGCG CCCGCACCCC GGTCAGCGCC
TGCGCGTCCG GCGCCGAGGC GATCGCCACC GGTATGGAGA TGATCCAACA CGGGCTGGCC
GACGTCGTGG TCGCCGGCGG CGTCGAGGCG CCGGTCGACA CGCTGCCGCT GGCGGCGTTC
GCGCAGATGA AGGCCCTCTC GACCCGGCAC AGCGACCCGG CAGCCGCCTC CCGGCCCTTC
GACGCCGACC GTGACGGCTT CGTGCTCGGC GAGGGCGCCG GCCTGCTGGT CCTGGAACGG
GCCGGGTTCG CCGCCGCCCG CGGCGCCCGC GTGTACGCCG TCGCCGCCGG CGCGGCCACC
AACTCCGACG CCCTCGACAT CGTCACCGCC GACCCCGCCG GGCAGCGCAG GGCCATCGAG
GCCGCACTGG CCGGCGCCGG CCTCACCCCC ACCGATATCG ACCTCGTACA CGCCCACGCC
ACCTCTACCC CCGTCGGCGA CCCCCTGGAG GCCGAGGCGA TCACTCAGAC AATCGGCACC
CATCCGGCGG TCACGGCCAC CAAGTCGATG ACCGGGCACA TGCTCGGCGC CGCCGGCGCG
GTGGGCGCCA TCGCCACGGT GCTCAGCATC CGCGACGGCG TCATACCCCC GGTGCGCAAC
CTTGACCGGC TCAACCCGAC GATCAAGCTC GACGTCGTGT CCGGACCGGC CCGCCACGAG
ACCGTCCGCG CCGCCGTGGC CAACGCCTTC GGCTTCGGTG GGCACAACGT CTCCCTGGCA
TTCACCGCCC CCTGA
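
The GC content field in the gene information is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows, assuming the spaces and line breaks in the listing are layout only. The function name gc_content is illustrative and not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and newlines used to lay out the sequence in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first line of the gene sequence; pass the full sequence
# to compare the result against the reported GC content.
first_line = "GTGGTTTACG ACAAGATCGT GGTAACGGGT CTCGGTGCGA TGACTCCACT CGGCGGCGAC"
print(f"{gc_content(first_line):.1f}%")
```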
 
Protein sequence
MVYDKIVVTG LGAMTPLGGD LAATWDGLVA GASGVAVIED EWARELPVRL AAQLPVDPAA 
TLPRKDSRKL DRGEQMAIVT ARQAWADAGS PEVAPERLAV VIGTGYGGVQ STLGQHRILE
QAGARRMSPH TVIMLMPNGP AAWVSIDLGA KAGARTPVSA CASGAEAIAT GMEMIQHGLA
DVVVAGGVEA PVDTLPLAAF AQMKALSTRH SDPAAASRPF DADRDGFVLG EGAGLLVLER
AGFAAARGAR VYAVAAGAAT NSDALDIVTA DPAGQRRAIE AALAGAGLTP TDIDLVHAHA
TSTPVGDPLE AEAITQTIGT HPAVTATKSM TGHMLGAAGA VGAIATVLSI RDGVIPPVRN
LDRLNPTIKL DVVSGPARHE TVRAAVANAF GFGGHNVSLA FTAP
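
The protein sequence above corresponds to the gene sequence translated with translation table 11, in which GTG can act as a start codon and is then read as methionine (M). The sketch below illustrates this on the first 30 bases of the gene. It assumes the input is a complete coding sequence that begins at its start codon; the helper name translate_cds is hypothetical. The amino acid assignments are those of the standard genetic code, which table 11 shares.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[index:index + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is a valid initiator (such as GTG),
        # so it is read as methionine regardless of its usual meaning.
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


print(translate_cds("GTGGTTTACG ACAAGATCGT GGTAACGGGT"))  # MVYDKIVVTG
```

The output matches the first ten residues of the protein sequence. Note that the GTG at the start is read as M, while the internal GTG codon is read as V.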