Gene Francci3_1330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1330 
Symbol 
ID3906602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1596175 
End bp1597383 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content71% 
IMG OID637878663 
ProductAcetyl-CoA C-acyltransferase 
Protein accessionYP_480436 
Protein GI86740036 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.175377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.933214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCGGAT CATCCAGAAA CTTCGTACGC GACGTCGTGT TCGTCGACGG GGTCCGAACC 
CCGTTCGGGA AGGCCAAGGG CGTCTACGCC GAGACCCGTG CCGACGATCT GATCGTGCGG
GTGATCCGGG AGCTGCTGCG ACGCAATCCT TCCCTCCCCC CCGAGCGGAT CGACGAGGTG
GCGATCGCGG CCACCACCCA GATCGGCGAT CAAGGCCTGA CCATCGGCCG GGTCGCCGCC
ATCCTGTCCG GGTTGCCGGA AACGGTGCCG GGCTTCGCGA TCGACCGGAT GTGCGCAGGC
GCCGTCACGG CGGTGACCAC GACGGCCTCG TCGATCGCCG TCGGCGCCTA CGACGTGGCC
GTGGCCGGCG GGGTGGAACA CATGGGCCGC CACCCGATGG GCGAAGGCGC CGACCTCAAC
CCCCGGTTCG TCGCCGAACG GCTCGTGGAC ACCTCCGCGC TGGTGATGGG CTCGACCGCC
GAGAACCTGC ACGACCGGTA CCCCAAGATC ACCAAATCGC GGGCGGACGC CTTCGCTCTG
GCCTCCCAGG AGAAGGTCGC CAAGGCGTAC GCGAACGGTC AGATCCAGCC CGACCTCGTC
CCGGTGGCCG CCCGGCACGT CGAGACGGGC TGGGAGTTCG TCACCGTCGA CGAGCCGCCG
CGACCGGACA CCACGCAGGC GGGGCTCGCC GGGCTGCGCA CGCCGTTCCG GCCGCACGGG
CGGGTGACCG CGGGCAACTC CGCCGGGCTC AACGACGGCG CCACCGGCTG CCTGCTCGCC
GCCGCCGAGG TGGCGGCCGA GCTAAGCCTG CCGGCGAAGA TGAGCCTGGT GGGTTTCGCG
TTCGCCGGAG TCCCCCCCGA GGTGATGGGC GTCGGGCCGA TCCCGTCGAC GGAGAAGGCG
CTGGCCCGCA CCGGCCTGAC CATCGACGAC ATCGGCCTGT TCGAGCTGAA CGAGGCCTTC
GCCGTGCAGG TCCTGGCCTT CCTCGACCAC TTCGGCATCG CCGAGGACGA TCCCCGGGTC
AACCCGTACG GCGGGGCCAT CGCGTTCGGT CACCCACTGG CGTCCAGCGG GGTGCGCCTG
ATGACCCAGC TCGCCCGGCA GTTCGCCGAG CATCCCGAGG TGCGCTATGG CATGACGGCG
ATGTGCGTGG GTCTCGGCAT GGGCGCCACC ACGATCTGGG CGAATCCGCA CCACGCCGCG
GCCAACTGA
 
Protein sequence
MFGSSRNFVR DVVFVDGVRT PFGKAKGVYA ETRADDLIVR VIRELLRRNP SLPPERIDEV 
AIAATTQIGD QGLTIGRVAA ILSGLPETVP GFAIDRMCAG AVTAVTTTAS SIAVGAYDVA
VAGGVEHMGR HPMGEGADLN PRFVAERLVD TSALVMGSTA ENLHDRYPKI TKSRADAFAL
ASQEKVAKAY ANGQIQPDLV PVAARHVETG WEFVTVDEPP RPDTTQAGLA GLRTPFRPHG
RVTAGNSAGL NDGATGCLLA AAEVAAELSL PAKMSLVGFA FAGVPPEVMG VGPIPSTEKA
LARTGLTIDD IGLFELNEAF AVQVLAFLDH FGIAEDDPRV NPYGGAIAFG HPLASSGVRL
MTQLARQFAE HPEVRYGMTA MCVGLGMGAT TIWANPHHAA AN