Gene Francci3_2997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2997 
Symbol 
ID3905494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3550286 
End bp3551479 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content71% 
IMG OID637880317 
ProductAcetyl-CoA C-acyltransferase 
Protein accessionYP_482083 
Protein GI86741683 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.488801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAACG CAGTCATCGT CGATGGGGTA CGCACGGCGT CGGGACGGGG CAAGCCGGGT 
GGGGCGCTGT CCGAGACCCA TCCGGTGGAG CTGCTGGCCA CCGTGCTGAA GGCACTGATC
GCCCGTAACG ACCTTGATCC GGCGCTGGTC GACGACGTCA TCGCCGGGTG TGTTGACCAG
GCCGGCGAGC AGGCCGTCAA CATCGGGCGC ACCGCGGTGC TGTCGGCCGG GTTCCCCGAG
TCGGTGCCGG CCACCACGAT CGACCGCCAG TGCGGCTCGA GCCAGCAGGC CGCCCATTTC
GCCGCGCAGG GGGTGCTCGC CGGCGCCTAC GACATCGTCA TCGCCGCCGG CGTGGAGTCG
ATGAGCCGGG TGCCGATGGG CTCCACCACT TTCGGCAAGG ACCCCAACGG CCCGAGCCTG
CACGCCCGCT ACCCGGAGGG CCTGGCCCAC CAGGGCATCG GCGCGGAGCT CGTCAGCGCC
CGATGGAAGA TCAACCGGGA GGACCTGGAC ATCTTCTCCG CGCGGTCGCA CCAGCTCGCG
GCGGCCTCCG TCGCGGCGGG GGACTTCGCC GGGGAGATCG TCCCGGTCGA GATCACCCTT
CCGGACGGCA CGACAGCCCA GCACACCGTG GACGAGACGG TCCGAGCGAC CACGACCGTC
GAGACGCTGG CGAAGCTCAA GCCGTCCTTC TACACCGAGG CGTACGCCGC CCGCTTCCCG
GAGATCACCT GGAACATCAC CCCGGGTAAC TCCTCCCCGC TGACCGACGG CGCCTCGGCC
GTCCTGATCA TGAGCGAGAC CAGGGCGAAC AAGCTCGGCC TGCGGCCACG GGCCCGGTTC
CACACGTTCG CGCTCGCCGG GGACGACCCG TTGCTCATGC TGACCGCGCC GATCCCGGCA
ACCCGCAAGG CGCTCAAGCG CGCCGGTCTG AGCATCGACG ACATCGACGC CTTCGAGGTC
AACGAGGCGT TCGCGCCCGT GCCGCTCATG TGGGCCCGCG ACACCGGTGC CGACCCGGCG
AAGCTCAACC CGCGCGGGGG CGCCATCGCG CTGGGCCACC CGCTGGGCGG ATCGGGCACC
CGCCTGCTCA CCACGATGCT CAACTACCTG GAGGCCACCG GCGGCCGTTA CGGCCTGCAG
ACGATGTGCG AGGGCGGCGG CATGGCCAAC GCCACCATCA TCGAACGGCT CTGA
 
Protein sequence
MENAVIVDGV RTASGRGKPG GALSETHPVE LLATVLKALI ARNDLDPALV DDVIAGCVDQ 
AGEQAVNIGR TAVLSAGFPE SVPATTIDRQ CGSSQQAAHF AAQGVLAGAY DIVIAAGVES
MSRVPMGSTT FGKDPNGPSL HARYPEGLAH QGIGAELVSA RWKINREDLD IFSARSHQLA
AASVAAGDFA GEIVPVEITL PDGTTAQHTV DETVRATTTV ETLAKLKPSF YTEAYAARFP
EITWNITPGN SSPLTDGASA VLIMSETRAN KLGLRPRARF HTFALAGDDP LLMLTAPIPA
TRKALKRAGL SIDDIDAFEV NEAFAPVPLM WARDTGADPA KLNPRGGAIA LGHPLGGSGT
RLLTTMLNYL EATGGRYGLQ TMCEGGGMAN ATIIERL