Gene Franean1_4557 details


Gene Information

Locus tag: Franean1_4557
Symbol:
ID: 5672904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5435821
End bp: 5436984
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 72%
IMG OID: 641243420
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001508836
Protein GI: 158316328
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
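
The coordinates and lengths listed above are mutually consistent. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (the function name `cds_lengths` is hypothetical, not part of the record):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, prot_len = cds_lengths(5435821, 5436984)
print(gene_len, prot_len)  # 1164 387
```

The result matches the Gene Length (1164 bp) and Protein Length (387 aa) fields.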


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGGGG TTACCGTGCG GGACGCCGTC ATCGTCGAAG CGGTACGGAC GCCGATCGGT 
CGGCGCAACG GCACGCTGCG GGGCGTTCAC CCGGTCGATC TGGCCTCGGA CGTGCTCCGG
ACGCTCACCG AGCGTGCGCA GGTCGCCCCC GAACTCGTCG ACGACGTCAT CTTTGGCTGC
GTCACTCAGA TCGCCGACCA GTCGACGAAC GTCGCCCGCC TGGCCGTGCT GGCCGCCGGC
TGGCCGGAAA GCATCCCGGG GGTGACGCTG GACCGGGCCT GCGGCTCGAG TCAGCAGGCG
GTCCACTTCG CCGCGGCCGG GGTGATGGCG GGCCAGTACG ACCTCGTCGT CGCCGGCGGC
GTCGAGTCGA TGAGCCGGGT TCCGCTGGGT TCGGCCCGGG CCGTCGGTGA GCCGTTCGGC
CCCCGGGTGC GCGCGCGCTA CCAGCGGGAC AGCTTCAGTC AGGGTCTCGG CGCCGAGGAG
ATCGCGCACC GGTGGGGCCT GTCCCGGACC CGGCTCGACG AGTTCGCCCT GGAGTCGCAC
CACCGCGCGA TGGCGGCCGT CGACTCGGGC GCGTTCGACG CGCAGCTCGT TCCCGTCGAG
GGACCGGACG GCGTCCTCAC CGCCGACGAG GGGGTGCGGC GGGAGACGAC GCTCGAGAAG
CTCGGTGCGC TGAAGCCCGC GTTCCGCGAG GACGGGGTGA TCCACGCGGG CAACAGTTCC
CAGATCTCCG ACGGCAGCGC GGCACTGCTG ATCACGACCA GCGAACGGGC GGCCGAGCTG
GGCCTGCGAC CGCTCGCTCG CTTCCACGCC GGTACGGTCG CCGGTGACGA TCCGGTGATC
ATGTTGACCG CTCCGATCCC CGCGACGGAG AAGGTCCTGA AGCGGGCCGG GCTCACGATC
GCCGACATCG GTGTGTACGA GGTGAACGAG GCGTTCGCGA GCGTTCCGCT CGCCTGGGAG
ATCGAGACGG GCGCCGACCA CGCCCGGCTG AACCCGCTCG GGGGCGCGAT CGCGGTGGGC
CACCCGCTCG GCGCCTCGGG CGCGATCCTG ATGACCCGCA TGATCCACCA CATGCGCGAC
CACGGCATCC GCTACGGCCT GCAGACCATG TGCGAGGCCG GCGGCATGGC CAACGCGACC
GTCATCGAGC TGATCGGTGC CTGA
 
Protein sequence
MKGVTVRDAV IVEAVRTPIG RRNGTLRGVH PVDLASDVLR TLTERAQVAP ELVDDVIFGC 
VTQIADQSTN VARLAVLAAG WPESIPGVTL DRACGSSQQA VHFAAAGVMA GQYDLVVAGG
VESMSRVPLG SARAVGEPFG PRVRARYQRD SFSQGLGAEE IAHRWGLSRT RLDEFALESH
HRAMAAVDSG AFDAQLVPVE GPDGVLTADE GVRRETTLEK LGALKPAFRE DGVIHAGNSS
QISDGSAALL ITTSERAAEL GLRPLARFHA GTVAGDDPVI MLTAPIPATE KVLKRAGLTI
ADIGVYEVNE AFASVPLAWE IETGADHARL NPLGGAIAVG HPLGASGAIL MTRMIHHMRD
HGIRYGLQTM CEAGGMANAT VIELIGA
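
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch for stripping that layout and computing length and GC percentage, so the Gene Length and GC content fields can be checked against the gene sequence (the helper name `sequence_stats` is hypothetical; it assumes the input contains only nucleotide letters and whitespace):

```python
def sequence_stats(blocked: str) -> tuple[int, float]:
    """Return (length, GC percent) for a whitespace-blocked nucleotide string."""
    # Remove spaces and line breaks used for the 10-base block layout.
    seq = "".join(blocked.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return len(seq), 100.0 * gc_count / len(seq)


# First row of the gene sequence: six blocks of 10 bases.
first_row = "ATGAAGGGGG TTACCGTGCG GGACGCCGTC ATCGTCGAAG CGGTACGGAC GCCGATCGGT"
row_length, row_gc = sequence_stats(first_row)
print(row_length)  # 60
```

Passing the full gene sequence text to the same function gives the values to compare with the 1164 bp and 72% fields.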