Gene Franean1_4498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4498 
Symbol 
ID5672848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5366864 
End bp5368003 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content69% 
IMG OID641243365 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001508781 
Protein GI158316273 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.647319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGACG CAGTAATCGT CGAGGCCGTC CGCAGCCCGC TAGGGAAGCG CAACGGAGGT 
CTCTCCGCTG TTGCGCCCGT CGACCTGGGC GCGTCGCTGC TGCAAGCGCT GGTCGCGCGG
ACGGGCCTGG ACCCGGCGGT CGTCGACGAC GTCGTCTGGG GCTGTGTAAC CCAGGTTGCG
GAGCAGTCAG TCAACATCGG GCGCAACACC GTCCTCGCGG CAGGCTGGCC CGAAACCGTG
CCCGGGACCA CGGTGGACCG CCAGTGCGGC TCGTCTCAGC AGGCCGTGCA CTTCGCGGCG
GCGAGCGTGA TCGCGGGCTA CTACGATGTC GTCGTCGCCG GCGGGGTGGA ATCCATGAGT
CGGGTGCCGA TGGGCTCCCA GACCATCGGT GCCTGGCCCT TCGGTGATGG CTTCCGCACC
CGCTACCCCG ACATTGAGCC CAACCAGGGC ATCGCGGCCG AGATGATCGC CGAGAAGTAC
GGCTTCTCGC GGGAGGCGCT GGACGAGTTC AGCCTCCGAT CGCATGAACT CGCGGCCGCA
GCCCAGGACA ACGACGCGTT CGCCGCCGAG ATCGTCCCGG TCGACGCGGC CCCCGGCGTC
GTCGCAGACG AGGGCATCCG CCGCGGCGGC GACGTGACGA CCCTCGGCAA GCTGAGCACC
CCGTTCAAGC CGGACGGCGT CATCTCGGCA GGCAACTCCT CCCAGATTAG CGACGGCGCC
TCAGCCCTGC TCATCACGAG CAGCGAGAAG GCGGCGGAGC TTGGTCTGCG CCCGATCGTC
CGGATCCACT CCGCGGTAGT CGTGGGTGAC GACCCCGTCA TGATGCTGAC CGGGCCCATC
CCGGCGACAG CCCGCATCCT GCGCCGCAGC GGGCTGTCTC TCGGGGACAT CGGCACGTTC
GAGGTCAATG AGGCCTTCGC GTCTATCCCC CTGGCCTGGC TCGCTGAGAC GGGCGCCGAC
CCGGCCAGGC TGAACCCACG TGGCGGGGCG ATCGCGCTCG GCCATCCGCT GGGCGGCAGC
GGCGGCCGAC TGATGAGCAC AATGATCAAT CACATGCGCG ATAATGGAAT CCGCTACGGG
CTGCAGACGA TGTGCGAGGG CGGCGGGCTC GCTAACGCCA CGATCCTCGA GCTGCTGTAA
 
Protein sequence
MRDAVIVEAV RSPLGKRNGG LSAVAPVDLG ASLLQALVAR TGLDPAVVDD VVWGCVTQVA 
EQSVNIGRNT VLAAGWPETV PGTTVDRQCG SSQQAVHFAA ASVIAGYYDV VVAGGVESMS
RVPMGSQTIG AWPFGDGFRT RYPDIEPNQG IAAEMIAEKY GFSREALDEF SLRSHELAAA
AQDNDAFAAE IVPVDAAPGV VADEGIRRGG DVTTLGKLST PFKPDGVISA GNSSQISDGA
SALLITSSEK AAELGLRPIV RIHSAVVVGD DPVMMLTGPI PATARILRRS GLSLGDIGTF
EVNEAFASIP LAWLAETGAD PARLNPRGGA IALGHPLGGS GGRLMSTMIN HMRDNGIRYG
LQTMCEGGGL ANATILELL