Gene ECH74115_5284 details


Gene Information

Locus tag: ECH74115_5284
Symbol: fadA
ID: 6968549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4926844
End bp: 4928007
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 58%
IMG OID: 643388948
Product: 3-ketoacyl-CoA thiolase
Protein accession: YP_002273362
Protein GI: 209398395
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02445] fatty oxidation complex, beta subunit FadA
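
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4926844, 4928007)
print(length_bp)                  # 1164
print(protein_length(length_bp))  # 387
```

Under these assumptions the record is internally consistent: the coordinates give the listed gene length of 1164 bp, and 1164 bp corresponds to 387 residues plus one stop codon.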


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.0377447
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAGG TTGTCATTGT CGATGCAATT CGCACCCCGA TGGGCCGTTC GAAGGGCGGT 
GCTTTTCGTA ACGTGCGTGC AGAAGATCTC TCCGCTCATT TAATGCGTAG CCTGCTGGCG
CGTAACCCGG CGCTGGAAGC GGCGGCCCTC GACGATATTT ACTGGGGGTG TGTGCAGCAG
ACGCTGGAGC AGGGTTTTAA TATCGCCCGT AACGCGGCGC TGCTGGCAGA AGTGCCGCAC
TCTGTCCCGG CGGTTACTGT CAATCGCTTG TGTGGTTCAT CCATGCAGGC ACTGCATGAC
GCAGCACGAA TGATCATGAC CGGCGATGCG CAGGCATGTC TGGTTGGCGG CGTGGAGCAT
ATGGGCCATG TGCCGATGAG TCACGGCGTC GATTTTCACC CTGGCCTGAG CCGCAATGTC
GCCAAAGCGG CGGGCATGAT GGGCTTAACG GCAGAAATGC TGGCGCGTAT GCACGGTATC
AGCCGTGAAA TGCAGGATGC CTTTGCCGCG CGGTCACACG CCCGCGCCTG GGCCGCCACG
CAGTCGGGCG CATTTAAAAA TGAAATCATC CCAACTGGTG GTCACGATGC CGACGGCGTC
CTGAAGCAGT TTAATTACGA CGAAGTGATT CGCCCGGAAA CCACCGTGGA AGCCCTCGCC
ACGCTGCGTC CGGCGTTTGA TCCAGTAAAC GGTACGGTAA CGGCGGGCAC ATCTTCTGCA
CTTTCCGATG GCGCAGCTGC CATGCTGGTG ATGAGTGAAA GCCGCGCCCA TGAATTAGGT
CTTAAGCCGC GCGCTCGTGT GCGTTCGATG GCGGTCGTTG GTTGTGACCC ATCGATCATG
GGTTACGGCC CGGTTCCGGC CTCGAAGCTG GCGCTGAAAA AAGCGGGGCT TTCTGTCAGT
GATATCGGCG TGTTTGAAAT GAATGAAGCC TTTGCCGCGC AGATCCTGCC ATGTATTAAA
GATCTGGGAC TAATTGAGCA GATTGACGAG AAGATCAACC TCAACGGTGG CGCGATCGCG
CTGGGTCATC CGCTGGGTTG TTCCGGTGCG CGTATCAGCA CCACGCTGCT GAATCTGATG
GAACACAAAG ACGTTCAGTT TGGTCTGGCG ACGATGTGTA TCGGTCTGGG TCAGGGTATT
GCGACGGTGT TTGAGCGGGT TTAA
 
Protein sequence
MEQVVIVDAI RTPMGRSKGG AFRNVRAEDL SAHLMRSLLA RNPALEAAAL DDIYWGCVQQ 
TLEQGFNIAR NAALLAEVPH SVPAVTVNRL CGSSMQALHD AARMIMTGDA QACLVGGVEH
MGHVPMSHGV DFHPGLSRNV AKAAGMMGLT AEMLARMHGI SREMQDAFAA RSHARAWAAT
QSGAFKNEII PTGGHDADGV LKQFNYDEVI RPETTVEALA TLRPAFDPVN GTVTAGTSSA
LSDGAAAMLV MSESRAHELG LKPRARVRSM AVVGCDPSIM GYGPVPASKL ALKKAGLSVS
DIGVFEMNEA FAAQILPCIK DLGLIEQIDE KINLNGGAIA LGHPLGCSGA RISTTLLNLM
EHKDVQFGLA TMCIGLGQGI ATVFERV
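
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The sketch below is one way to strip the spacing, compute a GC fraction, and translate the nucleotide sequence. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over "TCAG".
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last line of the gene sequence listed above.
print(translate("ATGGAACAGG TTGTCATTGT CGATGCAATT"))  # MEQVVIVDAI
print(translate("GCGACGGTGT TTGAGCGGGT TTAA"))        # ATVFERV
```

The two translated fragments match the first block and the final block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.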