Gene EcSMS35_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4226 
SymbolfadA 
ID6147277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4322655 
End bp4323818 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content59% 
IMG OID641619049 
Product3-ketoacyl-CoA thiolase 
Protein accessionYP_001746177 
Protein GI170682404 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02445] fatty oxidation complex, beta subunit FadA 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.032199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00446338 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACAGG TTGTCATTGT CGATGCAATT CGCACCCCGA TGGGCCGTTC GAAGGGCGGT 
GCTTTTCGTC ACGTGCGTGC GGAAGATCTC TCCGCTCATT TAATGCGTAG CCTGCTGGCG
CGTAACCCGG CGCTGGAAGC GGCGGCCCTC GACGATATTT ACTGGGGCTG TGTACAGCAG
ACGCTGGAGC AAGGTTTTAA TATCGCCCGT AACGCGGCGC TGTTGGCAGA AGTACCACAC
TCTGTCCCGG CGGTGACCGT CAATCGCCTG TGTGGTTCAT CCATGCAAGC ATTACATGAC
GCAGCGCGAA TGATCATGAC CGGCGATGCG CAGGCATGTC TGGTTGGCGG CGTGGAGCAT
ATGGGCCATG TGCCGATGAG TCACGGCGTC GATTTTCACC CCGGCCTGAG CCGCAATGTC
GCCAAAGCGG CGGGCATGAT GGGCTTAACG GCAGAAATGC TGGCGCGTAT GCACGGTATC
AGCCGTGAAA TGCAGGATGC CTTTGCCGCG CGGTCACACG CCCGCGCCTG GGCCGCCACG
CAGTCGGGTG CATTTAAAAA TGAAATTATC CCGACCGGTG GTCACGATGC CGACGGCGTC
CTGAAGCAGT TTAATTACGA CGAAGTGATT CGCCCGGAAA CCACCGTGGA AGCCCTCGCC
ACGCTGCGCC CGGCGTTTGA TCCCGTCAGC GGTACGGTGA CGGCAGGTAC ATCTTCTGCG
CTTTCCGATG GCGCAGCTGC CATGCTGGTG ATGAGTGAAA GCCGCGCCCG AGAATTAGGT
CTTAAGCCGC GCGCCCGCGT GCGTTCGATG GCGGTCGTTG GTTGTGACCC ATCGATTATG
GGTTACGGCC CGGTTCCGGC CTCGAAGCTG GCGCTGAAAA AAGCGGGGCT TTCTGCCAGC
GATATCGGCG TATTCGAGAT GAACGAAGCC TTTGCCGCGC AGATCCTGCC ATGTATTAAA
GATCTCGGGC TAATGGAGCA GATTGACGAG AAGATCAACC TCAACGGTGG CGCGATCGCG
CTGGGTCATC CGCTGGGTTG TTCCGGTGCG CGTATCAGCA CCACGCTGCT GAATCTGATG
GAGCGCAAAG ATGCCCAGTT TGGTCTGGCG ACGATGTGTA TCGGTCTGGG TCAGGGTATT
GCGACGGTGT TTGAGCGGAT TTAA
 
Protein sequence
MEQVVIVDAI RTPMGRSKGG AFRHVRAEDL SAHLMRSLLA RNPALEAAAL DDIYWGCVQQ 
TLEQGFNIAR NAALLAEVPH SVPAVTVNRL CGSSMQALHD AARMIMTGDA QACLVGGVEH
MGHVPMSHGV DFHPGLSRNV AKAAGMMGLT AEMLARMHGI SREMQDAFAA RSHARAWAAT
QSGAFKNEII PTGGHDADGV LKQFNYDEVI RPETTVEALA TLRPAFDPVS GTVTAGTSSA
LSDGAAAMLV MSESRARELG LKPRARVRSM AVVGCDPSIM GYGPVPASKL ALKKAGLSAS
DIGVFEMNEA FAAQILPCIK DLGLMEQIDE KINLNGGAIA LGHPLGCSGA RISTTLLNLM
ERKDAQFGLA TMCIGLGQGI ATVFERI