Gene EcHS_A4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4068 
SymbolfadA 
ID5595360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4057869 
End bp4059032 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content58% 
IMG OID640923171 
Product3-ketoacyl-CoA thiolase 
Protein accessionYP_001460637 
Protein GI157163319 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02445] fatty oxidation complex, beta subunit FadA 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.142998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGG TTGTCATTGT CGATGCAATT CGCACCCCGA TGGGCCGTTC GAAGGGCGGT 
GCTTTTCGTA ACGTGCGTGC AGAAGATCTC TCCGCTCATT TAATGCGTAG CCTGCTGGCG
CGTAACCCGG CGCTGGAAGC GGCGGCCCTC GACGATATTT ACTGGGGTTG TGTGCAGCAG
ACGCTGGAGC AGGGTTTTAA CATCGCCCGT AACGCGGCGC TGCTGGCAGA AGTACCACAC
TCTGTCCCGG CGGTTACCGT CAATCGCTTG TGTGGTTCAT CCATGCAGGC ACTGCATGAC
GCAGCACGGA TGATTATGAC CGGCGATGCG CAGGCATGTC TGGTTGGCGG CGTGGAGCAT
ATGGGCCATG TGCCGATGAG TCACGGCGTC GATTTTCACC CCGGCCTGAG CCGCAATGTC
GCCAAAGCGG CGGGCATGAT GGGCTTAACA GCAGAAATGC TGGCGCGTAT GCACGGTATC
AGCCGTGAAA TGCAGGATGC CTTTGCCGCG CGATCGCACG CTCGTGCCTG GGCCGCCACG
CAGTCGGCCG CATTTAAAAA TGAAATCATC CCGACCGGTG GTCACGATGC CGACGGCGTC
CTGAAGCAGT TTAATTACGA CGAAGTGATT CGCCCGGAAA CCACCGTGGA AGCCCTCGCC
ACGCTGCGTC CGGCGTTTGA TCCAGTAAAC GGTACGGTAA CGGCGGGCAC ATCTTCTGCA
CTTTCCGATG GCGCAGCTGC CATGCTGGTG ATGAGTGAAA GCCGCGCCCA TGAATTAGGT
CTTAAGCCGC GCGCTCGTGT GCGTTCGATG GCGGTCGTTG GTTGTGACCC ATCGATTATG
GGTTACGGCC CGGTTCCGGC CTCAAAGCTG GCGCTGAAAA AAGCGGGGCT TTCTGCCAGC
GATATCGGCG TGTTTGAGAT GAACGAAGCC TTTGCCGCGC AGATCCTGCC GTGCATTAAA
GATCTCGGGC TAATGGAGCA GATTGACGAG AAGATCAACC TCAATGGTGG CGCGATCGCG
CTGGGTCATC CGCTGGGTTG TTCCGGTGCG CGTATCAGCA CCACGCTGCT GAATCTGATG
GAGCGCAAAG ACGTTCAGTT TGGTCTGGCG ACGATGTGTA TCGGTCTGGG TCAGGGTATT
GCGACGGTGT TTGAGCGGGT TTAA
 
Protein sequence
MEQVVIVDAI RTPMGRSKGG AFRNVRAEDL SAHLMRSLLA RNPALEAAAL DDIYWGCVQQ 
TLEQGFNIAR NAALLAEVPH SVPAVTVNRL CGSSMQALHD AARMIMTGDA QACLVGGVEH
MGHVPMSHGV DFHPGLSRNV AKAAGMMGLT AEMLARMHGI SREMQDAFAA RSHARAWAAT
QSAAFKNEII PTGGHDADGV LKQFNYDEVI RPETTVEALA TLRPAFDPVN GTVTAGTSSA
LSDGAAAMLV MSESRAHELG LKPRARVRSM AVVGCDPSIM GYGPVPASKL ALKKAGLSAS
DIGVFEMNEA FAAQILPCIK DLGLMEQIDE KINLNGGAIA LGHPLGCSGA RISTTLLNLM
ERKDVQFGLA TMCIGLGQGI ATVFERV