Gene EcolC_4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4165 
SymbolfadA 
ID6067123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4603084 
End bp4604247 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content57% 
IMG OID641603593 
Product3-ketoacyl-CoA thiolase 
Protein accessionYP_001727089 
Protein GI170022135 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02445] fatty oxidation complex, beta subunit FadA 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000604115 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACAGG TTGTCATTGT CGATGCTATT CGCACCCCGA TGGGCCGTTC GAAGGGCGGT 
GCTTTTCGTA ACGTGCGTGC AGAAGATCTC TCCGCTCATT TAATGCGTAG CCTGCTGGCG
CGTAACCCGG CGCTGGAAGC GGCGGCCCTC GACGATATTT ACTGGGGTTG TGTGCAGCAG
ACGCTGGAGC AAGGTTTTAA TATCGCCCGT AACGCGGCGC TGTTGGCAGA AGTGCCGCAC
TCTGTCCCGG CGGTTACCGT CAATCGCTTG TGTGGTTCAT CCATGCAGGC ACTGCATGAC
GCAGCACGAA TGATCATGAC CGGCGATGCG CAGGCATGTC TGGTCGGCGG CGTGGAGCAT
ATGGGCCATG TGCCAATGAG CCACGGTGTC GATTTTCACC CTGGTCTGAG CCGCAATGTC
GCCAAAGCGG CGGGCATGAT GGGCTTAACG GCAGAAATGC TGGCGCGTAT GCACGGTATC
AGCCGTGAAA TGCAGGATGC CTTTGCCGCG CGGTCACACG CCCGCGCCTG GGCTGCCACG
CAGTCGGCCG CATTTAAAAA TGAAATCATC CCAACTGGTG GTCACGATGC CGACGGCGTC
CTGAAGCAGT TTAATTACGA CGAAGTGATT CGCCCGGAAA CCACCGTGGA AGCCCTCGCC
ACGCTGCGTC CGGCGTTTGA TCCAGTAAAC GGTACGGTAA CGGCGGGCAC ATCTTCTGCA
CTTTCCGATG GCGCAGCTGC CATGCTGGTG ATGAGTGAAA GCCGCGCCCA TGAATTAGGT
CTTAAGCCGC GCGCTCGTGT GCGTTCGATG GCGGTCGTTG GTTGTGACCC ATCGATTATG
GGTTACGGCC CGGTTCCGGC CTCGAAGCTG GCGCTGAAAA AAGCGGGGCT TTCTGTCAGT
GATATCGGCG TGTTTGAAAT GAATGAAGCC TTTGCCGCGC AGATCCTGCC ATGTATTAAA
GATCTGGGAC TAATTGAGCA GATTGACGAG AAGATCAACC TCAACGGTGG CGCGATCGCG
CTGGGTCATC CGCTGGGTTG TTCCGGTGCG CGTATCAGCA CCACGCTGCT GAATCTGATG
GAACGCAAAG ACGTTCAGTT TGGTCTGGCG ACGATGTGTA TCGGTCTGGG TCAGGGTATT
GCGACGGTGT TTGAGCGGGT TTAA
 
Protein sequence
MEQVVIVDAI RTPMGRSKGG AFRNVRAEDL SAHLMRSLLA RNPALEAAAL DDIYWGCVQQ 
TLEQGFNIAR NAALLAEVPH SVPAVTVNRL CGSSMQALHD AARMIMTGDA QACLVGGVEH
MGHVPMSHGV DFHPGLSRNV AKAAGMMGLT AEMLARMHGI SREMQDAFAA RSHARAWAAT
QSAAFKNEII PTGGHDADGV LKQFNYDEVI RPETTVEALA TLRPAFDPVN GTVTAGTSSA
LSDGAAAMLV MSESRAHELG LKPRARVRSM AVVGCDPSIM GYGPVPASKL ALKKAGLSVS
DIGVFEMNEA FAAQILPCIK DLGLIEQIDE KINLNGGAIA LGHPLGCSGA RISTTLLNLM
ERKDVQFGLA TMCIGLGQGI ATVFERV