Gene EcolC_1311 details


Gene Information

Locus tag: EcolC_1311
Symbol: fadI
ID: 6068500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1440400
End bp: 1441710
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 55%
IMG OID: 641600733
Product: 3-ketoacyl-CoA thiolase
Protein accession: YP_001724304
Protein GI: 170019350
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02446] fatty oxidation complex, beta subunit FadI
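
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the listed values, but the record does not state them. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1440400
END_BP = 1441710


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```

Both results match the Gene Length and Protein Length fields of the record.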


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.995653
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCAGG TTTTACCGCT GGTTACCCGC CAGGGCGATC GTATCGCCAT TGTTAGCGGT 
TTACGTACGC CTTTTGCCCG CCAGGCGACG GCTTTTCATG GCATTCCCGC GGTTGATTTA
GGGAAGATGG TGGTAGGCGA ACTGCTGGCA CGCAGCGAGA TCCCCGCTGA AGTGATTGAA
CAACTGGTCT TTGGTCAGGT CGTACAAATG CCTGAAGCCC CCAACATTGC GCGTGAAATT
GTTCTCGGTA CGGGAATGAA TGTGCATACC GATGCTTACA GCGTCAGCCG CGCTTGCGCT
ACCAGTTTCC AGGCAGTTGC AAACGTCGCA GAAAGCCTGA TGGCGGGAAC TATTCGAGCG
GGGATTGCCG GTGGGGCAGA TTCCTCTTCC GTATTGCCAA TTGGCGTCAG TAAAAAACTG
GCGCGCGTGC TGGTTGATGT CAACAAAGCT CGAACCATGA GCCAGCGACT GAAACTCTTC
TCTCGCCTGC GTTTGCGCGA CTTAATGCCC GTACCGCCTG CGGTAGCAGA ATATTCTACC
GGCTTGCGGA TGGGTGACAC CGCAGAGCAA ATGGCGAAAA CCTACGGCAT CACCCGAGAA
CAGCAAGATG CACTGGCCCA CCGTTCGCAT CAGCGTGCTG CTCAGGCATG GTCAGACGGG
AAACTCAAAG AAGAGGTGAT GACTGCCTTT ATCCCTCCTT ATAAACAACC GTTTGTCGAA
GACAACAATA TTCGCGGTAA TTCCTCGCTT GCTGATTACG CAAAGTTGCG TCCGGCGTTT
GATCGTAAAC ACGGGACGGT AACGGCAGCA AACAGTACGC CGCTGACCGA TGGCGCAGCG
GCGGTGATCC TGATGACCGA ATCGCGGGCG AAAGAATTAG GGCTGGTACC GCTGGGGTAT
CTGCGCAGCT ACGCATTTAC TGCGATAGAT GTCTGGCAGG ACATGTTGCT CGGTCCAGCC
TGGTCAACAC CGCTGGCGCT GGAACGTGCC GGTTTGACGA TGAGCGATCT GACATTGATC
GATATGCACG AAGCCTTTGC AGCTCAGACA CTGGCGAATA TTCAGTTGCT GGGTAGTGAA
CGTTTTGCTC GTGAAGTACT GGGGCGTGCA CATGCCACTG GCGAAGTGGA CGATAGCAAA
TTTAACGTGC TTGGCGGTTC GATTGCTTAT GGGCATCCCT TCGCGGCGAC CGGCGCGCGG
ATGATTACCC AGACACTGCA TGAACTTCGC CGTCGCGGCG GTGGATTTGG TTTAGTTACC
GCCTGTGCTG CCGGTGGGCT TGGCGCGGCA ATGGTTCTGG AGGCGGAATA A
 
Protein sequence
MGQVLPLVTR QGDRIAIVSG LRTPFARQAT AFHGIPAVDL GKMVVGELLA RSEIPAEVIE 
QLVFGQVVQM PEAPNIAREI VLGTGMNVHT DAYSVSRACA TSFQAVANVA ESLMAGTIRA
GIAGGADSSS VLPIGVSKKL ARVLVDVNKA RTMSQRLKLF SRLRLRDLMP VPPAVAEYST
GLRMGDTAEQ MAKTYGITRE QQDALAHRSH QRAAQAWSDG KLKEEVMTAF IPPYKQPFVE
DNNIRGNSSL ADYAKLRPAF DRKHGTVTAA NSTPLTDGAA AVILMTESRA KELGLVPLGY
LRSYAFTAID VWQDMLLGPA WSTPLALERA GLTMSDLTLI DMHEAFAAQT LANIQLLGSE
RFAREVLGRA HATGEVDDSK FNVLGGSIAY GHPFAATGAR MITQTLHELR RRGGGFGLVT
ACAAGGLGAA MVLEAE
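
Both sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names `normalize` and `gc_percent` are hypothetical, and the demonstration uses only the first line of the gene sequence, so its GC value is not the gene-wide figure.

```python
def normalize(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGGTCAGG TTTTACCGCT GGTTACCCGC CAGGGCGATC GTATCGCCAT TGTTAGCGGT"
dna = normalize(first_line)
print(len(dna))  # 60
print(dna[:3])   # ATG
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence through `normalize` and `gc_percent` would allow a comparison with the GC content field of the record.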