Gene EcSMS35_2501 details


Gene Information

Locus tag: EcSMS35_2501
Symbol: fadI
ID: 6146720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2547857
End bp: 2549167
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 56%
IMG OID: 641617373
Product: 3-ketoacyl-CoA thiolase
Protein accession: YP_001744545
Protein GI: 170683369
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02446] fatty oxidation complex, beta subunit FadI
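
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
n_bp = gene_length(2547857, 2549167)
print(n_bp, protein_length(n_bp))
```

With the values in this record, the sketch gives 1311 bp and 436 aa, which agrees with the Gene Length and Protein Length fields.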


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCAGG TTTTACCGCT GGTTACCCGC CTGGGCGATC GTATCGCCAT TGTTAGCGGT 
TTACGTACGC CTTTTGCCCG CCAGGCGACG GCTTTTCATG GCATTCCCGC GGTTGATTTA
GGGAAGATGG TGGTAGGCGA ACTACTAGCA CGCAGCGAAA TTCCCGCCGA AGTGATTGAA
CAACTGGTCT TTGGTCAGGT CGTTCAGATG CCGGAAGCCC CCAACATTGC GCGTGAAATT
GTTCTTGGCA CCGGAATGAA TGTCCATACC GATGCTTACA GCGTCAGCCG CGCTTGCGCT
ACCAGTTTCC AGGCGATTGC AAACGTCGCA GAAAGTCTGA TGGCGGGAAC TATTCGCGCG
GGGATTGCCG GTGGGGCCGA TTCCTCTTCC GTATTGCCCA TTGGCGTTAG CAAAAAACTG
GCGCGCGTGC TGGTTGATGT CAACAAAGCC CGCACAATGA GCCAGCGCCT GAAACTTTTT
TCTCGCTTAC GTTTGCGTGA CTTAATGCCC GTACCGCCTG CGGTAGCAGA ATATTCTACC
GGCTTGCGGA TGGGGGACAC CGCAGAGCAA ATGGCGAAAA CCTACGGCAT CACCCGAGAA
CAGCAAGATG CATTAGCGCA CCGTTCGCAT CAGCGAGCCG CTCAGGCGTG GTCAGACGGT
AAACTCAAAG AAGAGGTGAT GACTGCCTTT ATCCCTCCTT ATAAACAACC GCTTGCCGAA
GACAACAATA TTCGCGGTAA TTCCACGCTT GCTGATTACG CAAAGCTGCG TCCGGCGTTT
GATCGCAAAC ACGGGACGGT AACGGCAGCA AACAGTACAC CGCTGACCGA TGGCGCGGCA
GCGGTGATCC TGATGACTGA ATCGCGGGCG AAAGAATTAG GGCTGGTACC GCTGGGGTAT
CTGCGCAGTT ACGCATTTAC TGCGATAGAT GTCTGGCAGG ACATGTTGCT CGGTCCGGCC
TGGTCAACGC CGCTGGCGCT GGAACGGGCC GGATTGACGA TGAGCGATCT GACCTTGATC
GATATGCACG AAGCCTTTGC AGCTCAGACA CTGGCGAATA TTCAGTTGCT GGGTAGCGAA
CGTTTTGCTC GTGATGTACT GGGGCGTGCA CATGCCACTG GCGAAGTGGA CGATAGCAAA
TTTAACGTGC TTGGCGGTTC GATTGCTTAC GGGCATCCCT TCGCGGCGAC CGGCGCGCGG
ATGATTACCC AGACATTGCA TGAACTTCGC CGTCGCGGCG GTGGATTTGG TTTAGTGACC
GCCTGTGCTG CGGGTGGGCT TGGCGCGGCA ATGGTTGTGG AGGCGGAATA A
 
Protein sequence
MGQVLPLVTR LGDRIAIVSG LRTPFARQAT AFHGIPAVDL GKMVVGELLA RSEIPAEVIE 
QLVFGQVVQM PEAPNIAREI VLGTGMNVHT DAYSVSRACA TSFQAIANVA ESLMAGTIRA
GIAGGADSSS VLPIGVSKKL ARVLVDVNKA RTMSQRLKLF SRLRLRDLMP VPPAVAEYST
GLRMGDTAEQ MAKTYGITRE QQDALAHRSH QRAAQAWSDG KLKEEVMTAF IPPYKQPLAE
DNNIRGNSTL ADYAKLRPAF DRKHGTVTAA NSTPLTDGAA AVILMTESRA KELGLVPLGY
LRSYAFTAID VWQDMLLGPA WSTPLALERA GLTMSDLTLI DMHEAFAAQT LANIQLLGSE
RFARDVLGRA HATGEVDDSK FNVLGGSIAY GHPFAATGAR MITQTLHELR RRGGGFGLVT
ACAAGGLGAA MVVEAE
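
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11 (which match the standard code; the tables differ only in the permitted start codons). The `translate` helper is hypothetical; it ignores the spaces and line breaks used in the listing above and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a CDS, dropping whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGGTCAGG TTTTACCGCT GGTTACCCGC"))  # MGQVLPLVTR
```

The first ten codons translate to MGQVLPLVTR, matching the start of the listed protein sequence.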