Gene EcHS_A2493 details


Gene Information

Locus tag: EcHS_A2493
Symbol: fadI
ID: 5594961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2500780
End bp: 2502090
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 55%
IMG OID: 640921614
Product: 3-ketoacyl-CoA thiolase
Protein accession: YP_001459148
Protein GI: 157161830
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02446] fatty oxidation complex, beta subunit FadI
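
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state its convention) and that the gene length includes one terminal stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 2500780
end_bp = 2502090

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1311 436
```

Under these assumptions the result agrees with the listed Gene Length (1311 bp) and Protein Length (436 aa).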


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCAGG TTTTACCGCT GGTTACCCGC CAGGGCGATC GTATCGCCAT TGTTAGCGGT 
TTACGTACGC CTTTTGCCCG CCAGGCGACG GCTTTTCATG GCATTCCCGC GGTTGATTTA
GGGAAGATGG TGGTAGGCGA ACTGCTGGCA CGCAGCGAGA TCCCCGCTGA AGTGATTGAA
CAACTGGTCT TTGGTCAGGT CGTACAAATG CCTGAAGCCC CCAACATTGC GCGTGAAATT
GTTCTCGGTA CGGGAATGAA TGTGCATACC GATGCTTACA GCGTCAGCCG CGCTTGCGCT
ACCAGTTTCC AGGCAGTTGC AAACGTCGCA GAAAGCCTGA TGGCGGGAAC TATTCGAGCG
GGGATTGCCG GTGGGGCAGA TTCCTCTTCC GTATTGCCAA TTGGCGTCAG TAAAAAACTG
GCGCGCGTGC TGGTTGATTT CAACAAAGCT CGAATCATGA GACAGCGCCT GAAACTCTTC
TCTCGCCTGC GTTTGCGCGA CTTAATGCCC GTACCGCCTG CGGTAGCAGA ATATTCTACC
GGCTTGCGGA TGGGTGACAC CGCAGAGCAA ATGGCGAAAA CCTACGGCAT CACCCGAGAA
CAGCAAGATG CATTAGCGCA CCGTTCGCAT CAGCGTGCCG CTCAGGCATG GTCAGACGGA
AAACTCAAAG AAGAGGTGAT GACTGCCTTT ATCCCTCCTT ATAAACAACC GCTTGTCGAA
GACAACAATA TTCGCGGTAA TTCCTCGCTT GCCGATTACG CAAAGCTGCG CCCGGCGTTT
GATCGCAAAC ACGGAACGGT AACGGCGGCA AACAGTACGC CGCTGACCGA TGGCGCGGCA
GCGGTGATCC TGATGACTGA ATCCCGGGCG AAAGAATTAG GGCTGGTGCC GCTGGGGTAT
CTGCGCAGCT ACGCATTTAC TGCGATTGAT GTCTGGCAGG ACATGTTGCT CGGTCCAGCC
TGGTCAACAC CGCTGGCGCT GGAGCGTGCC GGTTTGACGA TGAGCGATCT GACATTGATC
GATATGCACG AAGCCTTTGC AGCTCAGACA CTGGCGAATA TTCAGTTGCT GGGTAGTGAA
CGTTTTGCTC GTGATGTACT GGGGCGTGCA CATGCCACTG GCGAAGTGGA CGATAGCAAA
TTTAACGTGC TTGGCGGTTC GATTGCTTAC GGACATCCCT TCGCGGCGAC CGGCGCACGG
ATGATTACCC AGACATTGCA TGAACTTCGC CGTCGCGGCG GTGGATTTGG TTTAGTTACC
GCCTGTGCTG CCGGTGGGCT TGGCGCGGCA ATGGTTCTGG AGGCGGAATA A
 
Protein sequence
MGQVLPLVTR QGDRIAIVSG LRTPFARQAT AFHGIPAVDL GKMVVGELLA RSEIPAEVIE 
QLVFGQVVQM PEAPNIAREI VLGTGMNVHT DAYSVSRACA TSFQAVANVA ESLMAGTIRA
GIAGGADSSS VLPIGVSKKL ARVLVDFNKA RIMRQRLKLF SRLRLRDLMP VPPAVAEYST
GLRMGDTAEQ MAKTYGITRE QQDALAHRSH QRAAQAWSDG KLKEEVMTAF IPPYKQPLVE
DNNIRGNSSL ADYAKLRPAF DRKHGTVTAA NSTPLTDGAA AVILMTESRA KELGLVPLGY
LRSYAFTAID VWQDMLLGPA WSTPLALERA GLTMSDLTLI DMHEAFAAQT LANIQLLGSE
RFARDVLGRA HATGEVDDSK FNVLGGSIAY GHPFAATGAR MITQTLHELR RRGGGFGLVT
ACAAGGLGAA MVLEAE
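
The gene and protein sequences above can be compared with a short script. What follows is a minimal sketch, not the method used to produce this record: the helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and the codon table is written out by hand in NCBI "TCAG" order. Translation table 11 assigns the same amino acids to codons as the standard code; its alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Standard codon assignments in NCBI "TCAG" order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_bases = "ATGGGTCAGG TTTTACCGCT GGTTACCCGC"
print(translate(clean_sequence(first_bases)))  # MGQVLPLVTR
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_content` allows the listed protein sequence and GC content to be checked in the same way.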