Gene Information |
Locus tag | EcSMS35_2501 |
Symbol | fadI |
ID | 6146720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2547857 |
End bp | 2549167 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617373 |
Product | 3-ketoacyl-CoA thiolase |
Protein accession | YP_001744545 |
Protein GI | 170683369 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases; [TIGR02446] fatty oxidation complex, beta subunit FadI |
|
|
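The length fields in the table above follow from the coordinates. As a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed 1311 bp) and that the gene length includes the stop codon, the gene and protein lengths can be cross-checked as below. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length_bp = gene_length(2547857, 2549167)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```

The strand field does not change this arithmetic: for a gene on the minus strand the coordinates still describe the same interval on the replicon, and only the reading direction differs.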
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCAGG TTTTACCGCT GGTTACCCGC CTGGGCGATC GTATCGCCAT TGTTAGCGGT TTACGTACGC CTTTTGCCCG CCAGGCGACG GCTTTTCATG GCATTCCCGC GGTTGATTTA GGGAAGATGG TGGTAGGCGA ACTACTAGCA CGCAGCGAAA TTCCCGCCGA AGTGATTGAA CAACTGGTCT TTGGTCAGGT CGTTCAGATG CCGGAAGCCC CCAACATTGC GCGTGAAATT GTTCTTGGCA CCGGAATGAA TGTCCATACC GATGCTTACA GCGTCAGCCG CGCTTGCGCT ACCAGTTTCC AGGCGATTGC AAACGTCGCA GAAAGTCTGA TGGCGGGAAC TATTCGCGCG GGGATTGCCG GTGGGGCCGA TTCCTCTTCC GTATTGCCCA TTGGCGTTAG CAAAAAACTG GCGCGCGTGC TGGTTGATGT CAACAAAGCC CGCACAATGA GCCAGCGCCT GAAACTTTTT TCTCGCTTAC GTTTGCGTGA CTTAATGCCC GTACCGCCTG CGGTAGCAGA ATATTCTACC GGCTTGCGGA TGGGGGACAC CGCAGAGCAA ATGGCGAAAA CCTACGGCAT CACCCGAGAA CAGCAAGATG CATTAGCGCA CCGTTCGCAT CAGCGAGCCG CTCAGGCGTG GTCAGACGGT AAACTCAAAG AAGAGGTGAT GACTGCCTTT ATCCCTCCTT ATAAACAACC GCTTGCCGAA GACAACAATA TTCGCGGTAA TTCCACGCTT GCTGATTACG CAAAGCTGCG TCCGGCGTTT GATCGCAAAC ACGGGACGGT AACGGCAGCA AACAGTACAC CGCTGACCGA TGGCGCGGCA GCGGTGATCC TGATGACTGA ATCGCGGGCG AAAGAATTAG GGCTGGTACC GCTGGGGTAT CTGCGCAGTT ACGCATTTAC TGCGATAGAT GTCTGGCAGG ACATGTTGCT CGGTCCGGCC TGGTCAACGC CGCTGGCGCT GGAACGGGCC GGATTGACGA TGAGCGATCT GACCTTGATC GATATGCACG AAGCCTTTGC AGCTCAGACA CTGGCGAATA TTCAGTTGCT GGGTAGCGAA CGTTTTGCTC GTGATGTACT GGGGCGTGCA CATGCCACTG GCGAAGTGGA CGATAGCAAA TTTAACGTGC TTGGCGGTTC GATTGCTTAC GGGCATCCCT TCGCGGCGAC CGGCGCGCGG ATGATTACCC AGACATTGCA TGAACTTCGC CGTCGCGGCG GTGGATTTGG TTTAGTGACC GCCTGTGCTG CGGGTGGGCT TGGCGCGGCA ATGGTTGTGG AGGCGGAATA A
|
Protein sequence | MGQVLPLVTR LGDRIAIVSG LRTPFARQAT AFHGIPAVDL GKMVVGELLA RSEIPAEVIE QLVFGQVVQM PEAPNIAREI VLGTGMNVHT DAYSVSRACA TSFQAIANVA ESLMAGTIRA GIAGGADSSS VLPIGVSKKL ARVLVDVNKA RTMSQRLKLF SRLRLRDLMP VPPAVAEYST GLRMGDTAEQ MAKTYGITRE QQDALAHRSH QRAAQAWSDG KLKEEVMTAF IPPYKQPLAE DNNIRGNSTL ADYAKLRPAF DRKHGTVTAA NSTPLTDGAA AVILMTESRA KELGLVPLGY LRSYAFTAID VWQDMLLGPA WSTPLALERA GLTMSDLTLI DMHEAFAAQT LANIQLLGSE RFARDVLGRA HATGEVDDSK FNVLGGSIAY GHPFAATGAR MITQTLHELR RRGGGFGLVT ACAAGGLGAA MVVEAE
|
| |
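The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a field might be processed, the following removes the block spacing and computes the GC fraction of a nucleotide string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the table.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, used as a small example.
fragment = clean_sequence("ATGGGTCAGG TTTTACCGCT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.5
```

Passing the full Gene sequence field to the same two functions would give a value that could be compared against the GC content row in the Gene Information table.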