Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4226 |
Symbol | fadA |
ID | 6147277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4322655 |
End bp | 4323818 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641619049 |
Product | 3-ketoacyl-CoA thiolase |
Protein accession | YP_001746177 |
Protein GI | 170682404 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases [TIGR02445] fatty oxidation complex, beta subunit FadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.032199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00446338 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACAGG TTGTCATTGT CGATGCAATT CGCACCCCGA TGGGCCGTTC GAAGGGCGGT GCTTTTCGTC ACGTGCGTGC GGAAGATCTC TCCGCTCATT TAATGCGTAG CCTGCTGGCG CGTAACCCGG CGCTGGAAGC GGCGGCCCTC GACGATATTT ACTGGGGCTG TGTACAGCAG ACGCTGGAGC AAGGTTTTAA TATCGCCCGT AACGCGGCGC TGTTGGCAGA AGTACCACAC TCTGTCCCGG CGGTGACCGT CAATCGCCTG TGTGGTTCAT CCATGCAAGC ATTACATGAC GCAGCGCGAA TGATCATGAC CGGCGATGCG CAGGCATGTC TGGTTGGCGG CGTGGAGCAT ATGGGCCATG TGCCGATGAG TCACGGCGTC GATTTTCACC CCGGCCTGAG CCGCAATGTC GCCAAAGCGG CGGGCATGAT GGGCTTAACG GCAGAAATGC TGGCGCGTAT GCACGGTATC AGCCGTGAAA TGCAGGATGC CTTTGCCGCG CGGTCACACG CCCGCGCCTG GGCCGCCACG CAGTCGGGTG CATTTAAAAA TGAAATTATC CCGACCGGTG GTCACGATGC CGACGGCGTC CTGAAGCAGT TTAATTACGA CGAAGTGATT CGCCCGGAAA CCACCGTGGA AGCCCTCGCC ACGCTGCGCC CGGCGTTTGA TCCCGTCAGC GGTACGGTGA CGGCAGGTAC ATCTTCTGCG CTTTCCGATG GCGCAGCTGC CATGCTGGTG ATGAGTGAAA GCCGCGCCCG AGAATTAGGT CTTAAGCCGC GCGCCCGCGT GCGTTCGATG GCGGTCGTTG GTTGTGACCC ATCGATTATG GGTTACGGCC CGGTTCCGGC CTCGAAGCTG GCGCTGAAAA AAGCGGGGCT TTCTGCCAGC GATATCGGCG TATTCGAGAT GAACGAAGCC TTTGCCGCGC AGATCCTGCC ATGTATTAAA GATCTCGGGC TAATGGAGCA GATTGACGAG AAGATCAACC TCAACGGTGG CGCGATCGCG CTGGGTCATC CGCTGGGTTG TTCCGGTGCG CGTATCAGCA CCACGCTGCT GAATCTGATG GAGCGCAAAG ATGCCCAGTT TGGTCTGGCG ACGATGTGTA TCGGTCTGGG TCAGGGTATT GCGACGGTGT TTGAGCGGAT TTAA
|
Protein sequence | MEQVVIVDAI RTPMGRSKGG AFRHVRAEDL SAHLMRSLLA RNPALEAAAL DDIYWGCVQQ TLEQGFNIAR NAALLAEVPH SVPAVTVNRL CGSSMQALHD AARMIMTGDA QACLVGGVEH MGHVPMSHGV DFHPGLSRNV AKAAGMMGLT AEMLARMHGI SREMQDAFAA RSHARAWAAT QSGAFKNEII PTGGHDADGV LKQFNYDEVI RPETTVEALA TLRPAFDPVS GTVTAGTSSA LSDGAAAMLV MSESRARELG LKPRARVRSM AVVGCDPSIM GYGPVPASKL ALKKAGLSAS DIGVFEMNEA FAAQILPCIK DLGLMEQIDE KINLNGGAIA LGHPLGCSGA RISTTLLNLM ERKDAQFGLA TMCIGLGQGI ATVFERI
|
| |