Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0119 |
Symbol | aceF |
ID | 5595158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 127334 |
End bp | 129226 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640919306 |
Product | dihydrolipoamide acetyltransferase |
Protein accession | YP_001456901 |
Protein GI | 157159583 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.0825086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATCG AAATCAAAGT ACCGGACATC GGGGCTGATG AAGTTGAAAT CACCGAGATC CTGGTCAAAG TGGGCGACAA AGTTGAAGCC GAACAGTCGC TGATCACCGT AGAAGGCGAC AAAGCCTCTA TGGAAGTTCC GTCTCCGCAG GCGGGTATCG TTAAAGAGAT CAAAGTCTCT GTTGGCGATA AAACCCAGAC CGGCGCACTG ATTATGATTT TCGATTCCGC CGACGGTGCA GCAGACGCTG CACCTGCTCA GGCAGAAGAG AAGAAAGAAG CAGCTCCGGC AGCAGCACCA GCGGCTGCAG CGGCAAAAGA CGTTAACGTT CCGGATATCG GCAGCGACGA AGTTGAAGTG ACCGAAATCC TGGTGAAAGT TGGCGATAAA GTTGAAGCTG AACAGTCGCT GATCACCGTA GAAGGCGATA AAGCGTCTAT GGAAGTTCCG GCTCCGTTTG CTGGCACCGT GAAAGAGATC AAAGTGAACG TGGGTGACAA AGTGTCTACC GGCTCGCTGA TTATGGTCTT CGAAGTCGCG GGTGAAGCAG GCGCGGCAGC TCCGGCGGCT AAACAGGAAG CGGCTCCGGC AGCGGCCCCT GCACCAGCGG CTGGCGTGAA AGAAGTTAAC GTTCCGGATA TCGGCGGTGA CGAAGTTGAA GTGACCGAAG TGATGGTGAA AGTGGGCGAC AAAGTTGCCG CTGAACAGTC ACTGATTACC GTAGAAGGCG ACAAAGCTTC TATGGAAGTT CCGGCTCCGT TTGCAGGCGT CGTGAAGGAA CTGAAAGTCA ACGTTGGCGA TAAAGTGAAA ACTGGCTCGC TGATTATGAT CTTCGAAGTT GAAGGCGCAG CGCCTGCGGC AGCTCCTGCG AAACAGGAAG CGGCAGCGCC GGCTCCGGCA GCAAAAGCTG AAGCTCCGGC AGCAGCACCG GCTGCGAAAG CGGAAGGCAA ATCTGAATTT GCAGAAAACG ACGCTTACGT TCACGCGACT CCGCTGATCC GCCGTCTGGC ACGCGAGTTT GGCGTTAACC TGGCGAAAGT GAAGGGCACT GGCCGTAAAG GTCGTATCCT GCGCGAAGAC GTTCAGGCTT ACGTGAAAGA AGCTATCAAA CGTGCAGAAG CAGCTCCGGC GGCGACTGGC GGCGGTATCC CAGGCATGCT GCCGTGGCCG AAGGTGGACT TCAGCAAGTT TGGTGAAATC GAAGAAGTGG AACTGGGCCG TATCCAGAAA ATTTCTGGTG CGAACCTGAG CCGTAACTGG GTGATGATCC CGCATGTTAC TCACTTCGAC AAAACCGATA TCACCGAGCT GGAAGCGTTC CGTAAACAGC AGAACGAAGA AGCGGCGAAA CGTAAGCTGG ATGTGAAGAT CACCCCGGTT GTCTTCATCA TGAAAGCCGT TGCTGCAGCT CTTGAGCAGA TGCCTCGCTT CAACAGTTCG TTGTCGGAAG ACGGTCAGCG TCTGACCCTG AAGAAATACA TCAACATCGG TGTGGCGGTG GATACCCCGA ACGGTCTGGT TGTTCCGGTA TTCAAAGACG TCAACAAGAA AGGCATCATC GAGCTGTCTC GCGAGCTGAT GACTATTTCT AAGAAAGCGC GTGACGGTAA GCTGACTGCG GGCGAAATGC AGGGCGGTTG CTTCACCATC TCCAGCATCG GCGGCCTGGG TACTACCCAC TTCGCGCCGA TTGTGAACGC GCCGGAAGTG GCTATCCTCG GCGTTTCCAA GTCCGCGATG GAGCCGGTGT GGAATGGTAA AGAGTTCGTG CCGCGTCTGA TGCTGCCGAT TTCTCTCTCC TTCGACCACC GCGTGATCGA CGGTGCTGAT GGTGCCCGTT TCATTACCAT CATTAACAAC ACGCTGTCTG ACATTCGCCG TCTGGTGATG TAA
|
Protein sequence | MAIEIKVPDI GADEVEITEI LVKVGDKVEA EQSLITVEGD KASMEVPSPQ AGIVKEIKVS VGDKTQTGAL IMIFDSADGA ADAAPAQAEE KKEAAPAAAP AAAAAKDVNV PDIGSDEVEV TEILVKVGDK VEAEQSLITV EGDKASMEVP APFAGTVKEI KVNVGDKVST GSLIMVFEVA GEAGAAAPAA KQEAAPAAAP APAAGVKEVN VPDIGGDEVE VTEVMVKVGD KVAAEQSLIT VEGDKASMEV PAPFAGVVKE LKVNVGDKVK TGSLIMIFEV EGAAPAAAPA KQEAAAPAPA AKAEAPAAAP AAKAEGKSEF AENDAYVHAT PLIRRLAREF GVNLAKVKGT GRKGRILRED VQAYVKEAIK RAEAAPAATG GGIPGMLPWP KVDFSKFGEI EEVELGRIQK ISGANLSRNW VMIPHVTHFD KTDITELEAF RKQQNEEAAK RKLDVKITPV VFIMKAVAAA LEQMPRFNSS LSEDGQRLTL KKYINIGVAV DTPNGLVVPV FKDVNKKGII ELSRELMTIS KKARDGKLTA GEMQGGCFTI SSIGGLGTTH FAPIVNAPEV AILGVSKSAM EPVWNGKEFV PRLMLPISLS FDHRVIDGAD GARFITIINN TLSDIRRLVM
|
| |