Gene ECH74115_0122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0122 
SymbolaceF 
ID6969546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp130165 
End bp132057 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content55% 
IMG OID643384199 
Productdihydrolipoamide acetyltransferase 
Protein accessionYP_002268722 
Protein GI209399253 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00199092 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCG AAATCAAAGT ACCGGACATC GGGGCTGATG AAGTTGAAAT CACCGAGATC 
CTGGTCAAAG TGGGCGACAA AGTTGAAGCC GAACAGTCGC TGATCACCGT AGAAGGCGAC
AAAGCTTCTA TGGAAGTTCC GTCTCCGCAG GCGGGTATCG TTAAAGAGAT CAAAGTCTCT
GTTGGCGATA AAACTCAGAC CGGCGCACTG ATTATGATTT TCGATTCCGC CGACGGTGCA
GCAGACGCTG CACCTGCTCA GGCAGAAGAG AAGAAAGAAG CAGCTCCGGC AGCAGTCCCC
GAGGCTGCCG CGGCAAAAGA CGTTAACGTT CCGGATATCG GCAGCGACGA AGTTGAAGTG
ACCGAAATCC TGGTGAAAGT TGGCGACAAA GTTGAAGCTG AGCAGTCGCT GATCACCGTA
GAAGGCGATA AAGCTTCTAT GGAAGTTCCG GCTCCGTTTG CAGGCACCGT GAAAGAAATC
AAAGTGAACA CCGGCGACAA AGTGTCTACC GGCTCGCTGA TTATGGTCTT CGAAGTGGCG
GGTGAAGCAG GCGCGGCAGC TCCGGCGGCT AAACAGGAAG CGGCTCCGGC AGCGGCCCCT
GCACCAGCGG CTGGCGTGAA AGAAGTTAAC GTTCCGGATA TCGGCGGTGA CGAAGTTGAA
GTGACCGAAG TGATGGTGAA AGTGGGCGAC AAAGTTGCCG CTGAACAGTC ACTGATTACC
GTAGAAGGCG ATAAAGCTTC TATGGAAGTT CCGGCTCCGT TTGCAGGCGT CGTGAAGGAA
CTGAAAGTCA ACGTTGGCGA TAAAGTGAAA ACTGGCTCGC TGATTATGAT CTTCGAAGTT
GAAGGCGCAG CGCCTGCGGC AGCTCCTGCG AAACAGGAAG CAGCAGCGCC GGCTCCGGCA
GCAAAAGCTG AAGCCCCGGC AGCAGCACCG GCTGCGAAAG CGGAAGGCAA ATCTGAATTT
GCTGAAAATG ACGCTTACGT TCACGCGACC CCGCTGATCC GCCGTCTGGC ACGCGAGTTT
GGCGTTAACC TTGCGAAAGT GAAGGGCACT GGCCGTAAAG GTCGTATCCT GCGCGAAGAC
GTTCAGGCTT ACGTGAAAGA AGCTATCAAA CGTGCAGAAG CAGCTCCGGC AGCGACTGGC
GGTGGTATCC CTGGCATGCT GCCGTGGCCG AAGGTGGACT TCAGCAAGTT TGGTGAAATC
GAAGAAGTGG AACTGGGCCG TATCCAGAAA ATTTCTGGTG CGAACCTGAG CCGTAACTGG
GTGATGATCC CGCATGTTAC TCACTTCGAC AAAACCGATA TCACCGAGCT GGAAGCGTTC
CGTAAACAGC AGAACGAAGA AGCGGCGAAA CGTAAGCTGG ATGTGAAGAT CACCCCGGTT
GTCTTCATCA TGAAAGCCGT TGCTGCAGCG CTTGAGCAGA TGCCTCGCTT CAACAGTTCG
CTGTCGGAAG ACGGTCAGCG TCTGACCCTG AAGAAATACA TCAACATCGG TGTGGCGGTG
GATACCCCGA ACGGTCTGGT TGTTCCGGTA TTCAAAGACG TCAACAAGAA AGGCATCATC
GAACTGTCTC GCGAGCTGAT GACTATTTCT AAGAAAGCGC GTGACGGTAA GCTGACTGCG
GGCGAAATGC AGGGCGGTTG CTTCACCATC TCCAGCATCG GCGGCCTGGG TACTACCCAC
TTCGCGCCGA TTGTGAACGC GCCGGAAGTG GCTATCCTCG GCGTTTCCAA GTCCGCGATG
GAGCCGGTGT GGAATGGTAA AGAGTTCGTG CCGCGTCTGA TGCTGCCGAT TTCTCTCTCC
TTCGACCACC GCGTGATCGA CGGTGCTGAT GGTGCCCGTT TCATTACCAT CATTAACAAC
ACGCTGTCTG ACATTCGCCG TCTGGTGATG TAA
 
Protein sequence
MAIEIKVPDI GADEVEITEI LVKVGDKVEA EQSLITVEGD KASMEVPSPQ AGIVKEIKVS 
VGDKTQTGAL IMIFDSADGA ADAAPAQAEE KKEAAPAAVP EAAAAKDVNV PDIGSDEVEV
TEILVKVGDK VEAEQSLITV EGDKASMEVP APFAGTVKEI KVNTGDKVST GSLIMVFEVA
GEAGAAAPAA KQEAAPAAAP APAAGVKEVN VPDIGGDEVE VTEVMVKVGD KVAAEQSLIT
VEGDKASMEV PAPFAGVVKE LKVNVGDKVK TGSLIMIFEV EGAAPAAAPA KQEAAAPAPA
AKAEAPAAAP AAKAEGKSEF AENDAYVHAT PLIRRLAREF GVNLAKVKGT GRKGRILRED
VQAYVKEAIK RAEAAPAATG GGIPGMLPWP KVDFSKFGEI EEVELGRIQK ISGANLSRNW
VMIPHVTHFD KTDITELEAF RKQQNEEAAK RKLDVKITPV VFIMKAVAAA LEQMPRFNSS
LSEDGQRLTL KKYINIGVAV DTPNGLVVPV FKDVNKKGII ELSRELMTIS KKARDGKLTA
GEMQGGCFTI SSIGGLGTTH FAPIVNAPEV AILGVSKSAM EPVWNGKEFV PRLMLPISLS
FDHRVIDGAD GARFITIINN TLSDIRRLVM