Gene EcolC_3544 details


Gene Information

Locus tag: EcolC_3544
Symbol: aceF
ID: 6064978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3871994
End bp: 3873886
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 55%
IMG OID: 641602961
Product: dihydrolipoamide acetyltransferase
Protein accession: YP_001726485
Protein GI: 170021531
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form
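
The start and end coordinates, gene length, and protein length listed above agree with one another, and a short sketch makes the arithmetic explicit. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 3871994
end_bp = 3873886

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: one trailing stop codon is excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1893 630
```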


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.484984
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.109653
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATCG AAATCAAAGT ACCGGACATC GGGGCTGATG AAGTTGAAAT CACCGAGATC 
CTGGTCAAAG TGGGCGACAA AGTTGAAGCC GAACAGTCGC TGATCACCGT AGAAGGCGAC
AAAGCCTCTA TGGAAGTTCC GTCTCCGCAG GCGGGTATCG TTAAAGAGAT CAAAGTCTCT
GTTGGCGATA AAACCCAGAC CGGCGCACTG ATTATGATTT TCGATTCCGC CGACGGTGCA
GCAGACGCTG CACCTGCTCA GGCAGAAGAG AAGAAAGAAG CAGCTCCGGC AGCAGCACCA
GCGGCTGCGG CGGCAAAAGA CGTTAACGTT CCGGATATCG GCAGCGACGA AGTTGAAGTG
ACCGAAATCC TGGTGAAAGT TGGCGATAAA GTTGAAGCTG AACAGTCGCT GATCACCGTA
GAAGGCGACA AGGCTTCTAT GGAAGTTCCG GCTCCGTTTG CTGGCACCGT GAAAGAGATC
AAAGTGAACG TGGGTGACAA AGTGTCTACC GGCTCGCTGA TTATGGTCTT CGAAGTCGCG
GGTGAAGCAG GCGCGGCAGC TCCGGCCGCT AAACAGGAAG CAGCTCCGGC AGCGGCCCCT
GCACCAGCGG CTGGCGTGAA AGAAGTTAAC GTTCCGGATA TCGGCGGTGA CGAAGTTGAA
GTGACCGAAG TGATGGTGAA AGTGGGCGAC AAAGTTGCCG CTGAACAGTC ACTGATCACC
GTAGAAGGCG ATAAAGCTTC TATGGAAGTT CCGGCTCCGT TTGCAGGCGT CGTGAAGGAA
CTGAAAGTCA ACGTTGGCGA TAAAGTGAAA ACTGGCTCGC TGATTATGAT CTTTGAAGTT
GAAGGCGCAG CGCCTGCGGC AGCTCCTGCG AAACAGGAAG CGGCAGCGCC GGCTCCGGCA
GCAAAAGCTG AAGCCCCGGC AGCAGCACCG GCTGCGAAAG CTGAAGGCAA ATCTGAATTT
GCTGAAAATG ACGCTTACGT TCACGCGACT CCGCTGATCC GCCGTCTGGC ACGCGAGTTT
GGCGTTAACC TTGCGAAAGT GAAGGGCACT GGCCGTAAAG GTCGTATCCT GCGCGAAGAC
GTTCAGGCTT ACGTGAAAGA AGCTATCAAA CGTGCAGAAG CAGCTCCGGC AGCGACTGGC
GGCGGTATCC CAGGCATGCT GCCGTGGCCG AAGGTGGACT TCAGCAAGTT TGGTGAAATC
GAAGAAGTGG AACTGGGCCG TATCCAGAAA ATTTCTGGTG CGAACCTGAG CCGTAACTGG
GTGATGATCC CGCATGTTAC TCACTTCGAC AAAACCGATA TCACCGAGCT GGAAGCGTTC
CGTAAACAGC AGAACGAAGA AGCGGCGAAA CGTAAGCTGG ATGTGAAGAT CACCCCGGTT
GTCTTCATTA TGAAAGCCGT TGCTGCAGCT CTTGAGCAGA TGCCTCGCTT CAACAGTTCG
CTGTCGGAGG ACGGTCAGCG TCTGACCCTG AAGAAGTACA TCAACATCGG TGTGGCGGTG
GATACCCCGA ACGGTCTGGT TGTTCCGGTA TTCAAAGACG TCAACAAGAA AGGCATCATC
GAGCTGTCTC GCGAGCTGAT GACTATTTCT AAGAAAGCGC GTGACGGTAA GCTGACTGCG
GGCGAAATGC AGGGCGGTTG CTTCACCATC TCCAGCATCG GCGGCCTGGG TACTACCCAC
TTCGCGCCGA TTGTGAACGC GCCGGAAGTG GCTATCCTCG GCGTTTCCAA GTCCGCGATG
GAGCCGGTGT GGAATGGTAA AGAGTTCGTG CCGCGTCTGA TGCTGCCGAT TTCTCTCTCC
TTCGACCACC GCGTGATCGA CGGTGCTGAT GGTGCTCGTT TCATTACCAT CATTAACAAC
ACGCTGTCTG ACATTCGCCG TCTGGTGATG TAA
 
Protein sequence
MAIEIKVPDI GADEVEITEI LVKVGDKVEA EQSLITVEGD KASMEVPSPQ AGIVKEIKVS 
VGDKTQTGAL IMIFDSADGA ADAAPAQAEE KKEAAPAAAP AAAAAKDVNV PDIGSDEVEV
TEILVKVGDK VEAEQSLITV EGDKASMEVP APFAGTVKEI KVNVGDKVST GSLIMVFEVA
GEAGAAAPAA KQEAAPAAAP APAAGVKEVN VPDIGGDEVE VTEVMVKVGD KVAAEQSLIT
VEGDKASMEV PAPFAGVVKE LKVNVGDKVK TGSLIMIFEV EGAAPAAAPA KQEAAAPAPA
AKAEAPAAAP AAKAEGKSEF AENDAYVHAT PLIRRLAREF GVNLAKVKGT GRKGRILRED
VQAYVKEAIK RAEAAPAATG GGIPGMLPWP KVDFSKFGEI EEVELGRIQK ISGANLSRNW
VMIPHVTHFD KTDITELEAF RKQQNEEAAK RKLDVKITPV VFIMKAVAAA LEQMPRFNSS
LSEDGQRLTL KKYINIGVAV DTPNGLVVPV FKDVNKKGII ELSRELMTIS KKARDGKLTA
GEMQGGCFTI SSIGGLGTTH FAPIVNAPEV AILGVSKSAM EPVWNGKEFV PRLMLPISLS
FDHRVIDGAD GARFITIINN TLSDIRRLVM
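
The gene and protein sequences above can be related with a small translation sketch. It is a minimal illustration and not the database's own pipeline. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs only in which codons may serve as starts). Because this gene begins with ATG, no special handling of alternative start codons is attempted.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, pasted with its original spacing.
first_row = "ATGGCTATCG AAATCAAAGT ACCGGACATC"
print(translate(first_row))  # MAIEIKVPDI
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_fraction` applied to the same string gives a value to compare against the reported GC content.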