Gene EcSMS35_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0125 
SymbolaceF 
ID6144791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp136117 
End bp138009 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content55% 
IMG OID641615026 
Productdihydrolipoamide acetyltransferase 
Protein accessionYP_001742242 
Protein GI170680247 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.754691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCG AAATCAAAGT ACCGGACATC GGGGCTGATG AAGTTGAAAT CACCGAGATC 
CTGGTCAAAG TGGGCGACAA AGTTGAAGCC GAACAGTCGC TGATCACCGT AGAAGGCGAC
AAAGCCTCTA TGGAAGTTCC GTCTCCGCAG GCGGGTATCG TTAAAGAGAT CAAAGTCTCT
GTTGGCGATA AAACCCAGAC CGGCGCACTG ATTATGATTT TCGATTCCGC CGACGGTGCA
GCAGACGCTG CACCTGCTCA GGCAGAAGAG AAGAAAGAAG CAGCTCCGGC AGCAGCACCA
GCGGCTGCGG CGGCAAAAGA CGTTAATGTT CCGGATATCG GCAGCGACGA AGTTGAAGTG
ACCGAAATCC TGGTGAAAGT TGGCGATAAA GTTGAAGCTG AACAGTCGCT GATCACCGTA
GAAGGCGACA AGGCTTCTAT GGAAGTTCCG GCTCCGTTTG CTGGCACCGT GAAAGAGATC
AAAGTGAACG TGGGTGACAA AGTGTCTACC GGCTCGCTGA TTATGGTCTT CGAAGTCGCG
GGTGAAGCAG GCGCGGCAGC TCCGGCGGCT AAACAGGAAG CGGCTCCGGC AGCGGCCCCT
GCATCAGCGG CTGGCGTGAA AGACGTTAAC GTTCCGGATA TCGGTGGTGA CGAAGTTGAA
GTGACCGAAG TGATGGTGAA AGTGGGCGAC AAAGTTGCCG CTGAACAGTC ACTGATCACC
GTAGAAGGCG ACAAAGCTTC TATGGAAGTT CCGGCTCCGT TTGCAGGCGT CGTGAAGGAA
CTGAAAGTCA ACGTTGGCGA TAAAGTGAAA ACTGGCTCGC TGATTATGAT CTTCGAAGTT
GAAGGCGCAG CGCCTGCGGC AGCTCCTGCG AAACAGGAAG CGGCAGCGCC GGCTCCGGCA
GCAAAAGCTG AAGCTCCGGC AGCAGCTCCG GCTGCGAAAG CGGAAGGCAA ATCTGAATTT
GCTGAAAATG ACGCTTACGT TCACGCGACT CCGCTGATCC GCCGTCTGGC ACGCGAGTTT
GGCGTTAACC TGGCGAAAGT GAAGGGCACT GGCCGTAAAG GTCGTATCCT GCGCGAAGAC
GTTCAGGCTT ACGTGAAAGA AGCTATCAAA CGTGCAGAAG CGGCTCCGGC GGCGACTGGC
GGCGGTATCC CGGGCATGCT GCCGTGGCCG AAGGTGGACT TCAGCAAGTT TGGTGAAATC
GAAGAAGTGG AACTGGGCCG TATCCAGAAA ATCTCTGGTG CGAACCTGAG CCGTAACTGG
GTGATGATCC CGCATGTTAC TCACTTCGAC AAAACCGATA TCACCGAGCT GGAAGCGTTC
CGTAAACAGC AGAACGAAGA AGCGGCGAAA CGTAAGCTGG ATGTGAAGAT AACCCCAGTT
GTCTTCATCA TGAAAGCCGT TGCTGCGGCA CTGGAGCAGA TGCCTCGCTT CAACAGTTCG
CTGTCGGAAG ACGGTCAGCG TCTGACCCTG AAGAAATACA TCAACATCGG TGTGGCGGTG
GATACCCCGA ACGGTCTGGT TGTTCCGGTA TTCAAAGACG TCAACAAGAA AGGCATCATC
GAGCTGTCTC GCGAGCTGAT GACTATTTCT AAGAAAGCGC GTGACGGTAA GCTGACTGCG
GGCGAAATGC AGGGCGGTTG CTTCACCATC TCCAGCATCG GCGGCCTGGG TACTACCCAC
TTCGCGCCGA TTGTGAACGC GCCGGAAGTG GCTATCCTCG GCGTTTCCAA GTCCGCGATG
GAGCCGGTGT GGAATGGTAA AGAGTTCGTG CCGCGTCTGA TGCTGCCGAT TTCTCTCTCC
TTCGACCACC GCGTGATCGA CGGTGCTGAT GGTGCCCGTT TCATTACCAT CATTAACAAC
ACGCTGTCTG ACATTCGCCG TCTGGTGATG TAA
 
Protein sequence
MAIEIKVPDI GADEVEITEI LVKVGDKVEA EQSLITVEGD KASMEVPSPQ AGIVKEIKVS 
VGDKTQTGAL IMIFDSADGA ADAAPAQAEE KKEAAPAAAP AAAAAKDVNV PDIGSDEVEV
TEILVKVGDK VEAEQSLITV EGDKASMEVP APFAGTVKEI KVNVGDKVST GSLIMVFEVA
GEAGAAAPAA KQEAAPAAAP ASAAGVKDVN VPDIGGDEVE VTEVMVKVGD KVAAEQSLIT
VEGDKASMEV PAPFAGVVKE LKVNVGDKVK TGSLIMIFEV EGAAPAAAPA KQEAAAPAPA
AKAEAPAAAP AAKAEGKSEF AENDAYVHAT PLIRRLAREF GVNLAKVKGT GRKGRILRED
VQAYVKEAIK RAEAAPAATG GGIPGMLPWP KVDFSKFGEI EEVELGRIQK ISGANLSRNW
VMIPHVTHFD KTDITELEAF RKQQNEEAAK RKLDVKITPV VFIMKAVAAA LEQMPRFNSS
LSEDGQRLTL KKYINIGVAV DTPNGLVVPV FKDVNKKGII ELSRELMTIS KKARDGKLTA
GEMQGGCFTI SSIGGLGTTH FAPIVNAPEV AILGVSKSAM EPVWNGKEFV PRLMLPISLS
FDHRVIDGAD GARFITIINN TLSDIRRLVM