Gene EcHS_A0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0119 
SymbolaceF 
ID5595158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp127334 
End bp129226 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content55% 
IMG OID640919306 
Productdihydrolipoamide acetyltransferase 
Protein accessionYP_001456901 
Protein GI157159583 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.0825086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATCG AAATCAAAGT ACCGGACATC GGGGCTGATG AAGTTGAAAT CACCGAGATC 
CTGGTCAAAG TGGGCGACAA AGTTGAAGCC GAACAGTCGC TGATCACCGT AGAAGGCGAC
AAAGCCTCTA TGGAAGTTCC GTCTCCGCAG GCGGGTATCG TTAAAGAGAT CAAAGTCTCT
GTTGGCGATA AAACCCAGAC CGGCGCACTG ATTATGATTT TCGATTCCGC CGACGGTGCA
GCAGACGCTG CACCTGCTCA GGCAGAAGAG AAGAAAGAAG CAGCTCCGGC AGCAGCACCA
GCGGCTGCAG CGGCAAAAGA CGTTAACGTT CCGGATATCG GCAGCGACGA AGTTGAAGTG
ACCGAAATCC TGGTGAAAGT TGGCGATAAA GTTGAAGCTG AACAGTCGCT GATCACCGTA
GAAGGCGATA AAGCGTCTAT GGAAGTTCCG GCTCCGTTTG CTGGCACCGT GAAAGAGATC
AAAGTGAACG TGGGTGACAA AGTGTCTACC GGCTCGCTGA TTATGGTCTT CGAAGTCGCG
GGTGAAGCAG GCGCGGCAGC TCCGGCGGCT AAACAGGAAG CGGCTCCGGC AGCGGCCCCT
GCACCAGCGG CTGGCGTGAA AGAAGTTAAC GTTCCGGATA TCGGCGGTGA CGAAGTTGAA
GTGACCGAAG TGATGGTGAA AGTGGGCGAC AAAGTTGCCG CTGAACAGTC ACTGATTACC
GTAGAAGGCG ACAAAGCTTC TATGGAAGTT CCGGCTCCGT TTGCAGGCGT CGTGAAGGAA
CTGAAAGTCA ACGTTGGCGA TAAAGTGAAA ACTGGCTCGC TGATTATGAT CTTCGAAGTT
GAAGGCGCAG CGCCTGCGGC AGCTCCTGCG AAACAGGAAG CGGCAGCGCC GGCTCCGGCA
GCAAAAGCTG AAGCTCCGGC AGCAGCACCG GCTGCGAAAG CGGAAGGCAA ATCTGAATTT
GCAGAAAACG ACGCTTACGT TCACGCGACT CCGCTGATCC GCCGTCTGGC ACGCGAGTTT
GGCGTTAACC TGGCGAAAGT GAAGGGCACT GGCCGTAAAG GTCGTATCCT GCGCGAAGAC
GTTCAGGCTT ACGTGAAAGA AGCTATCAAA CGTGCAGAAG CAGCTCCGGC GGCGACTGGC
GGCGGTATCC CAGGCATGCT GCCGTGGCCG AAGGTGGACT TCAGCAAGTT TGGTGAAATC
GAAGAAGTGG AACTGGGCCG TATCCAGAAA ATTTCTGGTG CGAACCTGAG CCGTAACTGG
GTGATGATCC CGCATGTTAC TCACTTCGAC AAAACCGATA TCACCGAGCT GGAAGCGTTC
CGTAAACAGC AGAACGAAGA AGCGGCGAAA CGTAAGCTGG ATGTGAAGAT CACCCCGGTT
GTCTTCATCA TGAAAGCCGT TGCTGCAGCT CTTGAGCAGA TGCCTCGCTT CAACAGTTCG
TTGTCGGAAG ACGGTCAGCG TCTGACCCTG AAGAAATACA TCAACATCGG TGTGGCGGTG
GATACCCCGA ACGGTCTGGT TGTTCCGGTA TTCAAAGACG TCAACAAGAA AGGCATCATC
GAGCTGTCTC GCGAGCTGAT GACTATTTCT AAGAAAGCGC GTGACGGTAA GCTGACTGCG
GGCGAAATGC AGGGCGGTTG CTTCACCATC TCCAGCATCG GCGGCCTGGG TACTACCCAC
TTCGCGCCGA TTGTGAACGC GCCGGAAGTG GCTATCCTCG GCGTTTCCAA GTCCGCGATG
GAGCCGGTGT GGAATGGTAA AGAGTTCGTG CCGCGTCTGA TGCTGCCGAT TTCTCTCTCC
TTCGACCACC GCGTGATCGA CGGTGCTGAT GGTGCCCGTT TCATTACCAT CATTAACAAC
ACGCTGTCTG ACATTCGCCG TCTGGTGATG TAA
 
Protein sequence
MAIEIKVPDI GADEVEITEI LVKVGDKVEA EQSLITVEGD KASMEVPSPQ AGIVKEIKVS 
VGDKTQTGAL IMIFDSADGA ADAAPAQAEE KKEAAPAAAP AAAAAKDVNV PDIGSDEVEV
TEILVKVGDK VEAEQSLITV EGDKASMEVP APFAGTVKEI KVNVGDKVST GSLIMVFEVA
GEAGAAAPAA KQEAAPAAAP APAAGVKEVN VPDIGGDEVE VTEVMVKVGD KVAAEQSLIT
VEGDKASMEV PAPFAGVVKE LKVNVGDKVK TGSLIMIFEV EGAAPAAAPA KQEAAAPAPA
AKAEAPAAAP AAKAEGKSEF AENDAYVHAT PLIRRLAREF GVNLAKVKGT GRKGRILRED
VQAYVKEAIK RAEAAPAATG GGIPGMLPWP KVDFSKFGEI EEVELGRIQK ISGANLSRNW
VMIPHVTHFD KTDITELEAF RKQQNEEAAK RKLDVKITPV VFIMKAVAAA LEQMPRFNSS
LSEDGQRLTL KKYINIGVAV DTPNGLVVPV FKDVNKKGII ELSRELMTIS KKARDGKLTA
GEMQGGCFTI SSIGGLGTTH FAPIVNAPEV AILGVSKSAM EPVWNGKEFV PRLMLPISLS
FDHRVIDGAD GARFITIINN TLSDIRRLVM