Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0124 |
Symbol | aceE |
ID | 6142768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 133439 |
End bp | 136102 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615025 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001742241 |
Protein GI | 170680617 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.594782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.632458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAAC GTTTCCCAAA TGACGTGGAT CCGATCGAAA CTCGCGACTG GCTCCAGGCG ATCGAATCGG TCATCCGTGA AGAAGGTGTT GAGCGTGCTC AGTATCTGAT CGACCAACTG CTTGCTGAAG CCCGCAAAGG CGGTGTCAAC GTAGCCGCAG GCACAGGTAT CAGCAACTAC ATCAACACCA TCCCCGTTGA AGAACAACCG GAGTATCCGG GTAATCTGGA ACTGGAACGC CGTATTCGTT CAGCTATCCG CTGGAACGCC ATCATGACGG TTCTGCGTGC GTCGAAAAAA GACCTCGAAC TGGGCGGCCA CATGGCGTCC TTCCAGTCTT CCGCAACCAT TTATGATGTG TGCTTTAACC ACTTCTTCCG TGCACGCAAC GAGCAGGATG GCGGCGACCT GGTTTACTTC CAGGGCCACA TCTCCCCGGG CGTTTACGCA CGTGCTTTCC TGGAAGGTCG TCTGACTCAG GAGCAGCTGG ATAACTTCCG TCAGGAAGTT CACGGCAATG GCCTCTCTTC CTATCCGCAC CCGAAACTGA TGCCGGAATT CTGGCAGTTC CCGACCGTAT CTATGGGTCT GGGTCCGATT GGTGCTATTT ACCAGGCTAA GTTCCTGAAA TATCTGGAAC ACCGTGGCCT GAAAGATACC TCTAAACAGA CCGTTTACGC ATTCCTCGGC GACGGTGAAA TGGACGAACC GGAATCCAAA GGTGCGATCA CCATCGCAAC CCGTGAAAAA CTGGATAACC TGGTCTTCGT TATCAACTGT AACCTGCAAC GTCTTGACGG CCCGGTCACC GGTAACGGCA AGATCATCAA CGAACTGGAA GGCATCTTCG AAGGTGCTGG CTGGAACGTG ATCAAAGTGA TGTGGGGTAG CCGTTGGGAT GAACTGCTGC GTAAAGATAC CAGCGGTAAA CTGATCCAGC TGATGAACGA AACCGTTGAC GGCGACTACC AGACCTTCAA ATCGAAAGAT GGTGCGTACG TTCGTGAACA CTTCTTCGGT AAATATCCTG AAACCGCAGC ACTGGTTGCA GACTGGACTG ACGAGCAGAT CTGGGCACTG AACCGTGGCG GTCACGATCC GAAGAAAATC TACGCTGCAT TCAAGAAAGC GCAGGAAACC AAAGGCAAAG CGACAGTAAT CCTTGCTCAT ACCATTAAAG GTTACGGCAT GGGCGACGCG GCTGAAGGTA AAAACATCGC GCACCAGGTT AAGAAAATGA ACATGGACGG CGTGCGTCAC ATCCGCGACC GTTTCAATGT GCCGGTGTCT GATGCCGATA TCGAAAAACT GCCGTACATC ACCTTCCCGG AAGGTTCTGA AGAGCATACC TATCTGCACG CGCAGCGTCA GAAACTGCAC GGTTATCTGC CAAGCCGTCA GCCGAACTTC ACCGAGAAGC TTGAGCTGCC GAGCCTGCAA GATTTCGGCG CGCTGCTGGA AGAGCAGAGC AAAGAGATCT CTACCACTAT CGCTTTCGTT CGTGCTCTGA ACGTGATGCT GAAGAACAAG TCGATCAAAG ACCGACTGGT GCCGATCATC GCCGACGAAG CGCGTACTTT CGGTATGGAA GGTCTGTTCC GTCAGATTGG TATTTACAGC CCGAACGGTC AGCAGTACAC CCCGCAGGAC CGCGAGCAGG TTGCTTACTA TAAAGAAGAC GAGAAAGGTC AGATTCTGCA GGAAGGGATC AACGAGCTGG GCGCAGGTTG TTCCTGGCTG GCAGCGGCGA CCTCTTACAG CACCAACAAT CTGCCGATGA TTCCGTTCTA CATCTATTAC TCGATGTTCG GCTTCCAGCG TATCGGCGAT CTGTGCTGGG CGGCTGGTGA CCAGCAAGCG CGTGGCTTCC TGATCGGCGG TACTTCCGGT CGTACCACCC TGAACGGCGA AGGTCTGCAG CACGAAGATG GTCACAGCCA CATTCAGTCG CTGACTATCC CGAACTGTAT CTCTTACGAC CCGGCTTACG CTTACGAAGT TGCTGTCATC ATGCATGACG GTCTGGAGCG TATGTACGGT GAAAAACAAG AGAACGTTTA CTACTACATC ACCACGCTGA ACGAAAACTA CCACATGCCG GCAATGCCCG AAGGTGCTGA GGAAGGTATC CGTAAAGGTA TCTACAAACT CGAAACCATT GAAGGTAGCA AAGGTAAAGT TCAGCTGCTC GGCTCCGGTT CTATCCTGCG TCACGTCCGT GAAGCAGCTG AGATCCTGGC GAAAGATTAC GGCGTAGGTT CTGACGTTTA TAGCGTGACA TCCTTCACCG AACTGGCGCG TGATGGTCAG GATTGTGAAC GCTGGAACAT GCTGCACCCG CTGGAAACTC CGCGCGTTCC GTATATCGCT CAGGTGATGA ATGACGCTCC GGCAGTGGCA TCTACCGACT ATATGAAACT GTTCGCTGAG CAGGTCCGTA CTTACGTACC GGCTGACGAC TACCGCGTAC TGGGTACTGA TGGCTTCGGT CGTTCCGACA GCCGTGAGAA CCTGCGTCAC CACTTCGAAG TTGATGCTTC TTATGTCGTG GTTGCGGCGC TGGGCGAACT GGCTAAACGT GGCGAAATCG ATAAGAAAGT GGTTGCTGAC GCAATCGCCA AATTCAACAT CGATGCAGAT AAAGTTAACC CGCGTCTGGC GTAA
|
Protein sequence | MSERFPNDVD PIETRDWLQA IESVIREEGV ERAQYLIDQL LAEARKGGVN VAAGTGISNY INTIPVEEQP EYPGNLELER RIRSAIRWNA IMTVLRASKK DLELGGHMAS FQSSATIYDV CFNHFFRARN EQDGGDLVYF QGHISPGVYA RAFLEGRLTQ EQLDNFRQEV HGNGLSSYPH PKLMPEFWQF PTVSMGLGPI GAIYQAKFLK YLEHRGLKDT SKQTVYAFLG DGEMDEPESK GAITIATREK LDNLVFVINC NLQRLDGPVT GNGKIINELE GIFEGAGWNV IKVMWGSRWD ELLRKDTSGK LIQLMNETVD GDYQTFKSKD GAYVREHFFG KYPETAALVA DWTDEQIWAL NRGGHDPKKI YAAFKKAQET KGKATVILAH TIKGYGMGDA AEGKNIAHQV KKMNMDGVRH IRDRFNVPVS DADIEKLPYI TFPEGSEEHT YLHAQRQKLH GYLPSRQPNF TEKLELPSLQ DFGALLEEQS KEISTTIAFV RALNVMLKNK SIKDRLVPII ADEARTFGME GLFRQIGIYS PNGQQYTPQD REQVAYYKED EKGQILQEGI NELGAGCSWL AAATSYSTNN LPMIPFYIYY SMFGFQRIGD LCWAAGDQQA RGFLIGGTSG RTTLNGEGLQ HEDGHSHIQS LTIPNCISYD PAYAYEVAVI MHDGLERMYG EKQENVYYYI TTLNENYHMP AMPEGAEEGI RKGIYKLETI EGSKGKVQLL GSGSILRHVR EAAEILAKDY GVGSDVYSVT SFTELARDGQ DCERWNMLHP LETPRVPYIA QVMNDAPAVA STDYMKLFAE QVRTYVPADD YRVLGTDGFG RSDSRENLRH HFEVDASYVV VAALGELAKR GEIDKKVVAD AIAKFNIDAD KVNPRLA
|
| |