Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2792 |
Symbol | aceE |
ID | 4072415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3305365 |
End bp | 3308046 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984810 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_591867 |
Protein GI | 94969819 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.215552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCTG TTAATCTAGA CACAATCGAC ATTAATTCCC TGGAAGTCTT AGAGAACCGT GAATGGCTTG AGTCGCTCGA ATACGTCCTC CAAACCGGAG GCCCCGAGCG CGTAGGACGC CTGATTCAGC AGCTCCAGTT GCAATGCGAG CGCGCCGGCG TAAAACTCCC ATTCACTGCC ACCACTCCCT ACGAAAACAC CATCCCCGCC GACCGCCAGC CCCCGTTTCC CGGCAGCCAG GAAATGGAAC GTCGCATTAA GAGCCTTATT CGCTGGAACG CTCTGGCCAT GGTCATGCGC GCCAACAAGG TCGAAGAGGG AATCGGCGGC CACATCTCCA CCTTCGCCTC CGCGGCCACG CTCTACGAAG TCGGCTTCAA CCACTTCTTT CGCGCCGCTA CTGAAGATGG CGATCGTGAC ATCGTCTATT TCCAGGGACA CAGCGCCCCC GGCATCTACT CCCGCGCGTT CCTCGAAGGT CGTCTCCCGA TCGAGAAGCT GGAGAACTTC CGCCGCGAAC TTCATCCCGG CGGCGGCCTC TCGTCCTATC CGCACCCGTG GCTCATGCCC GACTTCTGGG AATTCCCCAC GGTCTCCATG GGACTCGGTC CGATCACCGC GATCTACCAG GCTCGCTTCA ACAAGTACCT CGAAAACCGC GGCCTCAAGA CTGCGACCAG CGGCAAGATC TGGGCCTTCC TCGGTGACGG CGAAACCGAT GAGCCCGAGT CGCTCGGTGC AATTTCTCTC GCTTCGCGGG AACGCCTCGA CAACCTCATC TTCGTTATCA ACTGCAACCT CCAGCGCCTC GACGGCCCCG TCCGCGGCAA TTTCAAAATC ATCCAGGAAC TTGAAGCCAA CTTCCGCGGC GCCGGATGGA ACGTAATCAA AGTTATCTGG GGCAGCGATT GGGACAGCCT CATTGAGAAG GACACCGACG GTCTGCTCGT AAAGCGCATG GGCGAAATCA CCGACGGCCA GTTCCAGAAG TACGCCGTCG AAACCGGACG CTACTTCCGC CAGAACTTCT TCGGCACCGA CCCGCGCCTG CTCAAGATGG TCGAGCACCT CAGCGACGAG CAGCTCGAAC ACCTGCGCCT CGGCGGACAC GACCCGATCA AAGTTCACGC CGCCTACAAA GAGGCCGTCG ATCACAAGGG CTCGCCCACG GTCATTCTCG CCAAGACGAT CAAGGGTTAC GGCCTCGGCG AAAGTGGCGA GGGCAAGAAC ATCACCCACC AGCAGAAGAA GCTCAACGAA GAAGAGCTCA AAATCTTCCG CTCGCGCTTC GGCATCCCTG TCGCCGATGA AGATCTCGCG AAAGCCCCGT TCTATCGCCC CAGCGACGAC TCGGCCGAAA TCAAATACCT GCAGGAACGC CGCAAGCAAC TCGGCGGATA CATGCCGGCC CGCAAGGTCC GCGCCGCCGC GCTGCCCATC CCGAAGGAAG AGCTCTTTGA AGAGTTCTAC AAGGGCACTG AAGGCCGCAA GGCATCGAGC ACCATGGTCT TCGTTCGCAT GCTCGGCAAA CTGCTGCGCG ATCCCGAATT CGGCAAGTAC GTCGTGCCCA TCGTTCCCGA CGAAGCTCGA ACCTTCGGTA TGGAAGCGCT CTTCCGCCAG GTGGGTATCT ACTCCAGCGT AGGCCAGCTC TACGAGCCTG TCGATATGGA CACGCTCCTC TATTACAAGG AGTCTAAGGA CGGCCAGATT CTCGAAGAGG GCATCACCGA GGCCGGTTCG ATGTCTTCGT TCATCGCTGC GGGCAGTGCA TATTCCACGC ACGGCATCCC GACGATTCCG TTCTTCATTT ACTACTCGAT GTTCGGATTC CAGCGCATTG GCGATCTCGT ATGGGCCGCC GCTGACACCC GCTGCCGCGG CTTCATGCTC GGCGGCACCG CCGGACGCAC CACCCTTGCC GGCGAAGGTC TCCAGCACCA GGACGGCCAC AGCCACCTGC TCGCTTACCC GGTTCCTACC TGCATGGCCT ACGATCCCGC GTTCGCCTTT GAACTCGCGA TCATCATTCA GGACGGCATC AAGCGCATGT ATCACGACGG CGAAAGCATC TTCTACTACA TCACCGTTAT GAACGAGCCG GTCGAGAACC CCGCCATGCC CGAAGGTGTG CGCGAAGGTA TTCTCCGCGG CATGTATCGC TTCAAGAAGT CGGAGCACAA GTCGAAGCTC AAGGCGAACC TCTTCGGCTC CGGCACCATC ATGCAGGAAG TGATCAAGGC CGCCGAAATC CTCGAGTCCA AGTACGACAT CGCGAGCGAT ATCTGGAGCA TCACCAGCTA CAAGGAACTC TACAAAGACG GCAATGACGT GGACCGCTGG AACATGCTGC ACCCGGCGGA GAAGCCGCGC CAGACCTTCA TCGGCGAGCA GTTGAAAGAC GCCGAAGGCG TCTTTGTCGC TGCTTCCGAC TATGTGAAGG CAATGCCGGA ATCCATTTCG CAGTGGTTCC CGCGTCCGCT CCTCGCGCTC GGTACCGACG GCTTCGGCCG CAGCGAAGGC CGCGCATCGT TGCGCGACTT CTTCGAGGTC GATGCCAAGC ACATCGTCGT CGGTACGCTC ACCGCGCTCA TGCGCGACGG CAAAGTGAAA CCCGACGCGG TCAGCCGCGC TATCAAGGAT CTCGGCGTCG ATCCCAACAA GCCGAATCCG TTCACCGTTT AG
|
Protein sequence | MNPVNLDTID INSLEVLENR EWLESLEYVL QTGGPERVGR LIQQLQLQCE RAGVKLPFTA TTPYENTIPA DRQPPFPGSQ EMERRIKSLI RWNALAMVMR ANKVEEGIGG HISTFASAAT LYEVGFNHFF RAATEDGDRD IVYFQGHSAP GIYSRAFLEG RLPIEKLENF RRELHPGGGL SSYPHPWLMP DFWEFPTVSM GLGPITAIYQ ARFNKYLENR GLKTATSGKI WAFLGDGETD EPESLGAISL ASRERLDNLI FVINCNLQRL DGPVRGNFKI IQELEANFRG AGWNVIKVIW GSDWDSLIEK DTDGLLVKRM GEITDGQFQK YAVETGRYFR QNFFGTDPRL LKMVEHLSDE QLEHLRLGGH DPIKVHAAYK EAVDHKGSPT VILAKTIKGY GLGESGEGKN ITHQQKKLNE EELKIFRSRF GIPVADEDLA KAPFYRPSDD SAEIKYLQER RKQLGGYMPA RKVRAAALPI PKEELFEEFY KGTEGRKASS TMVFVRMLGK LLRDPEFGKY VVPIVPDEAR TFGMEALFRQ VGIYSSVGQL YEPVDMDTLL YYKESKDGQI LEEGITEAGS MSSFIAAGSA YSTHGIPTIP FFIYYSMFGF QRIGDLVWAA ADTRCRGFML GGTAGRTTLA GEGLQHQDGH SHLLAYPVPT CMAYDPAFAF ELAIIIQDGI KRMYHDGESI FYYITVMNEP VENPAMPEGV REGILRGMYR FKKSEHKSKL KANLFGSGTI MQEVIKAAEI LESKYDIASD IWSITSYKEL YKDGNDVDRW NMLHPAEKPR QTFIGEQLKD AEGVFVAASD YVKAMPESIS QWFPRPLLAL GTDGFGRSEG RASLRDFFEV DAKHIVVGTL TALMRDGKVK PDAVSRAIKD LGVDPNKPNP FTV
|
| |