Gene Information |
Locus tag | Tmz1t_3195 |
Symbol | aceE |
ID | 7874335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3478413 |
End bp | 3481094 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700124 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_002890167 |
Protein GI | 237653853 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
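The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon (both assumptions agree with the values in this record).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Values taken from the record for Tmz1t_3195 (aceE).
    length = gene_length(3478413, 3481094)
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the record's values this reproduces the listed gene length (2682 bp) and protein length (893 aa).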
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.226558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAA TCAGCACCGT TCTCCCGCAA ACCGACACGG ATGCGCAGGA AACGAAGGAA TGGCTGGACG CGCTCGCCGC CGTCGTCGAA CACGAAGGCC CCGAGCGCGC CCACTACCTG ATCGAGACCC TGATCGCCAC GGCGCGCCAG GAGGGGGTGA ACATCCCCTA CTCGGCCACG ACCGAGTACA TCAACACGAT TCCGGCCGAG CGCCAGCCGC CCTACCCGGG CGATCCGGAT CTGGAGATCA AGATCCACTC CTACATCCGC TGGAACGCGA TGGCGATGGT GGTGCGGGCG AACAAGGACA CCAACGTGGG CGGTCACATC GCCTCCTTCG CCTCGGCGGC GGCGCTCTAC GACGTGGGCT TCTCGCACTT CTGGCACGCG CCCTCCGAGG CGCACGACGG CGACCTGATC TACTTCCAGG GCCACTCGGT GCCGGGCGTG TATGCGCGCG CGTTCATGCT GGGCCGTCTG ACCGAAGAAC AGATGGACAG CTTCCGCCAG GAAGTGGGCG GCAAGGGCAT CTCGTCCTAC CCGCACCCCT GGCTGATGCC CGACTTCTGG CAGTTCCCCA CCGTGTCGAT GGGCCTGGGT CCGCTGTGCG CGATCTACTC GGCGCGCTTC ATGAAATACC TGGCCAGCCG CGGCCTGATC GATCCGGCAC GCGCGCAGCA GCGCAAGGTG TGGGCCTTCC TGGGCGACGG CGAGACCGAC GAGGTCGAAT CGCTGGGCGC GATCGGCATG GCCGGCCGCG AGGGCCTGGA CAACCTCGTC TTCGTGATCA ACTGCAACCT GCAGCGCCTC GATGGTCCGG TGCGCGGCAA CGGCAAGATC ATCCAGGAGC TCGAGTCCGA GTTCCGCGGC GCGGGCTGGA ACGTCATCAA GGTGATCTGG GGCACGCACT GGGACGCGCT CTTCGCCCGC GACAAGAAGG GCATCCTCAA GCAGCGCATG ATGGAGTTGT GCGACGGCGA GTATCAGACC TTCAAGGCCA AGGACGGCGC CTACGTGCGC GAGCACTTCT TCAACACGCC CGAGCTGCGC GCGCTGGTGG CCGACTGGAC CGACGAGCAG GTGTGGAACC TGAACCGCGG CGGCCACGAT CTCTTCAAGA TCTTCTCCGC CTACAAGGCC GCCAACGAGC ACAAGGGGCA GCCCACGCTG ATCCTGGCCA AGACCATCAA GGGCTTCGGC ATGGGCCAGG CCGGCGAGGC GATGAACATC TCGCACCAGC AGAAAAAGAT GGACGTCGAG GCCATCCGCC GCTTCCGCGA CCGCTTCGGC CTGCCGGTGC CCGACGACCA GCTCGAGGCA CTGCCCTACC TGAAGTTCGC CGAGGACTCT CCCGAGCACA AGTACCTGCT CAAGCACCGC ATGGACCTGG GCGGCTTCCT GCCGCAGCGC CGGCGCAAGG CCGCCGCGCT CGAGATCCCG GGCCTGGACA CCTTCGCCGC GCTGCTCAAG GCCTCGGGCG AGGGCCGCGA GCTGTCGACC ACCATGGCGG TGGTGCGCAT CATGAACACG CTGTTGAAGG ACAAGCAGAT CGGCAAGCAT GTCGTGCCGA TCGTCCCGGA CGAAAGCCGC ACCTTCGGCA TGGAGGGCAT GTTCCGCCAG TACGGCATAT GGAACCAGCA AGGCCAGAAC TACGTGCCGG AAGACCATGA CCAGCTCATG TTCTACAAGG AGAGCAAGAC CGGCCAGGTG CTGCAGGAAG GCATCAACGA GGCCGGCGCG ATGGCCGACT GGATCGCCGC GGGCACCGCC TACAGCGTGC ACAACGTGCA GATGATCCCG 
TTCTACATCT TCTATTCCAT GTTCGGCCTG CAGCGCACCA TGGACCTGTG CTGGGCGGCC GCCGACCAGC GCACCCGCGG CTTCCTGATC GGGGGCACCG CCGGGCGCAC GACCCTGAAC GGCGAAGGCC TGCAGCACGA GGACGGCCAC AGCCTGATCA TGGCCAACAT GATCCCCAAC TGCATCAGCT ACGACCCGAC CTTCCAGTAC GAGGTCGCGG TGATCGTGCA GGACGGCCTG CGCCGCATGT TCGCCGAGCA GGAGGACGTG TATTACTACA TCACGGTGAT GAACGAGAAC TACGAACATC CCGAGATGCC GGTTGGCGCC GAGGCCGACA TCCTCAAAGG CCTGTACCTG TTCCGCCAGG GCGGCCAGGC CAAGACCCCG CGCGTGCAGC TGATGGGTTC GGGCACGATC TTCCGCGAGG TGATCGCCGC CGCCGAGCTG CTCAAGGCCG ACTGGGGCGT GGAGTCGGAC ATCTGGGGCG CGCCCAGCTT CAACGAGCTG GCCCGCAACG GCCACGACGT CGAGCGCTGG AACCTGCTGC ACCCGATGGA AGAGCCGCGA AAGACCCACG TGCAGCAGAA GCTCGCCGGC TTCGCAGGCC CGGTCGTCGC CGCCACCGAC TACATCCGCA TGTTCCCCGA GCAGATCCGC GCGCAGCTCG ACCGCACTTA CGTCACGCTG GGCACCGACG GCTTCGGCCG CTCCGACACC CGCGAGCAGC TGCGCCACTT CTTCGAGGTG GATCGCCACT GGGTGACGCT GGCCGCGCTG AAGGCGCTGG CCGACGAGGG CACGATCGGC CGCGACAAGG TCGCCGCGGC GATGGTCAAG TACCAGCTCG ACCCGTCCAA ACCGAACCCG ATGTCGGTCT GA
|
Protein sequence | MTGISTVLPQ TDTDAQETKE WLDALAAVVE HEGPERAHYL IETLIATARQ EGVNIPYSAT TEYINTIPAE RQPPYPGDPD LEIKIHSYIR WNAMAMVVRA NKDTNVGGHI ASFASAAALY DVGFSHFWHA PSEAHDGDLI YFQGHSVPGV YARAFMLGRL TEEQMDSFRQ EVGGKGISSY PHPWLMPDFW QFPTVSMGLG PLCAIYSARF MKYLASRGLI DPARAQQRKV WAFLGDGETD EVESLGAIGM AGREGLDNLV FVINCNLQRL DGPVRGNGKI IQELESEFRG AGWNVIKVIW GTHWDALFAR DKKGILKQRM MELCDGEYQT FKAKDGAYVR EHFFNTPELR ALVADWTDEQ VWNLNRGGHD LFKIFSAYKA ANEHKGQPTL ILAKTIKGFG MGQAGEAMNI SHQQKKMDVE AIRRFRDRFG LPVPDDQLEA LPYLKFAEDS PEHKYLLKHR MDLGGFLPQR RRKAAALEIP GLDTFAALLK ASGEGRELST TMAVVRIMNT LLKDKQIGKH VVPIVPDESR TFGMEGMFRQ YGIWNQQGQN YVPEDHDQLM FYKESKTGQV LQEGINEAGA MADWIAAGTA YSVHNVQMIP FYIFYSMFGL QRTMDLCWAA ADQRTRGFLI GGTAGRTTLN GEGLQHEDGH SLIMANMIPN CISYDPTFQY EVAVIVQDGL RRMFAEQEDV YYYITVMNEN YEHPEMPVGA EADILKGLYL FRQGGQAKTP RVQLMGSGTI FREVIAAAEL LKADWGVESD IWGAPSFNEL ARNGHDVERW NLLHPMEEPR KTHVQQKLAG FAGPVVAATD YIRMFPEQIR AQLDRTYVTL GTDGFGRSDT REQLRHFFEV DRHWVTLAAL KALADEGTIG RDKVAAAMVK YQLDPSKPNP MSV
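The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the method used to produce this record: the helper names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene starts with ATG). Only short fragments of the sequence are used; whitespace between the 10-base blocks is ignored.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, '*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a nucleotide sequence in frame 1, optionally stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    first_blocks = "ATGACCGGAA TCAGCACCGT TCTCCCGCAA"  # first 30 bases of the gene
    last_blocks = "ATGTCGGTCT GA"                      # last 12 bases of the gene
    print(translate(first_blocks))
    print(translate(last_blocks))
```

The first fragment translates to the opening residues of the protein sequence (MTGISTVLPQ), and the last fragment to its final residues (MSV) followed by the TGA stop codon. Applying `gc_content` to the full gene sequence would be the way to compare against the GC content field in the record.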