Gene Information |
Locus tag | Mmcs_3362 |
Symbol | aceE |
ID | 4112194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 3560921 |
End bp | 3563695 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638032495 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_640525 |
Protein GI | 108800328 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
| |
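The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and that the CDS ends with a single stop codon, which is not translated.

```python
# Values copied from the Gene Information table.
START_BP = 3560921
END_BP = 3563695


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS.

    Assumes the length is a multiple of 3 and that a terminal stop
    codon, if present, contributes no amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the results agree with the listed Gene Length (2775 bp) and Protein Length (924 aa).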
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00342151 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCCAGG ATCTGGCCCA AAACTCCTCC ACCGCAGCCG AACACGATCG TGTCCGGGTG ATCCGTGAGG GTGTTGCATC GTATCTGCCC GACATCGATC CCGATGAGAC GAGCGAATGG CTGGAGTCGT TCGACCAGCT GCTCGAACGC TCCGGACCGG CGAGAGCCCG CTACCTGCTG TTGCGGCTGC TCGAACGCTC CGGGGAGCAG CGGGTGGCCA TCCCCGCGCT CACCTCGACC GACTACGTGA ACACGATCCC GACCGAACTC GAGCCGTGGT TCCCCGGCGA CGAGGACGTC GAACGCCGCT ACCGGGCCTG GATCCGGTGG AACGCCGCGA TCATGGTGCA CCGCGCGCAG CGTCCGGGAG TCGGTGTGGG CGGCCACATC TCGACGTACG CGTCGTCGGC CGCGCTCTAC GAGGTGGGCT TCAACCACTT CTTCCGCGGT AAGAGCCACT CCGGCGGCGG CGACCAGGTG TTCATCCAGG GCCACGCCTC CCCCGGGATC TACGCGCGCG CCTATCTCGA GGGCAGGCTG ACCGCCGACC AACTCGACGG GTTCCGCCAG GAGCACAGCC ACCCCGGCGG CGGTATCCCG TCGTACCCGC ATCCGCGGCT GATGCCCGAC TTCTGGGAGT TCCCCACGGT CTCGATGGGC CTGGGCCCGA TGAACGCGAT CTACCAGGCC CGGTTCAACC ACTACCTGCA CGACCGCGGC ATCAAGGACA CCACCGACCA GCATGTGTGG GCGTTCCTCG GCGACGGCGA GATGGACGAA CCGGAGAGCC GGGGGCTCAT CCACGTGGCC GCCCTCGAGG CGCTGGACAA CCTGACGTTC GTCGTCAACT GCAACCTGCA GCGCCTCGAC GGCCCGGTGC GCGGCAACGG CAAGATCATC CAGGAACTGG AGTCGTTCTT CCGCGGCGCC GGCTGGAACG TCATCAAGGT GGTGTGGGGC CGCGAATGGG ATGCGCTGCT GCACGCCGAC CGCGACGGCG CACTGGTCAA CCTGATGAAC ACCACCCCCG ACGGCGACTA CCAGACCTAC AAGGCCAACG ACGGCGGCTA CGTACGCGAC CACTTCTTCG GCCGTGACCC GCGCACCAAG GCGCTGGTGG AACCGATGAC CGACGCCGAG ATCTGGAACC TCAAGCGCGG CGGGCACGAC TACCGCAAGG TCTACGCCGC GTACCGCGCG GCGATGGAGC ACAAGGGCCA GCCGACGGTC ATCCTCGCCA AGACCATCAA GGGCTACACC CTGGGTAAGC ACTTCGAGGG CCGCAACGCC ACCCATCAGA TGAAAAAGCT TGCGCTGCAA GACCTCAAGG ACTTCCGCGA CGCCCAGCGC ATCCCGATCG GCGATGCCCA GCTCGAGGAG AACCCCTACC TGCCGCCGTA CTACCACCCC GGCCCCGAGG CGCCCGAGAT CCGCTACATG CTCGACCGCA GGCGCGCGCT CGGCGGTTTC GTGCCGGAGC GTCGCACGAA GTCCAAGGCA CTCGCGCTGC CGAGCAGCGA TGCCTACAAG GCGCTGAAGA AGGGCTCCGG CAAGCAGGAG GTCGCCACCA CGATGGCGAC AGTCCGCACC TTCAAGGAGA TCCTGCGCGA CAAGCAGATC GGCCATCGCA TCGTGCCGAT CATCCCCGAC GAGGCGCGCA CGTTCGGGAT GGACTCCTGG TTCCCGAACC TCAAGATCTA CAACCGCAAC GGGCAGCTCT ACACCTCCGT CGACGCCGAG CTGATGCTGG CGTACCGGGA GAGCGAGGTC GGGCAGATCC TGCACGAGGG GATCAACGAG 
GCCGGTTCGG TGGGCACGTT CATCGCTGCG GGCACGTCGT ACGCCACGCA CAACGAGCCG ATGATCCCGA TCTACATCTT CTATTCGATG TTCGGGTTCC AGCGCACCGG TGACAGCTTC TGGGCGGCCG CCGACCAGAT GGCCCGTGGA TTCGTGCTGG GCGCGACGGC CGGGCGCACC ACGCTGGTCG GTGAGGGGTT GCAGCACGCC GACGGCCACT CACTGCTGCT GGCGTCGACC AACCCCGCGG TGGTGGCCTA CGACCCGGCG TTCGCCTACG AGATCGCCTA CATCATCGAA TCCGGGCTGC ACCGGATGTA CGGGGAGAAC CCGGAGAACG TCTACTTCTA TCTGACGATC TACAACGAGC CCTACGTCCA GCCGGCGGAA CCGGAGAACC TCGACGTCGA GGGTCTGTTG CGCGGGATCT ACCGGTACCG GGCCGCGGCG GAGAAGAAAT CCAACACCGC CCAGATCCTG GTGTCCGGCG TGGCGATGCC CTCGGCGCTC AAGGCCGCCG AGATGCTGGC CGAGGAGTGG GACGTGGCCG CCGACGTGTG GTCGGTGACC AGCTGGAACG AGCTCAACCG CGACGGCGTC CAGGTCGAGA AGGACCTGCT GCGCCATCCG GACCGGCCGG CGGGCACCCC GTACATCACC ACGGCGCTCG CCGACGCGGC CGGACCGGTC GTGGCGGTCT CGGACTGGAT GCGCGCGGTG CCCGAGCAGA TCCGGCCGTG GGTACCCGGT ACGTACATCA CGCTCGGCAC GGACGGTTTC GGATTCTCCG ACACCCGGCC CGCCGCGCGG CGGTTCTACA ACACCGACGC CGAGTCGATC ACCGTGGCGG TGCTCGAAGG GCTGGCCCGC GACGGCAACA TCGACATCTC GGTGGCCGTC GAGGCGGCCC GCCGCTACGA GATCGACGAC GTGCTGGCGG CGCCGGAGCA GACCTCCGAT CCCGGGGTGG CCTGA
|
Protein sequence | MRQDLAQNSS TAAEHDRVRV IREGVASYLP DIDPDETSEW LESFDQLLER SGPARARYLL LRLLERSGEQ RVAIPALTST DYVNTIPTEL EPWFPGDEDV ERRYRAWIRW NAAIMVHRAQ RPGVGVGGHI STYASSAALY EVGFNHFFRG KSHSGGGDQV FIQGHASPGI YARAYLEGRL TADQLDGFRQ EHSHPGGGIP SYPHPRLMPD FWEFPTVSMG LGPMNAIYQA RFNHYLHDRG IKDTTDQHVW AFLGDGEMDE PESRGLIHVA ALEALDNLTF VVNCNLQRLD GPVRGNGKII QELESFFRGA GWNVIKVVWG REWDALLHAD RDGALVNLMN TTPDGDYQTY KANDGGYVRD HFFGRDPRTK ALVEPMTDAE IWNLKRGGHD YRKVYAAYRA AMEHKGQPTV ILAKTIKGYT LGKHFEGRNA THQMKKLALQ DLKDFRDAQR IPIGDAQLEE NPYLPPYYHP GPEAPEIRYM LDRRRALGGF VPERRTKSKA LALPSSDAYK ALKKGSGKQE VATTMATVRT FKEILRDKQI GHRIVPIIPD EARTFGMDSW FPNLKIYNRN GQLYTSVDAE LMLAYRESEV GQILHEGINE AGSVGTFIAA GTSYATHNEP MIPIYIFYSM FGFQRTGDSF WAAADQMARG FVLGATAGRT TLVGEGLQHA DGHSLLLAST NPAVVAYDPA FAYEIAYIIE SGLHRMYGEN PENVYFYLTI YNEPYVQPAE PENLDVEGLL RGIYRYRAAA EKKSNTAQIL VSGVAMPSAL KAAEMLAEEW DVAADVWSVT SWNELNRDGV QVEKDLLRHP DRPAGTPYIT TALADAAGPV VAVSDWMRAV PEQIRPWVPG TYITLGTDGF GFSDTRPAAR RFYNTDAESI TVAVLEGLAR DGNIDISVAV EAARRYEIDD VLAAPEQTSD PGVA
|
| |
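The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of that behaviour, not the database's own pipeline. The function name `translate_cds` is hypothetical, and treating only GTG and TTG as alternative starts is a simplifying assumption.

```python
# Codon-to-amino-acid mapping, with bases ordered T, C, A, G at each position.
# For sense and stop codons, table 11 matches the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only these alternative start codons are handled here.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(dna) - len(dna) % 3
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_ten = translate_cds("GTGCGCCAGG ATCTGGCCCA AAACTCCTCC")
print(first_ten)
```

Applied to the first 30 bases, this yields MRQDLAQNSS, the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the full protein under the same assumptions.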