Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3752 |
Symbol | aceE |
ID | 4646818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 3990551 |
End bp | 3993325 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639807217 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_954540 |
Protein GI | 120404711 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0660504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCCAGG ACCTGGCCCA AAACTCCGGC AGCACAGCCG AACCCGACCG GGTTCGCGTC ATCAGAGAAG GCGTCGCCTC GTATCTGCCG GACATCGACC CCGAAGAGAC CGGCGAGTGG CTGGAGTCGT TCGATGATCT GCTCGAGCGT TCCGGCCCGG CGCGGGCCCG CTATCTCATG TTGCGGCTGC TGGAACGGTC CCGGGAGAAG CGGGTGGCCA TCCCGGCGCT GACGTCCACC GACTACGTCA ACACGATCCC GACCGAACTG GAGCCGTGGT TCCCCGGCGA CGAGGACGTC GAGCGCCGCT ACCGCGCGTG GATCCGCTGG AACGCCGCGA TCATGGTGCA CCGCGCGCAG CGGCCGGGAG TCGGTGTGGG TGGCCATATC TCGACCTACG CGTCCTCGGC GGCGCTCTAC GAGGTGGGCT TCAACCACTT CTTCCGCGGC AAGAGCCACC CCGGCGGCGG CGACCAGATC TTCATCCAGG GCCACGCCTC CCCCGGCATC TACGCCCGCG CCTACCTCGA AGGCCGGCTG AGCGCCGACC AACTCGACGG TTTCCGTCAG GAGCACAGCC ATCCCGGCGG CGGGCTGCCG TCGTATCCGC ACCCGCGGCT GATGCCCGAT TTCTGGGAGT TCCCGACGGT GTCGATGGGC CTGGGCCCGA TGAACGCGAT CTATCAGGCG CGGTTCAACC ATTACCTGCA CGACCGGGGC ATCAAGGACA CCTCCAAACA GCACGTGTGG GCGTTCCTCG GCGACGGTGA GATGGACGAG CCGGAAAGCC GCGGGCTGAT CCAGGTGGCC GCCAACGAGG CCCTGGACAA CCTGACCTTC GTCGTCAACT GCAACCTGCA GCGCCTCGAC GGCCCGGTGC GCGGCAACGG CAAGATCATC CAGGAGCTCG AGTCGTTCTT CCGCGGCGCC GGCTGGAACG TCATCAAGGT GGTGTGGGGC CGCGAGTGGG ACGCGCTGCT GCACGCCGAC CGGGACGGCG CTCTGGTGAA CCTGATGAAC ACCACCCCGG ACGGGGATTA CCAGACCTAC AAGGCCAACG ACGGCGCCTA CGTCCGCGAC CATTTCTTCG GCCGCGATCC GCGCACCAAG GCCCTCGTCG AGAACATGAC CGACCGGGAG ATCTGGAACC TCAAGCGCGG CGGGCACGAC TACCGCAAGG TGTATGCGGC GTACCACGCG GCGCTGGAGC ACAAGGGCCA GCCGACGGTG ATCCTGGCCA AGACCATCAA GGGCTACACG CTGGGCAAGC ACTTCGAGGG CCGCAACGCT ACCCACCAGA TGAAAAAGCT TGCGCTGCAA GACCTCAAGG ACTTCCGCGA CGCGCAGCGC ATCCCGGTCA GCGACGAGCA GCTGGAAGCC GATCCGTACC TGCCGCCGTA CTACCATCCG GGTCCCGAGG CCCCGGAGAT CCGCTACATG CTCGACCGCC GGCGCGCCCT TGGCGGGTTC CTGCCCGAGC GTCGCACCAA GTCCAGAGCG CTGACCCTGC CGGGGCGCGA CGTCTACAAG GCGCTCAAGA AGGGGTCGGG CAAGCAGGAG GTCGCCACCA CCATGGCGAC CGTGCGCACC TTCAAGGAAC TGTTGCGCGA CAAGAACATC GGATCGCGCA TTGTTCCCAT CATCCCCGAC GAGGCCCGCA CGTTCGGCAT GGACTCGTGG TTCCCCAGCC TCAAGATCTA CAACCGCAAC GGCCAGCTCT ACACCGCGGT GGACGCCGAG TTGATGCTCG CCTACAAGGA GAGCGAAATC GGCCAGATCC TGCACGAAGG CATCAACGAG GCGGGGTCGA CGGCGTCGTT CACCGCGGTG GGCACCTCGT ATGCCACCCA CAACGAGCCG ATGATCCCGA TCTACATCTT CTACTCGATG TTCGGGTTCC AGCGGACCGG AGACGGACTG TGGGCGGCCG CCGACCAGAT GGCGCGCGGC TTCGTCCTCG GCGCCACCGC CGGGCGCACC ACACTCGTCG GGGAAGGGCT GCAGCACGCC GACGGCCACT CGCTGCTGCT GGCGTCCACC AACCCCGCGG TGGTGGCCTA CGACCCGGCG TTCGCCTACG AGATCGCCTA CATCATCGAA AGCGGCCTGC ACCGCATGTA CGGCGAGAAC CCGGAGAACG TCTACTTCTA CATGACGATC TACAACGAGC CCTACGTCCA GCCCGCCGAA CCCGACGGCC TCGACGTCGA GGGCCTGCTG CGCGGCATGT ACCGGTACCA GGCCGCCCCC GAGAAGCGCA CCAACGCCGC CCAGATCCTG GTGTCCGGGG TGGGCATGCC GTCGGCGCTC AAGGCCGCCG AGCTGCTGGC GCAGGAGTGG GACGTCGCCG CCGACGTCTG GTCGGTGACC AGCTGGGGCG AGCTCAACCG CGACGGCGTC GCCATCGAGA GGGAGAAGCT GCGCCATCCC GAGCTCCCCG CCGCCACGCC CTACGTCACC AAGATGCTGG CAGAGGCCGC CGGACCCGTC GTCGCGGTGT CGGACTGGAT GCGCGCCGTG CCTGAGCAGA TCCGGCCCTG GGTGCCCGGC ACGTACATCA CGCTCGGCAC CGACGGCTTC GGCTTCTCCG ACACCCGCCC GGCCGCCCGG CGCTACTACA ACACCGACGC CGAGTCGGTG GTGGTGGCCG TGCTCGAAGG GCTGGCACGC GACGGCAACA TCGACATCTC GGTGGCCGTG GAGGCGGCGA AGAAGTACGA GATCGACGAT GTGATGGCGG CACCGGAGCA GACGTCGGAT CCCGGAGTCG CGTAG
|
Protein sequence | MRQDLAQNSG STAEPDRVRV IREGVASYLP DIDPEETGEW LESFDDLLER SGPARARYLM LRLLERSREK RVAIPALTST DYVNTIPTEL EPWFPGDEDV ERRYRAWIRW NAAIMVHRAQ RPGVGVGGHI STYASSAALY EVGFNHFFRG KSHPGGGDQI FIQGHASPGI YARAYLEGRL SADQLDGFRQ EHSHPGGGLP SYPHPRLMPD FWEFPTVSMG LGPMNAIYQA RFNHYLHDRG IKDTSKQHVW AFLGDGEMDE PESRGLIQVA ANEALDNLTF VVNCNLQRLD GPVRGNGKII QELESFFRGA GWNVIKVVWG REWDALLHAD RDGALVNLMN TTPDGDYQTY KANDGAYVRD HFFGRDPRTK ALVENMTDRE IWNLKRGGHD YRKVYAAYHA ALEHKGQPTV ILAKTIKGYT LGKHFEGRNA THQMKKLALQ DLKDFRDAQR IPVSDEQLEA DPYLPPYYHP GPEAPEIRYM LDRRRALGGF LPERRTKSRA LTLPGRDVYK ALKKGSGKQE VATTMATVRT FKELLRDKNI GSRIVPIIPD EARTFGMDSW FPSLKIYNRN GQLYTAVDAE LMLAYKESEI GQILHEGINE AGSTASFTAV GTSYATHNEP MIPIYIFYSM FGFQRTGDGL WAAADQMARG FVLGATAGRT TLVGEGLQHA DGHSLLLAST NPAVVAYDPA FAYEIAYIIE SGLHRMYGEN PENVYFYMTI YNEPYVQPAE PDGLDVEGLL RGMYRYQAAP EKRTNAAQIL VSGVGMPSAL KAAELLAQEW DVAADVWSVT SWGELNRDGV AIEREKLRHP ELPAATPYVT KMLAEAAGPV VAVSDWMRAV PEQIRPWVPG TYITLGTDGF GFSDTRPAAR RYYNTDAESV VVAVLEGLAR DGNIDISVAV EAAKKYEIDD VMAAPEQTSD PGVA
|
| |