Gene Mkms_3424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3424 
SymbolaceE 
ID4611352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3589188 
End bp3591977 
Gene Length2790 bp 
Protein Length929 aa 
Translation table11 
GC content67% 
IMG OID639793098 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_939408 
Protein GI119869456 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCACCG AGTTCGTGCG CCAGGATCTG GCCCAAAACT CCTCCACCGC AGCCGAACAC 
GATCGTGTCC GGGTGATCCG TGAGGGTGTT GCATCGTATC TGCCCGACAT CGATCCCGAT
GAGACGAGCG AATGGCTGGA GTCGTTCGAC CAGCTGCTCG AACGCTCCGG ACCGGCGAGA
GCCCGCTACC TGCTGTTGCG GCTGCTCGAA CGCTCCGGGG AGCAGCGGGT GGCCATCCCC
GCGCTCACCT CGACCGACTA CGTGAACACG ATCCCGACCG AACTCGAGCC GTGGTTCCCC
GGCGACGAGG ACGTCGAACG CCGCTACCGG GCCTGGATCC GGTGGAACGC CGCGATCATG
GTGCACCGCG CGCAGCGTCC GGGAGTCGGT GTGGGCGGCC ACATCTCGAC GTACGCGTCG
TCGGCCGCGC TCTACGAGGT GGGCTTCAAC CACTTCTTCC GCGGTAAGAG CCACTCCGGC
GGCGGCGACC AGGTGTTCAT CCAGGGCCAC GCCTCCCCCG GGATCTACGC GCGCGCCTAT
CTCGAGGGCA GGCTGACCGC CGACCAACTC GACGGGTTCC GCCAGGAGCA CAGCCACCCC
GGCGGCGGTA TCCCGTCGTA CCCGCATCCG CGGCTGATGC CCGACTTCTG GGAGTTCCCC
ACGGTCTCGA TGGGCCTGGG CCCGATGAAC GCGATCTACC AGGCCCGGTT CAACCACTAC
CTGCACGACC GCGGCATCAA GGACACCACC GACCAGCATG TGTGGGCGTT CCTCGGCGAC
GGCGAGATGG ACGAACCGGA GAGCCGGGGG CTCATCCACG TGGCCGCCCT CGAGGCGCTG
GACAACCTGA CGTTCGTCGT CAACTGCAAC CTGCAGCGCC TCGACGGCCC GGTGCGCGGC
AACGGCAAGA TCATCCAGGA ACTGGAGTCG TTCTTCCGCG GCGCCGGCTG GAACGTCATC
AAGGTGGTGT GGGGCCGCGA ATGGGATGCG CTGCTGCACG CCGACCGCGA CGGCGCACTG
GTCAACCTGA TGAACACCAC CCCCGACGGC GACTACCAGA CCTACAAGGC CAACGACGGC
GGCTACGTAC GCGACCACTT CTTCGGCCGT GACCCGCGCA CCAAGGCGCT GGTGGAACCG
ATGACCGACG CCGAGATCTG GAACCTCAAG CGCGGCGGGC ACGACTACCG CAAGGTCTAC
GCCGCGTACC GCGCGGCGAT GGAGCACAAG GGCCAGCCGA CGGTCATCCT CGCCAAGACC
ATCAAGGGCT ACACCCTGGG TAAGCACTTC GAGGGCCGCA ACGCCACCCA TCAGATGAAA
AAGCTTGCGC TGCAAGACCT CAAGGACTTC CGCGACGCCC AGCGCATCCC GATCGGCGAT
GCCCAGCTCG AGGAGAACCC CTACCTGCCG CCGTACTACC ACCCCGGCCC CGAGGCGCCC
GAGATCCGCT ACATGCTCGA CCGCAGGCGC GCGCTCGGCG GTTTCGTGCC GGAGCGTCGC
ACGAAGTCCA AGGCACTCGC GCTGCCGAGC AGCGATGCCT ACAAGGCGCT GAAGAAGGGC
TCCGGCAAGC AGGAGGTCGC CACCACGATG GCGACAGTCC GCACCTTCAA GGAGATCCTG
CGCGACAAGC AGATCGGCCA TCGCATCGTG CCGATCATCC CCGACGAGGC GCGCACGTTC
GGGATGGACT CCTGGTTCCC GAACCTCAAG ATCTACAACC GCAACGGGCA GCTCTACACC
TCCGTCGACG CCGAGCTGAT GCTGGCGTAC CGGGAGAGCG AGGTCGGGCA GATCCTGCAC
GAGGGGATCA ACGAGGCCGG TTCGGTGGGC ACGTTCATCG CTGCGGGCAC GTCGTACGCC
ACGCACAACG AGCCGATGAT CCCGATCTAC ATCTTCTATT CGATGTTCGG GTTCCAGCGC
ACCGGTGACA GCTTCTGGGC GGCCGCCGAC CAGATGGCCC GTGGATTCGT GCTGGGCGCG
ACGGCCGGGC GCACCACGCT GGTCGGTGAG GGGTTGCAGC ACGCCGACGG CCACTCACTG
CTGCTGGCGT CGACCAACCC CGCGGTGGTG GCCTACGACC CGGCGTTCGC CTACGAGATC
GCCTACATCA TCGAATCCGG GCTGCACCGG ATGTACGGGG AGAACCCGGA GAACGTCTAC
TTCTATCTGA CGATCTACAA CGAGCCCTAC GTCCAGCCGG CGGAACCGGA GAACCTCGAC
GTCGAGGGTC TGTTGCGCGG GATCTACCGG TACCGGGCCG CGGCGGAGAA GAAATCCAAC
ACCGCCCAGA TCCTGGTGTC CGGCGTGGCG ATGCCCTCGG CGCTCAAGGC CGCCGAGATG
CTGGCCGAGG AGTGGGACGT GGCCGCCGAC GTGTGGTCGG TGACCAGCTG GAACGAGCTC
AACCGCGACG GCGTCCAGGT CGAGAAGGAC CTGCTGCGCC ATCCGGACCG GCCGGCGGGC
ACCCCGTACA TCACCACGGC GCTCGCCGAC GCGGCCGGAC CGGTCGTGGC GGTCTCGGAC
TGGATGCGCG CGGTGCCCGA GCAGATCCGG CCGTGGGTAC CCGGTACGTA CATCACGCTC
GGCACGGACG GTTTCGGATT CTCCGACACC CGGCCCGCCG CGCGGCGGTT CTACAACACC
GACGCCGAGT CGATCACCGT GGCGGTGCTC GAAGGGCTGG CCCGCGACGG CAACATCGAC
ATCTCGGTGG CCGTCGAGGC GGCCCGCCGC TACGAGATCG ACGACGTGCT GGCGGCGCCG
GAGCAGACCT CCGATCCCGG GGTGGCCTGA
 
Protein sequence
MTTEFVRQDL AQNSSTAAEH DRVRVIREGV ASYLPDIDPD ETSEWLESFD QLLERSGPAR 
ARYLLLRLLE RSGEQRVAIP ALTSTDYVNT IPTELEPWFP GDEDVERRYR AWIRWNAAIM
VHRAQRPGVG VGGHISTYAS SAALYEVGFN HFFRGKSHSG GGDQVFIQGH ASPGIYARAY
LEGRLTADQL DGFRQEHSHP GGGIPSYPHP RLMPDFWEFP TVSMGLGPMN AIYQARFNHY
LHDRGIKDTT DQHVWAFLGD GEMDEPESRG LIHVAALEAL DNLTFVVNCN LQRLDGPVRG
NGKIIQELES FFRGAGWNVI KVVWGREWDA LLHADRDGAL VNLMNTTPDG DYQTYKANDG
GYVRDHFFGR DPRTKALVEP MTDAEIWNLK RGGHDYRKVY AAYRAAMEHK GQPTVILAKT
IKGYTLGKHF EGRNATHQMK KLALQDLKDF RDAQRIPIGD AQLEENPYLP PYYHPGPEAP
EIRYMLDRRR ALGGFVPERR TKSKALALPS SDAYKALKKG SGKQEVATTM ATVRTFKEIL
RDKQIGHRIV PIIPDEARTF GMDSWFPNLK IYNRNGQLYT SVDAELMLAY RESEVGQILH
EGINEAGSVG TFIAAGTSYA THNEPMIPIY IFYSMFGFQR TGDSFWAAAD QMARGFVLGA
TAGRTTLVGE GLQHADGHSL LLASTNPAVV AYDPAFAYEI AYIIESGLHR MYGENPENVY
FYLTIYNEPY VQPAEPENLD VEGLLRGIYR YRAAAEKKSN TAQILVSGVA MPSALKAAEM
LAEEWDVAAD VWSVTSWNEL NRDGVQVEKD LLRHPDRPAG TPYITTALAD AAGPVVAVSD
WMRAVPEQIR PWVPGTYITL GTDGFGFSDT RPAARRFYNT DAESITVAVL EGLARDGNID
ISVAVEAARR YEIDDVLAAP EQTSDPGVA