Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0874 |
Symbol | |
ID | 4787197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 913034 |
End bp | 915709 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089435 |
Product | 2-oxoacid dehydrogenase subunit E1 |
Protein accession | YP_001020071 |
Protein GI | 124266067 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type [TIGR03186] alpha-ketoglutarate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.242196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAG TGTTCGAACC CCTGCTGTTC CAGGCCGCCG CCGCCGACAC CGATCCCGCC GAGACCGCCG AGTGGCGCGA TGCCCTGGCC GCGGTGCTGG CCGCCGCCGG GCCGGAGCGC GCGCGTTTCC TGCTGGACCA GCTGGCCGCG CAGGCCAGCG AGCCCGCCAT CGCCTGGCAG CCGCGCGGCG GCGGCTCGCC CTACGTCAAC ACCATCCCGG TCGAGCGCCA GCCGCGCTTC CCCGGCGATC TGGCGATCGA GGAGCGCCTG GCCTCCCTGA TGCGCTGGAA CGCGCTGGCG ATGGTGGTGC GCGCCAATCA GGCCTACGGC GAACTCGGCG GCCACATCGC CAGCTACGCG AGCGCGGCCG ATCTGTTCGA GGTCGGCTTC AACCACTTCT TCCGCGCGCG CAACGCGCAG CAGGGTGGCG ACCTGGTGTT CTACCAGCCG CACTCGGCGC CCGGCGTCTA CGCGCGTGCC TTCCTCGAAG GCCGCCTGAG CGAGGCCGAC CTGGCCCACT ACCGGCAGGA GCTCGCCGCG CCGGCGGCCG GCGCGCAGGG CCTGTCGAGC TATCCGCATC CGTGGCTGAT GCCCGAGTTC TGGCAGTTCC CCACCGGCTC GATGGGCATC GGCCCCATCA GCTCGATCTA CCAGGCGCGC TTCATGCGCT ACCTGCAGCA CCGCGGGCTG CTGCAGACCG AGGGCCGCCG CGTCTGGGGC GTGTTCGGCG ACGGCGAGAT GGACGAGCCC GAGAGCATGA GCGCGCTCAC GCTCGCCGCG CGCGAAAAGC TCGACAACCT CACATGGGTG ATCAACTGCA ACCTGCAGCG CCTGGACGGC CCGGTGCGCG GCAACGGCCG CATCATCGAC GAGCTGGAGG CGCTGTTCGC CGGCGCCGGC TGGCAGGTCA TCAAGCTGGT GTGGGGCTCC GACTGGGACG GCCTGTTCGC GCGCGACACC CACGGTGCGC TGCGCAAGGC CTTCGCCGGC ACCGTGGACG GCCAGTTCCA GACCTTCGCC GCCAAGGACG GCCGCTACAA CCGCGAGCAC TTCTTCGGCC AGACCCCCGA GCTCGCCGCG CTGGCCCAGG GCCTGACCGA CGAGCAGATC GACCGGCTGC GCCGCGGCGG CCACGACATG GTGAAGATCC ACGCCGCCTA CCACGCGGCG ATGCAGTGCC GCGGCCGGCC GGTGGTGATC CTGGCGCAGA CCAAGAAGGG CTACGGCATG GGCGAGGCCG GGCAGGGCCG CATGACCACG CACCAGCAGA AGAAGCTCGA CCGCGACGAC CTGATCGCCT TCCGCAACCG CTTCGCGCTG CCGCTCAGCG ACGAGCAGGC CGCGAGCCTG GCGTTCTACA AGCCGGCCGA CGACAGCCCC GAGATGCGTT ACCTGCACGC GCGCCGCGCC GCGCTGGGCG GCCAGATCCC GGCGCGGGTG GCGCAGGCGC CGCAGCTGGC GGTGCCGCCG GTGGCGCGCT GGGGTGACTT CGCGCTGAAC GCCGACGGCA AGGAGATGAG CACCACCATG GCCTTCGTGC GCATGCTCAC CGCGCTGCTG AAGGACGCGC AGCTCGGCCC GCGCATCGTG CCCATCGTCG CCGACGAGGC GCGCACCTTC GGCATGGCCA ACCTGTTCAA GCAGATCGGC ATCTACTCCA ACCTCGGCCA GAACTACGAG CCCGAGGACA TCGGCTCGGT GCTCAGCTAC CGCGAGGCCA CCGACGGCCA GATCCTGGAG GAGGGCATCA GCGAGGCCGG TGCGCTGAGC TCCTGGGTGG CCGCGGCCAC CAGCTACAGC GTGCACGGCG TGCCGATGCT GCCGTTCTAC ATCTACTACT CGATGTTCGG CTTCCAGCGC GTGGGCGACC TGATCTGGGC CGCGGCCGAC CAGCGCGCGC GCGGCTTCCT GCTCGGCGCC ACCGCGGGCC GCACCACGCT GGGCGGCGAG GGCCTGCAGC ACCAGGACGG CTCCAGCCCA CTGCAGGCGT CGACCGTGCC CAACTGCCGC ACCTGGGACC CGGCCTTCGC GGGCGAGGTG GCGGTGATCG TGGAGCACGG CATGCGCCGC ATGCTCGAGG AGCAGGCCGA CGAGTTCTTC TACCTGCTGC TGATGAACGA GAACTACGCC AACCCCTCGC TGCCCGAGGG TGCGGCGGCC GGCGTGCTGA AAGGCCTGTA CCGCTTCCGG CCGGCGCAGG GCAGGAAGGC CGCGCAGGTG CGCCTGCTCG GCTCCGGCGC GATCCTGCGC GAGGTGCTGG CCGCGGCCGA GCTGCTGCGC AACGACTACG GCGTGGAGGC CGAGGTGTGG AGCGCCACCA GCTACAGCGA GCTGCAGCGC GAGGCCATCG AGGTGGAGCG CCACAACCGC CTGCAGCCGC AGGCGCCGCG CCGCCGCAGC CACGTGGCGC AGTGCCTGGC CGGCAAGGCG CCGGTGGTGG CCGCGAGCGA CTACGTGCGC GCCTGGCCGC AGCTGATCGC ACCCTACGCC GAAGCCTCGC GCTTCGTGGC GCTGGGCACC GACGGCTTCG GCCGCAGCGA CACGCGCGAG CAGCTGCGCC GCCACTTCGA GGTGGACCGC GCGCACATCG CGCTGGCGGC GCTGGATGCG CTGGCGGACG CGGGGAGCGT GCCGCGGGAG GTGGCGGTGA AGGCAGCGGG GCGGTTGGCG GGCTGA
|
Protein sequence | MTQVFEPLLF QAAAADTDPA ETAEWRDALA AVLAAAGPER ARFLLDQLAA QASEPAIAWQ PRGGGSPYVN TIPVERQPRF PGDLAIEERL ASLMRWNALA MVVRANQAYG ELGGHIASYA SAADLFEVGF NHFFRARNAQ QGGDLVFYQP HSAPGVYARA FLEGRLSEAD LAHYRQELAA PAAGAQGLSS YPHPWLMPEF WQFPTGSMGI GPISSIYQAR FMRYLQHRGL LQTEGRRVWG VFGDGEMDEP ESMSALTLAA REKLDNLTWV INCNLQRLDG PVRGNGRIID ELEALFAGAG WQVIKLVWGS DWDGLFARDT HGALRKAFAG TVDGQFQTFA AKDGRYNREH FFGQTPELAA LAQGLTDEQI DRLRRGGHDM VKIHAAYHAA MQCRGRPVVI LAQTKKGYGM GEAGQGRMTT HQQKKLDRDD LIAFRNRFAL PLSDEQAASL AFYKPADDSP EMRYLHARRA ALGGQIPARV AQAPQLAVPP VARWGDFALN ADGKEMSTTM AFVRMLTALL KDAQLGPRIV PIVADEARTF GMANLFKQIG IYSNLGQNYE PEDIGSVLSY REATDGQILE EGISEAGALS SWVAAATSYS VHGVPMLPFY IYYSMFGFQR VGDLIWAAAD QRARGFLLGA TAGRTTLGGE GLQHQDGSSP LQASTVPNCR TWDPAFAGEV AVIVEHGMRR MLEEQADEFF YLLLMNENYA NPSLPEGAAA GVLKGLYRFR PAQGRKAAQV RLLGSGAILR EVLAAAELLR NDYGVEAEVW SATSYSELQR EAIEVERHNR LQPQAPRRRS HVAQCLAGKA PVVAASDYVR AWPQLIAPYA EASRFVALGT DGFGRSDTRE QLRRHFEVDR AHIALAALDA LADAGSVPRE VAVKAAGRLA G
|
| |