Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA3000 |
Symbol | aceE |
ID | 3104517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 3173672 |
End bp | 3176359 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637172126 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_115388 |
Protein GI | 53802927 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.685258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGCAA GCAATCCCTC CCTCAACTCC CAGTTCGCGG CCAACACGGA TACCGATCCG GAGGAAACCA GGGAGTGGCT GGACGCGCTT GCGGCCGTCA TCGAAACAGA AGGCGTCGAG CGCGCGCATT TCCTGATCGA AAAGCTCGTC GACAAGGCGC GCCGCTCCGG CGCGAATCTG CCTTACAAGG CCAACACGGC GTACATCAAC ACCATTCCGC CTCATGCGGA GGCTCGCAGC CCCGGTGACG CCGGCATCGA GCACCGCATC CGCTGCTATC TGCGTTGGAA CGCCATGGCG ATGGTGGTGC GGGCAAACCA GAAATCCACC GAGTACGGCG GACACATTTC CAGCTTCGCT TCCTCGGCCA CGCTCTACGA CGTCGGTTTC AACCATTTCT TCCACGCTCC CGACCGGGAC CATGGCGGCG ACCTGGTGTT TTTCCAGGGC CATTCGGCGC CCGGCATCTA TGCCAGGGCA TTTCTGGAAG GACGGCTGAC CGAGGAGCAT CTGGACCGGT TCCGCGCCGA AGTGGGCGGC GGGGGGCTGT CATCCTATCC GCACCCCTGG CTGATGCCGG ATTTCTGGCA GTTCCCGACC GTCTCCATGG GCCTCGGCCC GCTCATGGCG ATTTACCAGG CGCGGTTCAT GAAATACCTG GCCGATCGCA AGATCCTCCT GAATACCGAA GGCCGCAAGG TCTGGTGCTT CTGCGGGGAC GGCGAGATGG ATGAGCCGGA ATCGATGGGC GCCATCGGCC TTGCCGGCCG GGAAAAACTG GACAACCTGA TCTTCGTGGT CAACTGCAAC CTGCAGCGCC TGGACGGACC GGTGCGCGGC GACGGCAAGA TCATCCAGGA CCTGGAGGCC GAATTCCGCG GCGCCGGCTG GAACGTCATC AAAGTGATCT GGGGCTCCCA CTGGGATGCG CTGCTGGCGC GCGATACCAA GGGCCTCCTG CGGCAGCGGA TGGAAGAGGT GGTGGACGGC GAGTACCAGA CGTACAAGGC CAAAGACGGC GCCTATGTCC GCAAGCATTT CTTCGGCAAG TATCCCGAAC TGCTGGAAAT GGTCGCCAAC ATGTCCGACG AGGACATCTG GCATCTGAAC CGGGGCGGTC ACGACCCGCA CAAGGTGTAT GCCGCCTACG CGGCCGCGGT GGCGCACAAG GGACAGCCGA CGGTGATCCT GGCCAAGACC ATCAAGGGCT ATGGCATGGG CCGGGCCGGT GAAGGCCAGA TGATCGCCCA TCAGCAGAAG AAGCTGGACG CCGAGGCGCT CAAGGCCTTC CGCGACCGTT TCAACATTCC GATCCCGGAC GACAAGGTGC ATGAGGCGCC GTACTATAAA CCGGCCGAAG ACAGCCCCGA GATGATCTAT CTGCAGGAGC GCCGCAAGGC CTTGGGCGGT TACCTACCGC AACGGCGGAA GGAGGCACCC CACCTCCAGA TACCCGAGCT GAGCATCTTC GAGACGATGC TGAAAAGCTC GGAGGACCGA GAAATGTCCA CCACCATGGT GTTCGTCCGC CTGCTGTCGT CCCTGCTGCG GGACAAGGCG CTGGGCCGCT ACGTGGTGCC GATCGTGCCC GACGAGGCCC GCACCTTCGG CATGGAGGGC CTGTTCCGGC AATACGGCAT CTATTCCTCC GTCGGCCAGC TCTACGAACC GCAGGACGCC GACACCGTGA TGTTCTACCG GGAGGACAAA TCCGGCCAGA TCCTGGAGGA AGGCATCTGC GAAGCCGGCG CGATGTGCGA CTGGATCGCC GCCGGTACGG CGTTCAGCAA CCACAACGTG CAGATGATCC CGTTCTACAT CTACTATTCG ATGTTCGGCT TCCAGCGCGT CGGTGACCTG ATGTGGGCGG CGGGCGACAT GCAGGCACGC GGCTTCCTCA TGGGCGGCAC CGCCGGACGG ACCACGCTGG CCGGCGAAGG TCTGCAACAC CAGGACGGCC ACAGCCACCT GATCATGGGC GCCATCCCCA ACTGCATCAC CTATGATCCG ACCTTCGCCT ACGAGCTGGC GATCATCGTC CACGACGGTC TGCGCCGGAT GTATCAGGAA GGCGAAAACG TCTTCTACTA CATCACGGTG ATGAACGAGA ACTACACCCA TCCTGCCCTG CCGAAAGGGG CCGAGGAAGG CATCATCAAA GGTTTGTACA AGTTCCGTGA TGCCGGCAAG GACGAGGCAG TGCAGCTCCT CGGCAGCGGG ACCATTCTGC GCGAAGTCAT CAAGGCCGCG GAGTTGCTGG AAGCCGATTA CGGCATCAAG GCCGGCATCT GGAGCGCCAC CAGCTTCAAC CAGTTGCGCC GCGACGGCCT CGAGGTTTCG CGCTGGAACC TGCTGCATCC GGAACAGCCG CGGAAGACCA GCTACGTCGA ACGATGCCTG GCCCCCACCG CCGGCCCCGT CGTCGCCGCC ACGGACTACG TCAAAGCCTA TCCGGACCTG ATCCGGGAGT TCGTACCCCG ACGCTACACC GTGCTGGGTA CGGACGGCTT CGGCCGCAGC GACCGTCGGT CGGCGCTGCG CCGGTTCTTC GAGGTGGACA GCCACTACAT CGCGTTCGCG GCCTTGAAGT CACTGGCCGA TGAAGGCGGC ATCGAGAAAA CCAAGGTGTC CGAGGCGATG ACCCGCTGGA ACATCGATCC GGACAAGGCC AACCCCATCG GGTGCTGA
|
Protein sequence | MIASNPSLNS QFAANTDTDP EETREWLDAL AAVIETEGVE RAHFLIEKLV DKARRSGANL PYKANTAYIN TIPPHAEARS PGDAGIEHRI RCYLRWNAMA MVVRANQKST EYGGHISSFA SSATLYDVGF NHFFHAPDRD HGGDLVFFQG HSAPGIYARA FLEGRLTEEH LDRFRAEVGG GGLSSYPHPW LMPDFWQFPT VSMGLGPLMA IYQARFMKYL ADRKILLNTE GRKVWCFCGD GEMDEPESMG AIGLAGREKL DNLIFVVNCN LQRLDGPVRG DGKIIQDLEA EFRGAGWNVI KVIWGSHWDA LLARDTKGLL RQRMEEVVDG EYQTYKAKDG AYVRKHFFGK YPELLEMVAN MSDEDIWHLN RGGHDPHKVY AAYAAAVAHK GQPTVILAKT IKGYGMGRAG EGQMIAHQQK KLDAEALKAF RDRFNIPIPD DKVHEAPYYK PAEDSPEMIY LQERRKALGG YLPQRRKEAP HLQIPELSIF ETMLKSSEDR EMSTTMVFVR LLSSLLRDKA LGRYVVPIVP DEARTFGMEG LFRQYGIYSS VGQLYEPQDA DTVMFYREDK SGQILEEGIC EAGAMCDWIA AGTAFSNHNV QMIPFYIYYS MFGFQRVGDL MWAAGDMQAR GFLMGGTAGR TTLAGEGLQH QDGHSHLIMG AIPNCITYDP TFAYELAIIV HDGLRRMYQE GENVFYYITV MNENYTHPAL PKGAEEGIIK GLYKFRDAGK DEAVQLLGSG TILREVIKAA ELLEADYGIK AGIWSATSFN QLRRDGLEVS RWNLLHPEQP RKTSYVERCL APTAGPVVAA TDYVKAYPDL IREFVPRRYT VLGTDGFGRS DRRSALRRFF EVDSHYIAFA ALKSLADEGG IEKTKVSEAM TRWNIDPDKA NPIGC
|
| |