Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2128 |
Symbol | aceE |
ID | 4784347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2277739 |
End bp | 2280468 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640090696 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001021319 |
Protein GI | 124267315 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00934481 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCAT TGCCGGAATC CTTTGGCCGT TCGGCCGCCA ACGACAGTGA CGCGCAGGAA ACGCGCGAAT GGCTGGAGGC GCTGGCCGCC GTGATCGACA GCGAGGGGCC GCAGCGCGCG CACTTCCTCC TCGAGCGCCT GATCGACGAG GCGCGCCAGG CCGGCATCGA CATGCCGTTC TCGGCCACCA CGCCCTACGT CAACACCATC CCGGCCGGCC AGGAACTGCA CAGTCCGGGG CAGATCGACA TCGAGGAGCG GCTGCGTGCC TACATGCGCT GGAACGCGAT GGCGATGGTG GTCAAGGCCA ACCGCCTCGA CCCGGCCGAC GGCGGCGACC TCGGCGGCCA CATCTCCAGC TTCGCCTCGC TGGCCACGAT GTTCGGCGCC GGCTTCAACC ACTTCTGGCA CGCCGACGAC ACCGACCAGG GCGGCAAGCA CGGCGGCGAC CTGCTCTACA TCCAGGGCCA TTCTTCGCCC GGCATCTACG CCCGCGCCTT CATGGAAGGC CGGATCACCG AGGAGCAACT GCTCAACTTC CGCCAGGAGG TCGACGGTAA GGGCCTCTCC AGCTACCCGC ACCCGAAGCT GATGCCGGAG TTCTGGCAGT TCCCCACCGT CTCGATGGGC CTCGGCCCGC TGATGGCGAT CTACCAGGCG CGCTTCCTGA AGTACCTCCA CGCGCGCGGC ATTGCCGACA CATCGAAGCG CAAGGTCTGG GTGTTCTGCG GCGACGGCGA GATGGACGAG CCCGAGTCGC TGGGCGCCAT CGGCCTGGCG GCGCGCGAGA AGCTCGACAA CCTGATCTTC GTCATCAACT GCAACCTGCA GCGGCTGGAC GGCCCGGTGC GCGGTAACGG CAAGATCATC CAGGAGCTCG AGGGCGAGTT CCGCGGCTCG GGCTGGAATG TCATCAAGCT GATCTGGGGC AGCTACTGGG ACCCGCTGCT GGCGCGCGAC AAGGAGGGCC TGCTGCGCAA GGTGATGATG GAGACGGTCG ACGGCGACTA CCAGGCGATG AAGGCCAACG ACGGCGCTTT CGTGCGCAAG CACTTCTTCG GCCAGCACCC CAAGCTGCTG GAGATGGTGT CCAAGCTGAG CGACGACGAC ATCTGGCGCC TGAACCGCGG TGGTCACGAC CCGCACAAGG TGCATGCCGC CTACCACGAG GCCGTGAACC ACAAGGGCCA GCCCACCGTG CTGCTGATCA AGACGGTCAA GGGCTATGGC ATGGGCAAGA TCGGCGAGGG CAAGAACACC GCCCACCAGA CCAAGAAGCT GGTCGACGAG GACGTGAAGG CTTTCCGCGA CCGCTTCAAC ATCCCCATCC CCGACGACAA GCTGCACGAC ATTCCGTTCT ACAAGCCGGC CGACGACACG CCGGAGATGC AGTATCTGCA CGAGCGCCGC AAGGCCCTCG GCGGCTACCT GCCCAAGCGC CGTCCGAAGG CCGACGAGCA GCTGCCGGTG CCCGATCTGT CGGTGTTCCA GGCGGTGATG GAGCCGACAG CCGAAGGCCG CGAGATCAGC ACCACGCAGG CCTATGTGCG CTGTCTGAAC GCGCTGCTGC GCGACAAGGC CCTGGGCCCG CGCACCGTGC CGATCCTCGT CGACGAGGCG CGCACCTTCG GCATGGAAGG CCTGTTCCGC CAGATCGGCA TCTACAACCC CGCGGGGCAG CAGTACACGC CGGTCGACAA GGACCAGGTC ATGTACTACA AGGAAGACAA GGCCGGCCAG ATCCTGCAGG AAGGCATCAA CGAGGCGGGC GGGATGAGCT CGTGGATCGC TGCGGCGACG TCGTACTCGA CGAACAACCG CATCATGATT CCGTTCTACA TCTACTACTC GATGTTCGGG CTGCAGCGCG TCGGCGACCT GTGCTGGGCC GCGGGCGACA TGCAGGCGCG CGGCTTCCTG CTCGGCGGCA CCTCGGGCCG CACCACGCTC AACGGCGAGG GCCTGCAGCA CGAGGACGGC CACAGCCACA TCCTCGCGAG CACGATCCCC AACTGCATCA GCTACGACCC GACCTTCGCG CACGAGGTGG CGGTCATCAT CCACCATGGC CTGAAGCGCA TGGTCGAGCA GCAGGACAAC GTCTACTTCT ATCTCACGCT GCTCAACGAG AACTACGCGC AACCGGGCCT GAGGCCGGGC ACCGAGACGC AGATCGTCAA GGGCATGTAC CTGCTGCTCG AGGGCGGGTC GAAGAAGAAG GACGCGCCGC AGGTCAACCT GCTGGGCAGC GGCACCATCC TGCGCGAGTC GATCGCCGCG AAGGAGTTGC TCGAGAAGGA CTGGGGCGTG TCCGCCAATG TGTGGAGCTG CCCGAGCTTC AACGAACTGG CGCGCGACGG CCAGGACGCC GACCGCTGGA ACCTGCTGCA CCCGGCCGAC AAGAAGCCGC GCGTGCCCTT CGTGACGCAG CAGCTCGAGC CGCATGCCGG GCCGGTGGTG GCGTCGACCG ACTACATGAA GGCCTACGCG GAGCAGATCC GCGCCTTCAT CCCGAAGGGC CGCAGCTACA AGGTGCTGGG GACCGACGGC TTCGGCCGCA GTGACTTCCG CACCAAGCTG CGCGAACACT TTGAGGTGAA CCGCCACTAC GTCGTGGTTG CCGCGCTCAA GGCCCTGGCA GACGAGGGCA CGGTGCCGGC TGCGAAGGTG GCCGAGGCGA TCAAGAAGTA CGGGATCAAC GCTGACAAGA TCAACCCCTT GTACGCGTGA
|
Protein sequence | MSALPESFGR SAANDSDAQE TREWLEALAA VIDSEGPQRA HFLLERLIDE ARQAGIDMPF SATTPYVNTI PAGQELHSPG QIDIEERLRA YMRWNAMAMV VKANRLDPAD GGDLGGHISS FASLATMFGA GFNHFWHADD TDQGGKHGGD LLYIQGHSSP GIYARAFMEG RITEEQLLNF RQEVDGKGLS SYPHPKLMPE FWQFPTVSMG LGPLMAIYQA RFLKYLHARG IADTSKRKVW VFCGDGEMDE PESLGAIGLA AREKLDNLIF VINCNLQRLD GPVRGNGKII QELEGEFRGS GWNVIKLIWG SYWDPLLARD KEGLLRKVMM ETVDGDYQAM KANDGAFVRK HFFGQHPKLL EMVSKLSDDD IWRLNRGGHD PHKVHAAYHE AVNHKGQPTV LLIKTVKGYG MGKIGEGKNT AHQTKKLVDE DVKAFRDRFN IPIPDDKLHD IPFYKPADDT PEMQYLHERR KALGGYLPKR RPKADEQLPV PDLSVFQAVM EPTAEGREIS TTQAYVRCLN ALLRDKALGP RTVPILVDEA RTFGMEGLFR QIGIYNPAGQ QYTPVDKDQV MYYKEDKAGQ ILQEGINEAG GMSSWIAAAT SYSTNNRIMI PFYIYYSMFG LQRVGDLCWA AGDMQARGFL LGGTSGRTTL NGEGLQHEDG HSHILASTIP NCISYDPTFA HEVAVIIHHG LKRMVEQQDN VYFYLTLLNE NYAQPGLRPG TETQIVKGMY LLLEGGSKKK DAPQVNLLGS GTILRESIAA KELLEKDWGV SANVWSCPSF NELARDGQDA DRWNLLHPAD KKPRVPFVTQ QLEPHAGPVV ASTDYMKAYA EQIRAFIPKG RSYKVLGTDG FGRSDFRTKL REHFEVNRHY VVVAALKALA DEGTVPAAKV AEAIKKYGIN ADKINPLYA
|
| |