Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0269 |
Symbol | aceE |
ID | 4270487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 307186 |
End bp | 309873 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638124994 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_741114 |
Protein GI | 114319431 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.818226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCAG TTGAACCCCT CCCAGAGGGT GATGTTGATC CGCTCGAGAC CCGGGAGTGG CTCGAGGCAC TCGAGGCCGT TATCGAGGAG GAGGGCCCCG AGCGCGCCCA GTACCTGCTG GAGCAACTGG TCGAGAAAAC CCGCCGACGC GGCGGCGTGG CCCCCTTCAA GGCCACCACT GCGTACGAGA ACACCATCCC CCGCCACCTT GAGGCCCGCT CCCCCGGAGA CCACCAACTG GAGTGGCGCC TGCGTTCGAT CATGCGCTGG AACGCCATGG CGATGGTGGT GCAGGCCAAC AAGGAGCATG ACGGTATCGG GGGGCACATC GCCTCCTACG CCTCGGCGGC GACGCTCTAC GAGACCGGTT TCAACCACTT CTGGCATGCG CCGTCGGACG AGCACGGCGG CGACATGGTC TTCATCCAGG GCCATTCGGC GCCCGGTATC TATGCCCGGG CATTCCTCGA GGGCCGCCTC ACCGAGGATC AACTGCACAA TTTCCGCCAG GATGTCGGAG GTGAAGGCGT TACCTCCTAT CCTCACCCCT GGCTGATGCC GGAGTTCTGG CAGTTTCCCA CCGTCTCCAT GGGGCTGGGG CCGATCATGG CCATCTACCA GGCCCGGTTC ATGAAGTACA TGCATGACCG CGAGGTCATC GACACCGAGG GCCGCAAGGT GTGGGCCTTC ATGGGCGACG GTGAGATGGA CGAGCCGGAG TCCTTGGGGG CGATCGGCTT GGCCGCCCGC GAGAAGCTGG ACAACTTGGT GTTCGTGGTC AACTGCAACC TGCAGCGGCT GGACGGCCCG GTCCGCGGTA ACGCCAAGAT TATCCAGGAG CTGGAGGGCG AGTTCCGCGG CGCCGGCTGG AATGTCATCA AGCTGATCTG GGGCTCCGGC TGGGACGCGC TGCTGGAGCG CGACCGCACT GGGCTGCTGC GCAAGCGCAT GGAAGAGTGC GTGGACGGCG AGTACCAGAA CTTCAAGGCC AAGGGCGGTG CCTACACCCG CGAGCACTTC TTCGGCAAGT ACCCGGAGCT CAAGGACATG GTGGCCCACA TGTCCGACGA GGAGATTGCC CGGCTCAACC GCGGCGGCCA CGATCCGCAC AAGGTCTACG CCGCCTACCA CGCGGCGGCC AACCACAAGG GCCAGCCCAC CGTCATCCTG GTGAAGACGG TGAAGGGCTA CGGCATGGGC GAGGCCGGCG AGGGCCAGAA CATCACCCAC CAGCAGAAGA AGATGGGTGA GGCGGCGCTC AAGGCCTTCC GCGATCGGTT CGATATCCCG ATCCCGGACG ATAAGATCGC CGAGGCGCCC TTCTACAAGC CGGACGATGA CAGTCCGGAG ATCAAGTACC TGCACGAGCG CCGCCAGGCG TTGGGTGGCT ACCTGCCCAG TCGTCGCACC GAGGCGGCGC CGCTGAAGGC GCCGGCGCTG TCCGCCTTCG ACGTGCTGCT GAAGGACAGC GGCGACCGCG AGATGTCCAC CACCATGGCC TTCGTGCGGG CGCTGACCAT CCTCACCCGC GACAAGGACC TGAAAAAGCA CATAGTACCC ATCGTGCCGG ACGAGGCGCG CACCTTCGGC ATGGAGGGGC TGTTCCGCCA GCTGGGCATC TACTCCTCCG TCGGTCAGCT CTACACCCCG CAGGATGCGG ATCAGCTGAT GTATTACAAG GAGGACAAGA AGGGGCAGAT CCTGCAGGAG GGGATCAACG AGGCCGGTGC CTTCTCCTCC TGGATTGCGG CGGGCACCAG CTACAGCAAC CATGGCGTGA ACATGGTGCC CTTCTACGCC TACTACTCCA TGTTCGGCTT CCAGCGCATC GGGGACCTGG CCTGGGCCGC CGGCGACATG CAGGCGCGCG GCTTCCTGAT GGGCGGTACC GCCGGCCGCA CCACCCTGAA CGGCGAGGGG TTGCAGCACG AGGATGGCCA CAGCCACCTG CTGGCCTCGA CCATCCCCAA CTGTCGGGCC TATGACCCCA CCTTTGCCTA CGAGATGGCG GTGCTCATCC AGGACGGGCT GCGGCGGATG TACGTGGATC AGGAGAACTG CTTCTACTAC ATCACCATGC TCAACGAGAA CTACCGGCAG CCGGCCATGC CCAAGGGTGC CGAGGAGGGC ATCGTCAAGG GCATGTACCT GTTCCGCGAG GGCAAGAAGA GCTCTGGGAG GAAGAAGGCC CCGCGGGTGC AGCTCATGGG CTCCGGTTCC ATCCTGCGCG AGGTGATTGA GGCGGCCGAC CTGTTGCAGC AGGACTTCGG CGTGGAGGCG GACATCTGGA GCGTTACCTC CTTCACCGAG GTGCGCCGTG AGGGCATGTC CGCCGAGCGC TATAACACGC TGAATCCGGA GACGAAGAAG CCCCGGGTGC CCTACCTGCA GGAATGCCTG AAGGGGCGCG CGGGGCCGGC GATTGCGGCG ACCGACTACA TGCGCACCTT CGCCGACCAG ATCCGTCCTT GGATGGATCG CACCTACCGG GTGCTGGGCA CGGACGGCTA TGGCCGTTCG GATACCCGCG AGAAGCTCCG CCAGTTCTTC GAGGTGGACC GCTACCACGT GGCGGTGGCC GCACTGAAGG CACTGGCCGA TGACGGCGTC GTGCCGGTGG CTAAGGTCTC CGAGGCCATC CGTAAGTACG GCGTGGACCC GGAGCGGCCC GACCCCTGGA CGGCCTAG
|
Protein sequence | MAAVEPLPEG DVDPLETREW LEALEAVIEE EGPERAQYLL EQLVEKTRRR GGVAPFKATT AYENTIPRHL EARSPGDHQL EWRLRSIMRW NAMAMVVQAN KEHDGIGGHI ASYASAATLY ETGFNHFWHA PSDEHGGDMV FIQGHSAPGI YARAFLEGRL TEDQLHNFRQ DVGGEGVTSY PHPWLMPEFW QFPTVSMGLG PIMAIYQARF MKYMHDREVI DTEGRKVWAF MGDGEMDEPE SLGAIGLAAR EKLDNLVFVV NCNLQRLDGP VRGNAKIIQE LEGEFRGAGW NVIKLIWGSG WDALLERDRT GLLRKRMEEC VDGEYQNFKA KGGAYTREHF FGKYPELKDM VAHMSDEEIA RLNRGGHDPH KVYAAYHAAA NHKGQPTVIL VKTVKGYGMG EAGEGQNITH QQKKMGEAAL KAFRDRFDIP IPDDKIAEAP FYKPDDDSPE IKYLHERRQA LGGYLPSRRT EAAPLKAPAL SAFDVLLKDS GDREMSTTMA FVRALTILTR DKDLKKHIVP IVPDEARTFG MEGLFRQLGI YSSVGQLYTP QDADQLMYYK EDKKGQILQE GINEAGAFSS WIAAGTSYSN HGVNMVPFYA YYSMFGFQRI GDLAWAAGDM QARGFLMGGT AGRTTLNGEG LQHEDGHSHL LASTIPNCRA YDPTFAYEMA VLIQDGLRRM YVDQENCFYY ITMLNENYRQ PAMPKGAEEG IVKGMYLFRE GKKSSGRKKA PRVQLMGSGS ILREVIEAAD LLQQDFGVEA DIWSVTSFTE VRREGMSAER YNTLNPETKK PRVPYLQECL KGRAGPAIAA TDYMRTFADQ IRPWMDRTYR VLGTDGYGRS DTREKLRQFF EVDRYHVAVA ALKALADDGV VPVAKVSEAI RKYGVDPERP DPWTA
|
| |