Gene Information |
Locus tag | Dshi_1968 |
Symbol | aceE |
ID | 5712962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2077242 |
End bp | 2079899 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641267892 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001533309 |
Protein GI | 159044515 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
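The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
# Minimal consistency check of the table values (illustrative sketch).
# Assumption: coordinates are 1-based and inclusive on the replicon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2077242, 2079899)   # values from the table
    print(length)                            # 2658
    print(protein_length(length))            # 885
```

Both results agree with the Gene Length (2658 bp) and Protein Length (885 aa) fields listed above.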
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0676932 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACA CCAAGCACGA TATCGACCCG GTGGAATCCC AGGAATGGCA GGAAGCCATC GAGGATGTGA TCGCCCGGGA CGGCGCGGAT CGCGCCCATT ACCTTCTGGA CAAGGCGGTG CAGCAGGCCC GCGCGGCGGG CGCGACCCTG CCGTTCTCGG CCACCACGCC CTACCAGAAC ACCATTCCCG CCGATGACCA GTACGACTTC CCCGGCGACC TGGAGATGGA ATGGCGGATC CGCACCATCA ACCGCTGGAA CGCCATGGCC ACGGTCGTGC GCCGCAACAA GGTCAGCTCG GAATATGGCG GGCATATCGC GTCCTTTGCG TCCTCGGCGG TGATGTATGA CGTGGGTCTG AACCACTTCT GGCGCTCGAA ATCGGCGATC CACGGTGGGG ATCTCGTGTT CTTCCAGGGC CATGTGATCC CGGGGATCTA TGCGCGGTCC TTCATGGAAG GCCGGATCAG CGAGGAGCAG CTGGAAAATT TCCGCTCCGA AGTCGACGGG TCCGGCCTGT CATCCTATCC GCACCCTTGG CTGATGCCGG ACTACTGGCA GTTCCCGACC GTGTCGATGG GGCTCGGCCC GCTCATGGCG ATCTACCAGG CGCGGTTCAT GAAATACATG CACAGCCGGG GCCTCATCGA CATGGCCGAC CGCAAGGTGT GGTGTTTCCT GGGCGATGGC GAGATGGACG AGCCGGAAAG CCGGGGCGCC ATTGACCTCG CGGTGCGCGA GGGGCTGGAC AACCTGATCT TCGTGATCAA CTGCAACTTG CAACGGCTCG ACGGACCGGT GCGCGGCAAT GGCAAGATCG TGCAGGAGCT GGAAGGCGAC TTCCGCGGCG CGGGCTGGAA CGTCATCAAG CTGCTCTGGG GCAAGGGTTG GGACGAATTG CTGGAGAAAG ACGTCTCGGG CCGGTTGCGC CAGCTGATGG ACGAGACCGT GGACGGCGAT TACCAGACGT TCAAATCCAA GGACGGCGCC TATATCCGCA AGCATTTCTT TGGCAAGTAT CCCGAGACCG CGGCGCTGGT CGAGGACTGG ACCGATGAGC AGATCTGGGC GCTGCGCCGG GGCGGGCACG ATCCGCAAAA GGTCTATACC GCGTTCAAGA AGGCGACCGA GACCAAGGGG CAGCCGAGCT GTCTGCTGGT CAAGACCGTG AAGGGTCACG GGATGGGCAC GGCCGGCGAG GGCCAGAACA CCACCCACCA GCAAAAGAAG ATGAACGAGG AGCAGTTGCG CGCCTTCCGC GACCGGTTCA AGATCCCTGT CAGCGACGAG GATGTCGGCA AGGCCCCGTT CGTGGCGCTG AACAACGCGC AGAAGGCCTA TATCCTGGAG CGGCGCAAGG AGCTGGGCGG CGAGTTCCCG AAGCGCGAAT GGCGCGACAC CCCCAAGCTG GAGATCCCTG CGCTGGAGGC GTTCGGCAAG GAGCTGAAAT CCACCGGCAC GCGTGAGATT TCGACCACCA TGGCCTTCGT GCGGATACTG ACGACGCTCT TGCGCGACAA GAACATCGGC AAGCAGGTCG TGCCGATCGT GCCGGACGAG AGCCGCACCT TCGGGATGGA AGGCTTGTTC CGCTCGGTGG GGATCTACAA CCCGATGGGG CAGACCTACA TCCCCGAAGA CCGGGACCAG ATGTCCTATT ACAAGGAGAG CGAGACGGGA CAGGTCCTGC AGGAGGGCAT CAACGAGGCC GGGGCCATGG CCGACTGGAT CGCGGCGGCC ACGTCGTACT CCAACCACGG CGTGCCGATG ATCCCGTTCT TCATCTACTA CTCTATGTTC 
GGGTTCCAGC GGATCGGCGA CCTGGCCTGG GCCGCGGGCG ACAGCCGTGC GCGGGGCTTC ATGCTGGGCG GCACCGCCGG GCGCACGACG CTGAACGGCG AGGGGCTGCA GCACGAGGAC GGGCATTCCC ACATCCTGGC GGGCACCATC CCGAACTGCA TCAGCTACGA TCCGACCTTC AGCTACGAGG TGGCGGTGAT CGTCCATCAC GGGCTCAAGC GGATGTATGT CGAGCAGGAC GACGTGTATT TCTACCTGAC GCTGATGAAC GAGAACTACA CCCATCCGGA CATGCCGATG GGTGTAGAAG AGGACATCAT CAAGGGGCTT TACCGGTTCT CGAAAACCGC CAAGCCGAAC AAGAAGCATG TCAACCTGAT GGGGTCGGGC ACGATCCTGG TGCAGGCGAT CAAGGCGGCC GAGATGCTGA AGGAAGACTT CGGCGTGACC TCGGACATCT GGTCGGCGAC CTCGATGAAC GAGCTGGCCC GCGACGGGCA GGATTGCGCA CGGGCCAACC GGCTCGACCC CCTTGGAGAT CAGAAGGTGC CGTTTGTCAC GCAACAGCTT GAAGGGGTGA CCGGACCGGT CATCGCGGCC ACCGATTACA TGAAGAACTA TGCCGAACAG ATCCGCGCGT TCGTGCCGCA GGACTTCACG GTCCTGGGCA CCGACGGGTT CGGGCGGTCC GACAGCCGGG TCAACCTGCG CCGGTTCTTC GAGGTGGATT CCAACCACAT CGCCGCCGCG GCCATGGTCG CGCTGCACAG GCAGGGGACC GTCACCGATG CGGTGCTGAA AAAGGCGCTC GCCCGGTACG AGATCGACAG CAACAAGCCG AACCCCCGTC TGGTGTGA
|
Protein sequence | MTDTKHDIDP VESQEWQEAI EDVIARDGAD RAHYLLDKAV QQARAAGATL PFSATTPYQN TIPADDQYDF PGDLEMEWRI RTINRWNAMA TVVRRNKVSS EYGGHIASFA SSAVMYDVGL NHFWRSKSAI HGGDLVFFQG HVIPGIYARS FMEGRISEEQ LENFRSEVDG SGLSSYPHPW LMPDYWQFPT VSMGLGPLMA IYQARFMKYM HSRGLIDMAD RKVWCFLGDG EMDEPESRGA IDLAVREGLD NLIFVINCNL QRLDGPVRGN GKIVQELEGD FRGAGWNVIK LLWGKGWDEL LEKDVSGRLR QLMDETVDGD YQTFKSKDGA YIRKHFFGKY PETAALVEDW TDEQIWALRR GGHDPQKVYT AFKKATETKG QPSCLLVKTV KGHGMGTAGE GQNTTHQQKK MNEEQLRAFR DRFKIPVSDE DVGKAPFVAL NNAQKAYILE RRKELGGEFP KREWRDTPKL EIPALEAFGK ELKSTGTREI STTMAFVRIL TTLLRDKNIG KQVVPIVPDE SRTFGMEGLF RSVGIYNPMG QTYIPEDRDQ MSYYKESETG QVLQEGINEA GAMADWIAAA TSYSNHGVPM IPFFIYYSMF GFQRIGDLAW AAGDSRARGF MLGGTAGRTT LNGEGLQHED GHSHILAGTI PNCISYDPTF SYEVAVIVHH GLKRMYVEQD DVYFYLTLMN ENYTHPDMPM GVEEDIIKGL YRFSKTAKPN KKHVNLMGSG TILVQAIKAA EMLKEDFGVT SDIWSATSMN ELARDGQDCA RANRLDPLGD QKVPFVTQQL EGVTGPVIAA TDYMKNYAEQ IRAFVPQDFT VLGTDGFGRS DSRVNLRRFF EVDSNHIAAA AMVALHRQGT VTDAVLKKAL ARYEIDSNKP NPRLV
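The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces/newlines used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the gene sequence listed above.
    prefix = "ATGACGGACA CCAAGCACGA TATCGACCCG"
    print(translate(prefix))  # MTDTKHDIDP
```

Pasting the full gene sequence into `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of this record.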