Gene Information |
Locus tag | Franean1_7017 |
Symbol | aceE |
ID | 5675328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8556315 |
End bp | 8559128 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641245863 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001511254 |
Protein GI | 158318746 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
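The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly, but it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. Variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 8556315
end_bp = 8559128

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1   # 2814 bp

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3             # 938 codons
protein_length = codons - 1           # 937 aa

print(gene_length, codons, protein_length)  # 2814 938 937
```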
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACGG TCATCACCGA CGGGCTCCCG AGCCAGTTGC CGGACATCGA TCCGTCCGAA ACCAGCGAAT GGCTCGAGTC GCTCGACGCG GTGATCGAGG AGTCCGGGCG AGGCCGGGCA CGGTTCCTCA TGCTCAAACT CCTCGAACGG GCACGCGAGA AGGCGGTCGG GGTACCGGGC CTGACCAGCA CCGACTTCAT CAACACGATC CCCCCCGAGC AGGAACCCTG GTTCCCGGGC GACGAGCACG TCGAACGGCG GATCCGGGCG TACATCCGGT GGAACGCCGC CATCATGGTC AGCCGCGCGA ACCGTCCGGA ATATAATGTC GGCGGCCATA TCGCCACATA TGCCTCGAGC GCGAGCCTGT ACGAGGTCGG CTTCAATCAT TTCTTCCGGG GCAAGGACCA CCTGACCTCC TCCCCCGGCA GCGATTCCGG GGACCAGATC TTCATCCAGG GGCACGCCTC CCCCGGCATC TACGCCCGCG CGTTCCTCGA GGGCCGTCTC ACCGAGGCAC AGCTGGACGC GTTCCGCCGC GAGGGCGAGC CGGGCGGCCT GTCGTCCTAC CCGCACCCCC GGCTGATGCC GGACTTCTGG GAGTTCCCGA CAGTGTCGAT GGGCCTCGGC CCGATCGACG CCATCTACCA GGCGCGGTTC AACCGCTACC TGCTCAACCG CCAGATCAAG GACACCTCCC GCAGCAAGGT GTGGGCGTTC CTCGGCGACG GCGAGATGGA CGAGCCGGAG TCGATCGGCG CCCTCGGGGT CGCCGCCCGC GAGGAGCTCG ACAACCTCAT CTTCGTGGTG AACTGCAACC TGCAGCGCCT CGACGGGCCG GTCCGCGGCA ACGGCAAGAT CATGCAGGAG CTGGAGTCGC TGTTCCGCGG CGCCGGCTGG AACGTCATCA AAGTCGTCTG GGGCCGTGAC TGGGACCCGC TGCTCGCCAA GGACACCGAC GGCGTCCTCG TCCACCGGAT GAACACCACA CCCGACGGCC AGTTCCAGAC CTACTCGACC TCGTCAGGCG ACTACATCCG GGAGCACTTC TTCGGCGCCG ACGCCCGGCT ACGCCGGATG GTCACCGACC TGGCCGACGA GGATCTCAGC AAGCTCTCCC GCGGCGGGCA CGACTACCGC AAGCTGTACG CGGCCTACAA GGCGGCCACG GAGCATGCCG GTCAGCCCAC CGTGATCCTC GCCCACACGA TCAAGGGCTG GACGCTGGGC AAGGACTTCG AGGGCCGCAA CGCCACGCAC CAGATGAAGA AGCTCACCAA GACCGAGCTC AAGGAGCTGC GCGACCGGCT CTACCTGGAA ATCCCCGACT CGGCGCTGGA CGGCAACCTC CCGCCGTACT ACCGCCCGGG GCCGGACTCC GAGGAGATCC AGTACATGCG GGAACGCCGG GCGGGGCTCG GCGGATCGAT CCCGCGCCGC GTGGTGCACG CCCGCACCCT CCCCCAGCCA CCAAAGTCGA TCTTCGACGA GCTGCGGCGG GGCTCGGGCA AGCAGCCGGT GGCCACGACA ATGGCCATCG TCCGCCTACT CAAGGACCTG ATGAAGACCA AGGAGATGGG CGCCCGGTTC GTGCCCGTCA TCCCGGACGA GGCGCGCACG TTCGGCATGG ACGCGATGTT CCCCACAGCG AAGATCTACT CGCCGCACGG GCAGCGCTAC GAGGCCGTCG ACCGGGAGCT GCTGCTCTCC TACCGGGAGT CCGAGTCCGG CCAGATGCTG CACGAGGGCA TCAGCGAGGC CGGTTCGATG GGCTCGGTGA TCGCCGCCGC GACGGCGTAC 
TCCACCCACG GCCAGCACAT GATCCCGGTG TACGTCTTCT ACTCGATGTT CGGCTTCCAG CGGACCGGCG ACCAGATGTG GGCGCTCGGC GACCAGCTCG GCCGGGCCTT CCTACTCGGC GCCACCGCGG GGCGCACGAC CCTGAACGGC GAGGGCCTAC AGCACCAGGA CGGCCACTCG CTACTGCTGG CATCCACGAA CCCGGCGTGC GTGTCGTACG ACCCGGCGTT CGCGTTCGAG ATCTCCCACA TCGTCCGCGA CGCCCTCGAC CGGATGTACG GCGAGCGGAA CGAGAACGTC TTCTACTACC TGACCGTCTA CAACGAGCCG GTCCCGCAGC CAGCCGAGCC GACCGGGGTC GACCCGACCC AGATCATCGC CGGCATGTAC CGGTTCCGTA CCGCCGACGC GCTCACCGGC GGCGAGGAGA CACCCACCGG GCCGGTCGAG GACGGGGCCA CCGAGAGCTC CACCACGACG CAGGCACAGC TCCTGGCCAG CGGTACCGGC ATGCGCTGGG CGCTGGCCGC CCAGGAGATG CTGGCGGCCG ACTACGGCAT CGCCGCCGAC GTGTGGTCAG TGACCTCGTG GAACGAACTG CGCCGGGAAG CACTGGTATG CGAGCGGCGC AACCTGCTCA ACCCCGAGCA GCCGCCAGCC GTGCCCTACA TCAGCCAGAT CCTGAACGGC GCACCAGGCC CGGTCATCGC GGTCTCGGAC TGGATGCGCG CCGTCCCCGA CCAGATCTCA CGGTGGGTGC CACAGCCGTA CACCTCGCTG GGCACCGACG GCTTCGGCCG CTCCGACACC CGGGCGGCCC TACGGCGCCA CTTCAACGTC GACGCCGAGT CCGTGGTCGT GGCGACCCTG GAGGCACTGA CACGAACAGG TGACGTCGAG CAGGCCACCG TGGACGACGC CATCCGCCGG TACGGGCTAC GCAAGGACGG CGCCAACGAG GCCGCTCTCG GCAACGGCGC CACGGAAGAG AACGACCAGC CAGTCTCCGG CTGA
|
Protein sequence | MRTVITDGLP SQLPDIDPSE TSEWLESLDA VIEESGRGRA RFLMLKLLER AREKAVGVPG LTSTDFINTI PPEQEPWFPG DEHVERRIRA YIRWNAAIMV SRANRPEYNV GGHIATYASS ASLYEVGFNH FFRGKDHLTS SPGSDSGDQI FIQGHASPGI YARAFLEGRL TEAQLDAFRR EGEPGGLSSY PHPRLMPDFW EFPTVSMGLG PIDAIYQARF NRYLLNRQIK DTSRSKVWAF LGDGEMDEPE SIGALGVAAR EELDNLIFVV NCNLQRLDGP VRGNGKIMQE LESLFRGAGW NVIKVVWGRD WDPLLAKDTD GVLVHRMNTT PDGQFQTYST SSGDYIREHF FGADARLRRM VTDLADEDLS KLSRGGHDYR KLYAAYKAAT EHAGQPTVIL AHTIKGWTLG KDFEGRNATH QMKKLTKTEL KELRDRLYLE IPDSALDGNL PPYYRPGPDS EEIQYMRERR AGLGGSIPRR VVHARTLPQP PKSIFDELRR GSGKQPVATT MAIVRLLKDL MKTKEMGARF VPVIPDEART FGMDAMFPTA KIYSPHGQRY EAVDRELLLS YRESESGQML HEGISEAGSM GSVIAAATAY STHGQHMIPV YVFYSMFGFQ RTGDQMWALG DQLGRAFLLG ATAGRTTLNG EGLQHQDGHS LLLASTNPAC VSYDPAFAFE ISHIVRDALD RMYGERNENV FYYLTVYNEP VPQPAEPTGV DPTQIIAGMY RFRTADALTG GEETPTGPVE DGATESSTTT QAQLLASGTG MRWALAAQEM LAADYGIAAD VWSVTSWNEL RREALVCERR NLLNPEQPPA VPYISQILNG APGPVIAVSD WMRAVPDQIS RWVPQPYTSL GTDGFGRSDT RAALRRHFNV DAESVVVATL EALTRTGDVE QATVDDAIRR YGLRKDGANE AALGNGATEE NDQPVSG
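The protein sequence follows from the gene sequence by codon-by-codon translation on the + strand. The following is a minimal sketch of that step, not the database's own pipeline: it builds the codon table from the conventional TCAG ordering, whose amino acid assignments are the same for translation table 11 and the standard code (the two differ only in permitted start codons). The helper `translate` and the variables `head` and `tail` are hypothetical names; the two fragments are the first 30 and last 24 bases of the gene sequence as listed above.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by
# translation table 11 and the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

head = "ATGCGAACGG TCATCACCGA CGGGCTCCCG"   # first 30 bases of the gene
tail = "AACGACCAGC CAGTCTCCGG CTGA"         # last 24 bases, ending in TGA

print(translate(head))  # MRTVITDGLP
print(translate(tail))  # NDQPVSG
```

Both results match the start and end of the listed protein sequence. Passing the full gene sequence to the same function should give the complete 937-residue protein, provided the sequence is joined into a single string first.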