Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1860 |
Symbol | aceE |
ID | 5670262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2234474 |
End bp | 2237308 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641240781 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_001506204 |
Protein GI | 158313696 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00231048 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGCGCAGG ACACAACGCG GAAGTTCTCG GTCATCACCG ACGGGCTCCC GAGCCAGTTG CCGGACATCG ATCCGTCCGA AACCAGCGAA TGGCTCGAGT CGCTCGACGC GGTGATCGAG GAGTCCGGGC GAGGCCGGGC ACGGTTCCTC ATGCTCAAAC TCCTCGAACG GGCACGCGAG AAGGCGGTCG GGGTACCGGG CCTGACCAGC ACCGACTTCA TCAACACGAT CCCCCCCGAG CAGGAACCCT GGTTCCCGGG CGACGAGCAC GTCGAACGGC GGATCCGGGC GTACATCCGG TGGAACGCCG CCATCATGGT CAGCCGCGCG AACCGTCCGG AATATAATGT CGGCGGCCAT ATCGCCACAT ATGCCTCGAG CGCGAGCCTG TACGAGGTCG GCTTCAATCA TTTCTTCCGG GGCAAGGACC ACCTGACCTC CTCCCCCGGC AGCGATTCCG GGGACCAGAT CTTCATCCAG GGGCACGCCT CCCCCGGCAT CTACGCCCGC GCGTTCCTCG AGGGCCGTCT CACCGAGGCA CAGCTGGACG CGTTCCGCCG CGAGGGCGAG CCGGGCGGCC TGTCGTCCTA CCCGCACCCC CGGCTGATGC CGGACTTCTG GGAGTTCCCG ACAGTGTCGA TGGGCCTCGG CCCGATCGAC GCCATCTACC AGGCGCGGTT CAACCGCTAC CTGCTCAACC GCCAGATCAA GGACACCTCC CGCAGCAAGG TGTGGGCGTT CCTCGGCGAC GGCGAGATGG ACGAGCCGGA GTCGATCGGC GCCCTCGGGG TCGCCGCCCG CGAGGAGCTC GACAACCTCA TCTTCGTGGT GAACTGCAAC CTGCAGCGCC TCGACGGGCC GGTCCGCGGC AACGGCAAGA TCATGCAGGA GCTGGAGTCG CTGTTCCGCG GCGCCGGCTG GAACGTCATC AAAGTCGTCT GGGGCCGTGA CTGGGACCCG CTGCTCGCCA AGGACACCGA CGGCGTCCTC GTCCACCGGA TGAACACCAC ACCCGACGGC CAGTTCCAGA CCTACTCGAC CTCGTCAGGC GACTACATCC GGGAGCACTT CTTCGGCGCC GACGCCCGGC TACGCCGGAT GGTCACCGAC CTGGCCGACG AGGATCTCAG CAAGCTCTCC CGCGGCGGGC ACGACTACCG CAAGCTGTAC GCGGCCTACA AGGCGGCCAC GGAGCATGCC GGTCAGCCCA CCGTGATCCT CGCCCACACG ATCAAGGGCT GGACGCTGGG CAAGGACTTC GAGGGCCGCA ACGCCACGCA CCAGATGAAG AAGCTCACCA AGACCGAGCT CAAGGAGCTG CGCGACCGGC TCTACCTGGA AATCCCCGAC TCGGCGCTGG ACGGCAACCT CCCGCCGTAC TACCGCCCGG GGCCGGACTC CGAGGAGATC CAGTACATGC GGGAACGCCG GGCGGGGCTC GGCGGATCGA TCCCGCGCCG CGTGGTGCAC GCCCGCACCC TCCCCCAGCC ACCAAAGTCG ATCTTCGACG AGCTGCGGCG GGGCTCGGGC AAGCAGCCGG TGGCCACGAC AATGGCCATC GTCCGCCTAC TCAAGGACCT GATGAAGACC AAGGAGATGG GCGCCCGGTT CGTGCCCGTC ATCCCGGACG AGGCGCGCAC GTTCGGCATG GACGCGATGT TCCCCACAGC GAAGATCTAC TCGCCGCACG GGCAGCGCTA CGAGGCCGTC GACCGGGAGC TGCTGCTCTC CTACCGGGAG TCCGAGTCCG GCCAGATGCT GCACGAGGGC ATCAGCGAGG CCGGTTCGAT GGGCTCGGTG ATCGCCGCCG CGACGGCGTA CTCCACCCAC GGCCAGCACA TGATCCCGGT GTACGTCTTC TACTCGATGT TCGGCTTCCA GCGGACCGGC GACCAGATGT GGGCGCTCGG CGACCAGCTC GGCCGGGCCT TCCTACTCGG CGCCACCGCG GGGCGCACGA CCCTGAACGG CGAGGGCCTA CAGCACCAGG ACGGCCACTC GCTACTGCTG GCATCCACGA ACCCGGCGTG CGTGTCGTAC GACCCGGCGT TCGCGTTCGA GATCTCCCAC ATCGTCCGCG ACGCCCTCGA CCGGATGTAC GGCGAGCGGA ACGAGAACGT CTTCTACTAC CTGACCGTCT ACAACGAGCC GGTCCCGCAG CCAGCCGAGC CGACCGGGGT CGACCCGACC CAGATCATCG CCGGCATGTA CCGGTTCCGT ACCGCCGACG CGCTCACCGG CGGCGAGGAG ACACCCACCG GGCCGGTCGA GGACGGGGCC ACCGAGAGCT CCACCACGAC GCAGGCACAG CTCCTGGCCA GCGGTACCGG CATGCGCTGG GCGCTGGCCG CCCAGGAGAT GCTGGCGGCC GACTACGGCA TCGCCGCCGA CGTGTGGTCA GTGACCTCGT GGAACGAACT GCGCCGGGAA GCACTGGTAT GCGAGCGGCG CAACCTGCTC AACCCCGAGC AGCCGCCAGC CGTGCCCTAC ATCAGCCAGA TCCTGAACGG CGCACCAGGC CCGGTCATCG CGGTCTCGGA CTGGATGCGC GCCGTCCCCG ACCAGATCTC ACGGTGGGTG CCACAGCCGT ACACCTCGCT GGGCACCGAC GGCTTCGGCC GCTCCGACAC CCGGGCGGCC CTACGGCGCC ACTTCAACGT CGACGCCGAG TCCGTGGTCG TGGCGACCCT GGAGGCACTG ACACGAACAG GTGACGTCGA GCAGGCCACC GTGGACGACG CCATCCGCCG GTACGGGCTA CGCAAGGACG GCGCCAACGA GGCCGCTCTC GGCAACGGCG CCACGGAAGA GAACGACCAG CCAGTCTCCG GCTGA
|
Protein sequence | MAQDTTRKFS VITDGLPSQL PDIDPSETSE WLESLDAVIE ESGRGRARFL MLKLLERARE KAVGVPGLTS TDFINTIPPE QEPWFPGDEH VERRIRAYIR WNAAIMVSRA NRPEYNVGGH IATYASSASL YEVGFNHFFR GKDHLTSSPG SDSGDQIFIQ GHASPGIYAR AFLEGRLTEA QLDAFRREGE PGGLSSYPHP RLMPDFWEFP TVSMGLGPID AIYQARFNRY LLNRQIKDTS RSKVWAFLGD GEMDEPESIG ALGVAAREEL DNLIFVVNCN LQRLDGPVRG NGKIMQELES LFRGAGWNVI KVVWGRDWDP LLAKDTDGVL VHRMNTTPDG QFQTYSTSSG DYIREHFFGA DARLRRMVTD LADEDLSKLS RGGHDYRKLY AAYKAATEHA GQPTVILAHT IKGWTLGKDF EGRNATHQMK KLTKTELKEL RDRLYLEIPD SALDGNLPPY YRPGPDSEEI QYMRERRAGL GGSIPRRVVH ARTLPQPPKS IFDELRRGSG KQPVATTMAI VRLLKDLMKT KEMGARFVPV IPDEARTFGM DAMFPTAKIY SPHGQRYEAV DRELLLSYRE SESGQMLHEG ISEAGSMGSV IAAATAYSTH GQHMIPVYVF YSMFGFQRTG DQMWALGDQL GRAFLLGATA GRTTLNGEGL QHQDGHSLLL ASTNPACVSY DPAFAFEISH IVRDALDRMY GERNENVFYY LTVYNEPVPQ PAEPTGVDPT QIIAGMYRFR TADALTGGEE TPTGPVEDGA TESSTTTQAQ LLASGTGMRW ALAAQEMLAA DYGIAADVWS VTSWNELRRE ALVCERRNLL NPEQPPAVPY ISQILNGAPG PVIAVSDWMR AVPDQISRWV PQPYTSLGTD GFGRSDTRAA LRRHFNVDAE SVVVATLEAL TRTGDVEQAT VDDAIRRYGL RKDGANEAAL GNGATEENDQ PVSG
|
| |