Gene Information |
Locus tag | Francci3_3057 |
Symbol | aceE |
ID | 3904258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3622703 |
End bp | 3625540 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637880378 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_482143 |
Protein GI | 86741743 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
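The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another if the coordinates are 1-based and inclusive and the last codon is a stop codon. A minimal Python sketch of that arithmetic, under those assumptions (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Francci3_3057
length_bp = gene_length(3622703, 3625540)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the coordinates listed for this locus, the sketch gives 2838 bp and 945 aa, matching the Gene Length and Protein Length fields.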
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.636776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCAGG ACACACCCCG GAAGTTCTCG GTCATCACCG ATGGGCTACC GAGTCAGCTG CCGGATATCG ATCCCTCCGA GACCAGCGAG TGGCTGGAGT CGCTCGACGC CGTCATCGAG GAGTCCGGCC GCGGCCGGGC CCGCTTCCTC ATGTTGAAGC TCCTCGAACG GGCCCGGGAG AAGGCGGTCG GTGTTCCCGG TCTCACCAGC ACCGACTACA TCAACACCAT CAGCCCGGAG CGGGAGCCCT GGTTCCCCGG CAACGAGCAC ATCGAACGCC GCATCCGGGC CTATATCCGG TGGAACGCGG CCATCATGGT GAGCCGCGCC AACCGCCCCG CGTTCAATGT CGGCGGTCAC ATCGCGACCT ACGCCTCAAG CGCGAGCCTC TACGAGGTGG GCTTCAACCA TTTCTTCCGC GGCAAGGACC ATCTCAGTTC CTCGCCGGGA AGTGCCTCGG GTGACCAGAT CTTTATTCAG GGTCACGCGT CCCCGGGCAT CTACGCCCGC GCCTTCCTGG AAGGCCGGCT GACGGAGAAG CAGCTCGACG CCTTCCGGCG CGAGGGCGAG TCGGGCGGCC TGTCGTCCTA CCCTCACCCT CGGCTCATGC CTGATTTCTG GGAGTTCCCC ACGGTTTCGA TGGGACTCGG GCCGATCGGC GCCATCTACC AGGCCCGGTT CAACCGCTAC CTGCTCAACC GCCAGATCAA GGACACCTCG GGCAGCCGGG TGTGGGCGTT CCTCGGCGAC GGCGAGATGG ACGAGCCGGA GTCAATCGGC GCGCTGGGCG TGGCCGCCCG CGAGGAGCTC GACAACCTCA TCTTCGTGGT CAACTGCAAC CTGCAGCGCC TCGACGGCCC GGTCCGCGGC AACGGCAAGA TCATGCAGGA GCTGGAGTCG CTGTTCCGGG GCGCCGGCTG GAACGTCATC AAGGTGGTCT GGGGCCGCGA CTGGGACCCG CTGCTGGCCA AGGACACCGA CGGCGTGCTG GTAAACCGCA TGAACACCAC GCCGGACGGG CAGTTCCAGA CCTACTCGAC GTCGTCGGGC GAATACATCC GGGAGCACTT CTTCGGCTCG GACGCCCGGC TGCGCCGGAT GGTCGCGGAC CTCTCCGACG ACGATCTGCG CAAGCTCTCC CGCGGTGGCC ACGACTACCG CAAGCTCTAC GCGGCCTACA AGGCGGCGAC CGAGCACGCG GGCCAGCCCA CGGTCATCCT CGCGCACACC ATCAAGGGCT GGACCCTGGG CAAGGACTTC GAGGCCCGCA ACGCCACCCA CCAGATGAAG AAGCTGACCA AGAGCGAGCT CAAGGAGTTC CGCGACCGGC TCTATCTGGA GATCCCGGAC TCGGCCCTGT CGGGTGACCT GCCGCCCTAC TGCCACCCGG GGCCGGACTC TGAGGAAATC GCGTACATGC GGGAACGCCG GGCCGCGCTC GGTGGGTCGA TCCCGCGTCG GGTCGTGCGG GCCAAGCCGC TCCCCCAACC GCCAGCGAAG ATCTTCGACG AACTCCGGAA GGGCTCCGGC AAACAGCCCG TCGCCACGAC GATGGCCTTC GTCCGGCTGC TGAAGGACCT CATGAAGACC AAGGAGATGG GCGCGCGCTT CGTGCCGGTC ATCCCCGACG AGGCCCGCAC CTTCGGCATG GACGCGATGT TCCCGACGGC GAAGATCTAC TCGCCGCACG GGCAGCGCTA CGAGGCCGTC GACCGCGAAC TGCTGCTGTC CTACCAGGAG TCCGAGACCG GTCAGATGCT GCACGAGGGC ATCAGCGAGG CCGGGTCGAT GGGCTCGGTG 
ATCGCCGCTG GCACGGCGTA CGCCACCCAC GCGCAGCACA TGATCCCGGT CTACGTCTTC TACTCGATGT TCGGCTTCCA GCGCACCGGC GACCAGATGT GGGCGCTCGG CGACCAGCTC GGCCGCGGGT TCCTGCTCGG GGCGACCGCC GGTCGGACCA CTCTCAACGG CGAGGGCCTG CAGCACCAGG ACGGCCACTC CCTGCTCCTG GCGTCGACTA ACCCGGCCTG CGTGGCCTAC GACCCCGCCT TCGCCTTCGA GCTGACACAC ATCGTCCGGG ACGCGCTCGA CCGAATGTAC GGCGAGCGGG ACGACAACGT CTTCTACTAC CTCACTGTCT ACAACGAGCC GGTGCCGCAG CCTGCCGAGC CCGCGGGTCT CGACCCGGCG CAGATCATCT CCGGGATGTA CCGGTTCCGC TCGGCTGAGG ACCTGGCCGG CGGGCTTCCG GCGGCGGGGG CCGGCGTGGG CTCCCCGCCG CGGGCGCAAC TGCTGGCCAG CGGGACTTCC ATCCACTGGG CGCTGGCGGC GCAGGAGATC CTTGCCGCCG ACTTCGGGGT CGCCGCCGAC CTGTGGTCGG TGACCTCGTG GAACGAGCTT CGGCGAGATG CGCTCGACTG CGACCGCGCC AACCTGCTCA ACCCCGAGGC CGACGACGCC GTGCCGTACG TGACCCGTGC CCTCGAAGGG GCCGCCGGCC CGGTCGTGGC GGTCTCCGAC TGGATGCGCG CGGTGCCCGA CCAGATCTCC CGGTGGGTTC CGCAGCCGTT CACCTCGCTG GGCACCGACG GGTACGGCCG TTCCGACACT CGGGCCGCCC TGCGCCGGCA CTTCAAGGTC GACGCCGAGT CGATCGTCGT CGCGACCCTG GAAGCCCTCG TCCGGGCCGG TGAGGTCAAG GCCACCACGG TCGGGGACGC GATCAGGCGA TTCGGGTTGC GTGCCGACAA GGCCGGCCTG GACGGGCCCG AGGCGATCGT GGCCGGCATC AGGATCGTGG CCGGCGCCGA GCAGGACGAG CGGCCGATCT CGGGCTGA
|
Protein sequence | MAQDTPRKFS VITDGLPSQL PDIDPSETSE WLESLDAVIE ESGRGRARFL MLKLLERARE KAVGVPGLTS TDYINTISPE REPWFPGNEH IERRIRAYIR WNAAIMVSRA NRPAFNVGGH IATYASSASL YEVGFNHFFR GKDHLSSSPG SASGDQIFIQ GHASPGIYAR AFLEGRLTEK QLDAFRREGE SGGLSSYPHP RLMPDFWEFP TVSMGLGPIG AIYQARFNRY LLNRQIKDTS GSRVWAFLGD GEMDEPESIG ALGVAAREEL DNLIFVVNCN LQRLDGPVRG NGKIMQELES LFRGAGWNVI KVVWGRDWDP LLAKDTDGVL VNRMNTTPDG QFQTYSTSSG EYIREHFFGS DARLRRMVAD LSDDDLRKLS RGGHDYRKLY AAYKAATEHA GQPTVILAHT IKGWTLGKDF EARNATHQMK KLTKSELKEF RDRLYLEIPD SALSGDLPPY CHPGPDSEEI AYMRERRAAL GGSIPRRVVR AKPLPQPPAK IFDELRKGSG KQPVATTMAF VRLLKDLMKT KEMGARFVPV IPDEARTFGM DAMFPTAKIY SPHGQRYEAV DRELLLSYQE SETGQMLHEG ISEAGSMGSV IAAGTAYATH AQHMIPVYVF YSMFGFQRTG DQMWALGDQL GRGFLLGATA GRTTLNGEGL QHQDGHSLLL ASTNPACVAY DPAFAFELTH IVRDALDRMY GERDDNVFYY LTVYNEPVPQ PAEPAGLDPA QIISGMYRFR SAEDLAGGLP AAGAGVGSPP RAQLLASGTS IHWALAAQEI LAADFGVAAD LWSVTSWNEL RRDALDCDRA NLLNPEADDA VPYVTRALEG AAGPVVAVSD WMRAVPDQIS RWVPQPFTSL GTDGYGRSDT RAALRRHFKV DAESIVVATL EALVRAGEVK ATTVGDAIRR FGLRADKAGL DGPEAIVAGI RIVAGAEQDE RPISG
|
| |
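The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG can serve as an initiation codon and is then read as methionine rather than valine. A minimal Python sketch of such a translation follows. The helper name `translate_cds` is hypothetical, and the start-codon set is the one listed for NCBI table 11, which should be checked against the current NCBI genetic code tables before being relied on:

```python
from itertools import product

# Standard codon assignments (shared by tables 1 and 11), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An alternative start codon in first position is read as methionine.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record
print(translate_cds("GTGGCGCAGG ACACACCCCG GAAGTTCTCG"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MAQDTPRKFS, which matches the first block of the listed protein sequence.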