Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2464 |
Symbol | aceE |
ID | 4445053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2759009 |
End bp | 2761777 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639690279 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_831943 |
Protein GI | 116671010 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0453737 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGCGC GCAAAGAGAG GTTGGACGTG GCTGCAGGAG AAGATACCTC CCATATCCTC AGCGGGTTGA CTAACCAGCT GCCTGATCGT GATCCGGAAG AGACCGCCGA ATGGATTGAG TCCCTGGATA CGCTGATCAG GGAACAGGGC ACCGAGCGTG CCCAGTACAT CATGCGCAGT CTCCTGCAGC GTGCCGGCGC CCAGAGCGTC GGGGTTCCGA TGGTCACCAC CACGGACTAT GTGAACACCA TTCCCGCGGA CCAGGAAGCA CCGTTCCCGG GCAACGAGGA ATACGAGCGC CGCTACCGGG CGTACATGCG CTGGAACGCC GCGGTCATGG TGCACCGGTC CCAGCGCCCG AACATCGGGG TCGGCGGGCA CATCTCCACC TATGCCGGGG CCGCGACCCT GTACGAGGTC GGGTTCAACC ACTTCTTCCG CGGCAAGGAC CACCCCGGCG GCGGGGACCA GGTCTTCTTC CAGGGCCACG CTTCCCCGGG CATGTACGCC AGGGCGTTCA TGGAAGGACG CCTGACCGAG GAGGACCTGG ACGGGTTCCG GCAGGAAAAG TCCAAGGCCG GCCACGCCCT GTCCTCCTAC CCGCACCCGC GGCTGATGCC CGGGTTCTGG GAATTCCCCA CCGTGTCCAT GGGCATCGGG CCGATGAACG CGATCTACCA GGCCCAGTCC AACCGGTACC TGCACAACCG CGGCCTGAAA GACACCTCCG ACCAGCAGGT CTGGGCGTTC CTGGGCGACG GGGAAATGGA CGAGCCCGAG TCCCGCGGCC TGCTCCAGCT CGCCGCGAAC GAGAACCTGG ACAACCTGAA CTTCGTGATC AACTGCAACC TCCAGCGCCT GGACGGGCCG GTGCGCGGCA ACGGGAAGAT CATGCAGGAA CTCGAAGCGT TCTTCCGCGG CGCGGGCTGG AACGTCATCA AGGTCGTCTG GGGCCGGGAA TGGGATGACC TCCTGGCCAA GGACAACGAC GGGTCCCTGG TGAAGATCAT GAACGAGACC CCGGACGGGG ACTACCAGAC CTACAAGGCA GAATCCGGCG GGTTCGTCCG CGAACACTTC TTCGGGAAGA CCCCGCAGAC CAAGGACATG GTCGCGGACC TGAGCGATGA CCAGATCTGG AACCTCAAGC GCGGCGGCCA CGACTACCGC AAGGTCTACG CCGCGTACAA GGCAGCCACC GAATTCAAGG GCAAACCCAC CGTCATCCTG GCCAAAACGG TCAAGGGCTA CGGCCTCGGC CCGCACTTCG AAGGCCGCAA CGCCACACAC CAGATGAAGA AACTCACCCT CGACGACCTC AAGTCGTTCC GGGACCACCT GCGCATCCCG ATCACGGATG AGCAGCTCGA GGGCGATCCC TACCAGCCGC CGTACTTCCA CCCCGGCAAC GATGCGCCGG AAATCGCGTA CATGATGGAG CGCCGGGCCG CCCTGGGCGG TTCCGTTCCG GAGCGCCGCA GCAAGCATGC CGCCATCACG CTTCCCGACG CGAAGTCCTA TGAGGTGGCC AAGCGCGGTT CGGGCAAGCA GCAGGCTGCC ACGACCATGG CGTTCGTCCG CCTGCTCAAG GACCTCATGC GGGACAAGGA GTTCGGCAAG CACATCGCGC CGATCATCCC CGATGAGGCG CGTACGTTCG GCATGGATGC GTTCTTCCCG ACGGCGAAGA TCTACAACCC CAAGGGCCAG AACTACCTCT CCGTGGACCG GGACCTGGTC CTGGCCTACA AGGAATCCGC CCAGGGCCAG CTGATCCACC CCGGCATCAA CGAAGCCGGC GCCGTCGCAG CCTTCACCGC CGCCGGCACC GCCTACGCCA CCCACGGCGT CCCGCTGATC CCGGTCTACG TGTTCTACTC CATGTTCGGC TTCCAGCGCA CCGGCGACGC CTTCTGGGCC GCCGCGGACC AAATGACCCG CGGCTTCATC ATCGGCGCCA CTGCAGGCCG GACCACCCTC ACCGGCGAAG GACTCCAGCA CGCCGACGGC CACTCCCCCA TCCTCGCCGC CACCAACCCG GCCGTCGTCA CCTACGACCC CGCCTACGGC TACGAAATGG GCCACATCAT CCGCGACGGC ATCGAGCGGA TGTACGGACC GGACTCCACT GACCGGAACC TGATGTACTA CATCACCGTC TACAACGAAC CCATCACCCA GCCGGCAGAG CCGGACGAGC TGGACGTTGA AGGCGTGATC AAGGGCATCT ATCTGCTCGC ACCGGCCAAG ATTGACGGCC CCCGCACGCA GATCCTGGCC TCGGGCGTTT CGGTGCCCTG GGCGCTCGAA GCCCAGCGGA TCCTGGCCGA GGACTGGGGC GTCTCCGCAG ACGTCTGGTC CGTCACGTCA TGGAACGAAC TCCGCCGCGA CGCCATGGCC GCCGAGGAAG AGGCCTTCCT CAACCCGGGC CAGCCGGCGC GCGTGCCGTT CGTCACCGCG CAGCTCGAAG GTGCCACCGG CCCTATCGTG GCTGTCACGG ACTACATGAA GGCCGTCCCG GACCAGATCC GCCAGTTCCT CCCGAACGAG TTCGCCTCGC TCGGCGCGGA CGGCTTCGGC TTCTCCGACA CCCGCGCCGC AGCACGCCGC TTCTTCAAGA ACGACATCCA CTCCATCGTG GTCCGTTCAC TGGAGATGCT CGCGCGCCGC AGCGAGGTGG ACGCCCAGGC TCCTGCCCAG GCCATTGAGA AGTATCGCCT GCATAACGTG AATGCGGGTT CCACCGGAAA CGCCGGAGGC GAATCCTGA
|
Protein sequence | MHARKERLDV AAGEDTSHIL SGLTNQLPDR DPEETAEWIE SLDTLIREQG TERAQYIMRS LLQRAGAQSV GVPMVTTTDY VNTIPADQEA PFPGNEEYER RYRAYMRWNA AVMVHRSQRP NIGVGGHIST YAGAATLYEV GFNHFFRGKD HPGGGDQVFF QGHASPGMYA RAFMEGRLTE EDLDGFRQEK SKAGHALSSY PHPRLMPGFW EFPTVSMGIG PMNAIYQAQS NRYLHNRGLK DTSDQQVWAF LGDGEMDEPE SRGLLQLAAN ENLDNLNFVI NCNLQRLDGP VRGNGKIMQE LEAFFRGAGW NVIKVVWGRE WDDLLAKDND GSLVKIMNET PDGDYQTYKA ESGGFVREHF FGKTPQTKDM VADLSDDQIW NLKRGGHDYR KVYAAYKAAT EFKGKPTVIL AKTVKGYGLG PHFEGRNATH QMKKLTLDDL KSFRDHLRIP ITDEQLEGDP YQPPYFHPGN DAPEIAYMME RRAALGGSVP ERRSKHAAIT LPDAKSYEVA KRGSGKQQAA TTMAFVRLLK DLMRDKEFGK HIAPIIPDEA RTFGMDAFFP TAKIYNPKGQ NYLSVDRDLV LAYKESAQGQ LIHPGINEAG AVAAFTAAGT AYATHGVPLI PVYVFYSMFG FQRTGDAFWA AADQMTRGFI IGATAGRTTL TGEGLQHADG HSPILAATNP AVVTYDPAYG YEMGHIIRDG IERMYGPDST DRNLMYYITV YNEPITQPAE PDELDVEGVI KGIYLLAPAK IDGPRTQILA SGVSVPWALE AQRILAEDWG VSADVWSVTS WNELRRDAMA AEEEAFLNPG QPARVPFVTA QLEGATGPIV AVTDYMKAVP DQIRQFLPNE FASLGADGFG FSDTRAAARR FFKNDIHSIV VRSLEMLARR SEVDAQAPAQ AIEKYRLHNV NAGSTGNAGG ES
|
| |