Gene Arth_2464 details

Gene Information

Locus tag: Arth_2464
Symbol: aceE
ID: 4445053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2759009
End bp: 2761777
Gene Length: 2769 bp
Protein Length: 922 aa
Translation table: 11
GC content: 65%
IMG OID: 639690279
Product: pyruvate dehydrogenase subunit E1
Protein accession: YP_831943
Protein GI: 116671010
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
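
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2759009, 2761777)
print(length_bp, protein_length(length_bp))
```

With the values listed for Arth_2464, this gives 2769 bp and 922 aa, matching the Gene Length and Protein Length fields.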


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0453737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATGCGC GCAAAGAGAG GTTGGACGTG GCTGCAGGAG AAGATACCTC CCATATCCTC 
AGCGGGTTGA CTAACCAGCT GCCTGATCGT GATCCGGAAG AGACCGCCGA ATGGATTGAG
TCCCTGGATA CGCTGATCAG GGAACAGGGC ACCGAGCGTG CCCAGTACAT CATGCGCAGT
CTCCTGCAGC GTGCCGGCGC CCAGAGCGTC GGGGTTCCGA TGGTCACCAC CACGGACTAT
GTGAACACCA TTCCCGCGGA CCAGGAAGCA CCGTTCCCGG GCAACGAGGA ATACGAGCGC
CGCTACCGGG CGTACATGCG CTGGAACGCC GCGGTCATGG TGCACCGGTC CCAGCGCCCG
AACATCGGGG TCGGCGGGCA CATCTCCACC TATGCCGGGG CCGCGACCCT GTACGAGGTC
GGGTTCAACC ACTTCTTCCG CGGCAAGGAC CACCCCGGCG GCGGGGACCA GGTCTTCTTC
CAGGGCCACG CTTCCCCGGG CATGTACGCC AGGGCGTTCA TGGAAGGACG CCTGACCGAG
GAGGACCTGG ACGGGTTCCG GCAGGAAAAG TCCAAGGCCG GCCACGCCCT GTCCTCCTAC
CCGCACCCGC GGCTGATGCC CGGGTTCTGG GAATTCCCCA CCGTGTCCAT GGGCATCGGG
CCGATGAACG CGATCTACCA GGCCCAGTCC AACCGGTACC TGCACAACCG CGGCCTGAAA
GACACCTCCG ACCAGCAGGT CTGGGCGTTC CTGGGCGACG GGGAAATGGA CGAGCCCGAG
TCCCGCGGCC TGCTCCAGCT CGCCGCGAAC GAGAACCTGG ACAACCTGAA CTTCGTGATC
AACTGCAACC TCCAGCGCCT GGACGGGCCG GTGCGCGGCA ACGGGAAGAT CATGCAGGAA
CTCGAAGCGT TCTTCCGCGG CGCGGGCTGG AACGTCATCA AGGTCGTCTG GGGCCGGGAA
TGGGATGACC TCCTGGCCAA GGACAACGAC GGGTCCCTGG TGAAGATCAT GAACGAGACC
CCGGACGGGG ACTACCAGAC CTACAAGGCA GAATCCGGCG GGTTCGTCCG CGAACACTTC
TTCGGGAAGA CCCCGCAGAC CAAGGACATG GTCGCGGACC TGAGCGATGA CCAGATCTGG
AACCTCAAGC GCGGCGGCCA CGACTACCGC AAGGTCTACG CCGCGTACAA GGCAGCCACC
GAATTCAAGG GCAAACCCAC CGTCATCCTG GCCAAAACGG TCAAGGGCTA CGGCCTCGGC
CCGCACTTCG AAGGCCGCAA CGCCACACAC CAGATGAAGA AACTCACCCT CGACGACCTC
AAGTCGTTCC GGGACCACCT GCGCATCCCG ATCACGGATG AGCAGCTCGA GGGCGATCCC
TACCAGCCGC CGTACTTCCA CCCCGGCAAC GATGCGCCGG AAATCGCGTA CATGATGGAG
CGCCGGGCCG CCCTGGGCGG TTCCGTTCCG GAGCGCCGCA GCAAGCATGC CGCCATCACG
CTTCCCGACG CGAAGTCCTA TGAGGTGGCC AAGCGCGGTT CGGGCAAGCA GCAGGCTGCC
ACGACCATGG CGTTCGTCCG CCTGCTCAAG GACCTCATGC GGGACAAGGA GTTCGGCAAG
CACATCGCGC CGATCATCCC CGATGAGGCG CGTACGTTCG GCATGGATGC GTTCTTCCCG
ACGGCGAAGA TCTACAACCC CAAGGGCCAG AACTACCTCT CCGTGGACCG GGACCTGGTC
CTGGCCTACA AGGAATCCGC CCAGGGCCAG CTGATCCACC CCGGCATCAA CGAAGCCGGC
GCCGTCGCAG CCTTCACCGC CGCCGGCACC GCCTACGCCA CCCACGGCGT CCCGCTGATC
CCGGTCTACG TGTTCTACTC CATGTTCGGC TTCCAGCGCA CCGGCGACGC CTTCTGGGCC
GCCGCGGACC AAATGACCCG CGGCTTCATC ATCGGCGCCA CTGCAGGCCG GACCACCCTC
ACCGGCGAAG GACTCCAGCA CGCCGACGGC CACTCCCCCA TCCTCGCCGC CACCAACCCG
GCCGTCGTCA CCTACGACCC CGCCTACGGC TACGAAATGG GCCACATCAT CCGCGACGGC
ATCGAGCGGA TGTACGGACC GGACTCCACT GACCGGAACC TGATGTACTA CATCACCGTC
TACAACGAAC CCATCACCCA GCCGGCAGAG CCGGACGAGC TGGACGTTGA AGGCGTGATC
AAGGGCATCT ATCTGCTCGC ACCGGCCAAG ATTGACGGCC CCCGCACGCA GATCCTGGCC
TCGGGCGTTT CGGTGCCCTG GGCGCTCGAA GCCCAGCGGA TCCTGGCCGA GGACTGGGGC
GTCTCCGCAG ACGTCTGGTC CGTCACGTCA TGGAACGAAC TCCGCCGCGA CGCCATGGCC
GCCGAGGAAG AGGCCTTCCT CAACCCGGGC CAGCCGGCGC GCGTGCCGTT CGTCACCGCG
CAGCTCGAAG GTGCCACCGG CCCTATCGTG GCTGTCACGG ACTACATGAA GGCCGTCCCG
GACCAGATCC GCCAGTTCCT CCCGAACGAG TTCGCCTCGC TCGGCGCGGA CGGCTTCGGC
TTCTCCGACA CCCGCGCCGC AGCACGCCGC TTCTTCAAGA ACGACATCCA CTCCATCGTG
GTCCGTTCAC TGGAGATGCT CGCGCGCCGC AGCGAGGTGG ACGCCCAGGC TCCTGCCCAG
GCCATTGAGA AGTATCGCCT GCATAACGTG AATGCGGGTT CCACCGGAAA CGCCGGAGGC
GAATCCTGA
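
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation. As a hedged illustration of how a GC content figure such as the one in the Gene Information section could be derived, here is a small Python sketch. The helper names are hypothetical, and the rounding rule used for the reported percentage is not stated in the record, so a recomputed value may differ slightly.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small demonstration.
first_blocks = "ATGCATGCGC GCAAAGAGAG"
print(f"{gc_fraction(first_blocks):.2f}")
```

Passing the full sequence text to `gc_fraction` gives a value that can be compared with the reported GC content.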
 
Protein sequence
MHARKERLDV AAGEDTSHIL SGLTNQLPDR DPEETAEWIE SLDTLIREQG TERAQYIMRS 
LLQRAGAQSV GVPMVTTTDY VNTIPADQEA PFPGNEEYER RYRAYMRWNA AVMVHRSQRP
NIGVGGHIST YAGAATLYEV GFNHFFRGKD HPGGGDQVFF QGHASPGMYA RAFMEGRLTE
EDLDGFRQEK SKAGHALSSY PHPRLMPGFW EFPTVSMGIG PMNAIYQAQS NRYLHNRGLK
DTSDQQVWAF LGDGEMDEPE SRGLLQLAAN ENLDNLNFVI NCNLQRLDGP VRGNGKIMQE
LEAFFRGAGW NVIKVVWGRE WDDLLAKDND GSLVKIMNET PDGDYQTYKA ESGGFVREHF
FGKTPQTKDM VADLSDDQIW NLKRGGHDYR KVYAAYKAAT EFKGKPTVIL AKTVKGYGLG
PHFEGRNATH QMKKLTLDDL KSFRDHLRIP ITDEQLEGDP YQPPYFHPGN DAPEIAYMME
RRAALGGSVP ERRSKHAAIT LPDAKSYEVA KRGSGKQQAA TTMAFVRLLK DLMRDKEFGK
HIAPIIPDEA RTFGMDAFFP TAKIYNPKGQ NYLSVDRDLV LAYKESAQGQ LIHPGINEAG
AVAAFTAAGT AYATHGVPLI PVYVFYSMFG FQRTGDAFWA AADQMTRGFI IGATAGRTTL
TGEGLQHADG HSPILAATNP AVVTYDPAYG YEMGHIIRDG IERMYGPDST DRNLMYYITV
YNEPITQPAE PDELDVEGVI KGIYLLAPAK IDGPRTQILA SGVSVPWALE AQRILAEDWG
VSADVWSVTS WNELRRDAMA AEEEAFLNPG QPARVPFVTA QLEGATGPIV AVTDYMKAVP
DQIRQFLPNE FASLGADGFG FSDTRAAARR FFKNDIHSIV VRSLEMLARR SEVDAQAPAQ
AIEKYRLHNV NAGSTGNAGG ES
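
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal, dependency-free Python illustration of that step. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code; the special handling of alternative start codons is deliberately left out, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGCATGCGC GCAAAGAGAG GTTGGACGTG GCTGCAGGAG AAGATACCTC CCATATCCTC"
print(translate(first_line))
```

For the first line of the gene sequence this yields MHARKERLDVAAGEDTSHIL, the first twenty residues of the protein sequence. Codons containing ambiguity characters such as N would raise a `KeyError` in this sketch.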