Gene Acid345_2792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2792 
SymbolaceE 
ID4072415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3305365 
End bp3308046 
Gene Length2682 bp 
Protein Length893 aa 
Translation table11 
GC content60% 
IMG OID637984810 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_591867 
Protein GI94969819 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.215552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCTG TTAATCTAGA CACAATCGAC ATTAATTCCC TGGAAGTCTT AGAGAACCGT 
GAATGGCTTG AGTCGCTCGA ATACGTCCTC CAAACCGGAG GCCCCGAGCG CGTAGGACGC
CTGATTCAGC AGCTCCAGTT GCAATGCGAG CGCGCCGGCG TAAAACTCCC ATTCACTGCC
ACCACTCCCT ACGAAAACAC CATCCCCGCC GACCGCCAGC CCCCGTTTCC CGGCAGCCAG
GAAATGGAAC GTCGCATTAA GAGCCTTATT CGCTGGAACG CTCTGGCCAT GGTCATGCGC
GCCAACAAGG TCGAAGAGGG AATCGGCGGC CACATCTCCA CCTTCGCCTC CGCGGCCACG
CTCTACGAAG TCGGCTTCAA CCACTTCTTT CGCGCCGCTA CTGAAGATGG CGATCGTGAC
ATCGTCTATT TCCAGGGACA CAGCGCCCCC GGCATCTACT CCCGCGCGTT CCTCGAAGGT
CGTCTCCCGA TCGAGAAGCT GGAGAACTTC CGCCGCGAAC TTCATCCCGG CGGCGGCCTC
TCGTCCTATC CGCACCCGTG GCTCATGCCC GACTTCTGGG AATTCCCCAC GGTCTCCATG
GGACTCGGTC CGATCACCGC GATCTACCAG GCTCGCTTCA ACAAGTACCT CGAAAACCGC
GGCCTCAAGA CTGCGACCAG CGGCAAGATC TGGGCCTTCC TCGGTGACGG CGAAACCGAT
GAGCCCGAGT CGCTCGGTGC AATTTCTCTC GCTTCGCGGG AACGCCTCGA CAACCTCATC
TTCGTTATCA ACTGCAACCT CCAGCGCCTC GACGGCCCCG TCCGCGGCAA TTTCAAAATC
ATCCAGGAAC TTGAAGCCAA CTTCCGCGGC GCCGGATGGA ACGTAATCAA AGTTATCTGG
GGCAGCGATT GGGACAGCCT CATTGAGAAG GACACCGACG GTCTGCTCGT AAAGCGCATG
GGCGAAATCA CCGACGGCCA GTTCCAGAAG TACGCCGTCG AAACCGGACG CTACTTCCGC
CAGAACTTCT TCGGCACCGA CCCGCGCCTG CTCAAGATGG TCGAGCACCT CAGCGACGAG
CAGCTCGAAC ACCTGCGCCT CGGCGGACAC GACCCGATCA AAGTTCACGC CGCCTACAAA
GAGGCCGTCG ATCACAAGGG CTCGCCCACG GTCATTCTCG CCAAGACGAT CAAGGGTTAC
GGCCTCGGCG AAAGTGGCGA GGGCAAGAAC ATCACCCACC AGCAGAAGAA GCTCAACGAA
GAAGAGCTCA AAATCTTCCG CTCGCGCTTC GGCATCCCTG TCGCCGATGA AGATCTCGCG
AAAGCCCCGT TCTATCGCCC CAGCGACGAC TCGGCCGAAA TCAAATACCT GCAGGAACGC
CGCAAGCAAC TCGGCGGATA CATGCCGGCC CGCAAGGTCC GCGCCGCCGC GCTGCCCATC
CCGAAGGAAG AGCTCTTTGA AGAGTTCTAC AAGGGCACTG AAGGCCGCAA GGCATCGAGC
ACCATGGTCT TCGTTCGCAT GCTCGGCAAA CTGCTGCGCG ATCCCGAATT CGGCAAGTAC
GTCGTGCCCA TCGTTCCCGA CGAAGCTCGA ACCTTCGGTA TGGAAGCGCT CTTCCGCCAG
GTGGGTATCT ACTCCAGCGT AGGCCAGCTC TACGAGCCTG TCGATATGGA CACGCTCCTC
TATTACAAGG AGTCTAAGGA CGGCCAGATT CTCGAAGAGG GCATCACCGA GGCCGGTTCG
ATGTCTTCGT TCATCGCTGC GGGCAGTGCA TATTCCACGC ACGGCATCCC GACGATTCCG
TTCTTCATTT ACTACTCGAT GTTCGGATTC CAGCGCATTG GCGATCTCGT ATGGGCCGCC
GCTGACACCC GCTGCCGCGG CTTCATGCTC GGCGGCACCG CCGGACGCAC CACCCTTGCC
GGCGAAGGTC TCCAGCACCA GGACGGCCAC AGCCACCTGC TCGCTTACCC GGTTCCTACC
TGCATGGCCT ACGATCCCGC GTTCGCCTTT GAACTCGCGA TCATCATTCA GGACGGCATC
AAGCGCATGT ATCACGACGG CGAAAGCATC TTCTACTACA TCACCGTTAT GAACGAGCCG
GTCGAGAACC CCGCCATGCC CGAAGGTGTG CGCGAAGGTA TTCTCCGCGG CATGTATCGC
TTCAAGAAGT CGGAGCACAA GTCGAAGCTC AAGGCGAACC TCTTCGGCTC CGGCACCATC
ATGCAGGAAG TGATCAAGGC CGCCGAAATC CTCGAGTCCA AGTACGACAT CGCGAGCGAT
ATCTGGAGCA TCACCAGCTA CAAGGAACTC TACAAAGACG GCAATGACGT GGACCGCTGG
AACATGCTGC ACCCGGCGGA GAAGCCGCGC CAGACCTTCA TCGGCGAGCA GTTGAAAGAC
GCCGAAGGCG TCTTTGTCGC TGCTTCCGAC TATGTGAAGG CAATGCCGGA ATCCATTTCG
CAGTGGTTCC CGCGTCCGCT CCTCGCGCTC GGTACCGACG GCTTCGGCCG CAGCGAAGGC
CGCGCATCGT TGCGCGACTT CTTCGAGGTC GATGCCAAGC ACATCGTCGT CGGTACGCTC
ACCGCGCTCA TGCGCGACGG CAAAGTGAAA CCCGACGCGG TCAGCCGCGC TATCAAGGAT
CTCGGCGTCG ATCCCAACAA GCCGAATCCG TTCACCGTTT AG
 
Protein sequence
MNPVNLDTID INSLEVLENR EWLESLEYVL QTGGPERVGR LIQQLQLQCE RAGVKLPFTA 
TTPYENTIPA DRQPPFPGSQ EMERRIKSLI RWNALAMVMR ANKVEEGIGG HISTFASAAT
LYEVGFNHFF RAATEDGDRD IVYFQGHSAP GIYSRAFLEG RLPIEKLENF RRELHPGGGL
SSYPHPWLMP DFWEFPTVSM GLGPITAIYQ ARFNKYLENR GLKTATSGKI WAFLGDGETD
EPESLGAISL ASRERLDNLI FVINCNLQRL DGPVRGNFKI IQELEANFRG AGWNVIKVIW
GSDWDSLIEK DTDGLLVKRM GEITDGQFQK YAVETGRYFR QNFFGTDPRL LKMVEHLSDE
QLEHLRLGGH DPIKVHAAYK EAVDHKGSPT VILAKTIKGY GLGESGEGKN ITHQQKKLNE
EELKIFRSRF GIPVADEDLA KAPFYRPSDD SAEIKYLQER RKQLGGYMPA RKVRAAALPI
PKEELFEEFY KGTEGRKASS TMVFVRMLGK LLRDPEFGKY VVPIVPDEAR TFGMEALFRQ
VGIYSSVGQL YEPVDMDTLL YYKESKDGQI LEEGITEAGS MSSFIAAGSA YSTHGIPTIP
FFIYYSMFGF QRIGDLVWAA ADTRCRGFML GGTAGRTTLA GEGLQHQDGH SHLLAYPVPT
CMAYDPAFAF ELAIIIQDGI KRMYHDGESI FYYITVMNEP VENPAMPEGV REGILRGMYR
FKKSEHKSKL KANLFGSGTI MQEVIKAAEI LESKYDIASD IWSITSYKEL YKDGNDVDRW
NMLHPAEKPR QTFIGEQLKD AEGVFVAASD YVKAMPESIS QWFPRPLLAL GTDGFGRSEG
RASLRDFFEV DAKHIVVGTL TALMRDGKVK PDAVSRAIKD LGVDPNKPNP FTV