Gene Acid345_4350 details


Gene Information

Locus tag: Acid345_4350
Symbol:
ID: 4071768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5159799
End bp: 5161466
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 62%
IMG OID: 637986383
Product: 2-oxoglutarate dehydrogenase E2 component
Protein accession: YP_593424
Protein GI: 94971376
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01347] 2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component)
            [TIGR02927] 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase
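
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5159799
end_bp = 5161466
gene_length_bp = 1668
protein_length_aa = 555


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons agree with the listed values: 5161466 - 5159799 + 1 = 1668 bp, and 1668 / 3 - 1 = 555 aa.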


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0190521
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.146742
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTACCG ACGTCATCAT GCCCCAGATG GGGGAATCGA TCTTTGAAGG CACCATCACC 
AAATGGCTCA AACAACCGGG CGACCAGGTC CAGCGCGACG AGCCCCTCTT CGAAATCTCC
ACCGATAAAG TCGACGCCGA AATCCCCGCC CCTGCCGCGG GCATCCTGAA AGAAATCAAG
GCCCAGGCCG GCCAGACCGT ACAGGTCAAC ACCGTAGTCG CCATCATCGA CGCCGCCGGC
TCCGCGACCA CTTCCGCTCC GAAACCCGCC GCCGCTGCGC CACCGAAATC CGCGCCGCAA
CCCGATGGCG TAAGCTCCTC TGCCCCGTCA ACGAGCGCGC CGAGCGTCCC CGCCGCTGGT
CCCAAGACCG ACGTCGTAAT GCCGCAGATG GGCGAATCCA TCTTCGAAGG CACCATCACC
AAGTGGCTCA AGAATGTTGG TGACACCGTC CAGCGCGACG AGCCGCTCTT CGAAATCTCC
ACCGACAAAG TGGACGCCGA GATCCCCGCC CCAGTAGCCG GAGTCCTCAG CGAAATCAAA
GTTCAAGCCG GCGCCACCGT CCAGGTCAAC ACCGTCGTCG CCACCATCGG CGGCGCCGCT
GGAGCCAGCG CGCGCGCGCC GCAAGCCGCC GCACCCGCTC CTTCCGCACC TGCTCCCGCA
GCCCCCGCGC CACAAGCGCC CGCTGCAGCC GAGCCCGAAG AAGAAGAAAT CTCCGCTAGT
GGCGATCGCG TCCGCACCAG TCCGTTAGTC CGCAAAATGG CAAAAGAAGC CAACGTAGAC
CTCGGCAAAG TCCGCGGCAC CGGCATGGGT GGACGCATCA CCAAGGAAGA CATCCAGGCC
TTCGTCGAAA AACAAAAGAC CGCGCCGACT CCAACGCCGC AGCCGCAAGC CGCGCAACCT
TCCGCGCCCG CACCTGCTCC CAGCGCCCCC GTCGCCGCCA CACCAAACAA ATTCGCGGGC
ACTCCCGGTG CCATCGAGCC CATGTCGGTC ATGCGCAAGA AAATCGCCGA CCATATGGTC
ATGTCGAAGC GCACCAGCGC TCACGTGCAT GGCGTCTTCG AGGTCGACTT CACCAAAATC
GTGAAGCTCC GCGAGAAAAA CAAAAACAGC TTCCAGGAAA AGACAGGCCT CAAGCTTACC
TACACGCCGT TCTACGCGCG TGCAGTGGCC CACGCGTTGC GCGCATGGCC CATCATCAAC
GCCTCCGTCG AAGGCGAGAA CATTCACTAT AAAAAGGACA TCAACCTCGG CATCGCCGTA
GCTCTCGACT GGGGCCTCAT CGTCCCGGTG GTCAAGCAGG CCGACGGCCT AAGCTTCGTC
GGGCTCCAGC GCGCCATCAC CGACCTCGGC GAACGCGCCC GCGCCAAAAA GCTCAAACCC
GAAGACGTCC AGGGCGGCAC CTTCACCATC ACCAACCCCG GCATCTTCGG AGCAAAGTTC
GGCATGCCGA TCATCAGCCA GCCCCAGCTC GCGATCTTAG GCATCGGCGC GATCACGAAA
GTCCCCATGG TCGTCACCGA CAAAGACGGC AACGACAGCA TCGCCATCCG CTCGCGCTGC
CACATCTCTA TCGGCTACGA CCACCGAGTA ATCGACGGCG CAGTCGCCGA CCAGTTCATG
GTCGTAGTAC GAGACTACCT CCAGAACTGG AACGAACCGT TAATCTAA
 
Protein sequence
MPTDVIMPQM GESIFEGTIT KWLKQPGDQV QRDEPLFEIS TDKVDAEIPA PAAGILKEIK 
AQAGQTVQVN TVVAIIDAAG SATTSAPKPA AAAPPKSAPQ PDGVSSSAPS TSAPSVPAAG
PKTDVVMPQM GESIFEGTIT KWLKNVGDTV QRDEPLFEIS TDKVDAEIPA PVAGVLSEIK
VQAGATVQVN TVVATIGGAA GASARAPQAA APAPSAPAPA APAPQAPAAA EPEEEEISAS
GDRVRTSPLV RKMAKEANVD LGKVRGTGMG GRITKEDIQA FVEKQKTAPT PTPQPQAAQP
SAPAPAPSAP VAATPNKFAG TPGAIEPMSV MRKKIADHMV MSKRTSAHVH GVFEVDFTKI
VKLREKNKNS FQEKTGLKLT YTPFYARAVA HALRAWPIIN ASVEGENIHY KKDINLGIAV
ALDWGLIVPV VKQADGLSFV GLQRAITDLG ERARAKKLKP EDVQGGTFTI TNPGIFGAKF
GMPIISQPQL AILGIGAITK VPMVVTDKDG NDSIAIRSRC HISIGYDHRV IDGAVADQFM
VVVRDYLQNW NEPLI
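
The sequence blocks above are displayed in 10-character groups, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, together with a GC-fraction helper and a start/stop codon check. The helper names are hypothetical, and the stop codon set assumed here (TAA, TAG, TGA) is the one used by translation table 11.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGCCTACCG ACGTCATCAT GCCCCAGATG GGGGAATCGA TCTTTGAAGG CACCATCACC"
last_row = "GTCGTAGTAC GAGACTACCT CCAGAACTGG AACGAACCGT TAATCTAA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

# A complete CDS is expected to begin with a start codon and end with a stop codon.
starts_with_atg = first.startswith("ATG")
ends_with_stop = last[-3:] in {"TAA", "TAG", "TGA"}

print(len(first), starts_with_atg, ends_with_stop)
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.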