Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4350 |
Symbol | |
ID | 4071768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5159799 |
End bp | 5161466 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637986383 |
Product | 2-oxoglutarate dehydrogenase E2 component |
Protein accession | YP_593424 |
Protein GI | 94971376 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01347] 2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component) [TIGR02927] 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0190521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.146742 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTACCG ACGTCATCAT GCCCCAGATG GGGGAATCGA TCTTTGAAGG CACCATCACC AAATGGCTCA AACAACCGGG CGACCAGGTC CAGCGCGACG AGCCCCTCTT CGAAATCTCC ACCGATAAAG TCGACGCCGA AATCCCCGCC CCTGCCGCGG GCATCCTGAA AGAAATCAAG GCCCAGGCCG GCCAGACCGT ACAGGTCAAC ACCGTAGTCG CCATCATCGA CGCCGCCGGC TCCGCGACCA CTTCCGCTCC GAAACCCGCC GCCGCTGCGC CACCGAAATC CGCGCCGCAA CCCGATGGCG TAAGCTCCTC TGCCCCGTCA ACGAGCGCGC CGAGCGTCCC CGCCGCTGGT CCCAAGACCG ACGTCGTAAT GCCGCAGATG GGCGAATCCA TCTTCGAAGG CACCATCACC AAGTGGCTCA AGAATGTTGG TGACACCGTC CAGCGCGACG AGCCGCTCTT CGAAATCTCC ACCGACAAAG TGGACGCCGA GATCCCCGCC CCAGTAGCCG GAGTCCTCAG CGAAATCAAA GTTCAAGCCG GCGCCACCGT CCAGGTCAAC ACCGTCGTCG CCACCATCGG CGGCGCCGCT GGAGCCAGCG CGCGCGCGCC GCAAGCCGCC GCACCCGCTC CTTCCGCACC TGCTCCCGCA GCCCCCGCGC CACAAGCGCC CGCTGCAGCC GAGCCCGAAG AAGAAGAAAT CTCCGCTAGT GGCGATCGCG TCCGCACCAG TCCGTTAGTC CGCAAAATGG CAAAAGAAGC CAACGTAGAC CTCGGCAAAG TCCGCGGCAC CGGCATGGGT GGACGCATCA CCAAGGAAGA CATCCAGGCC TTCGTCGAAA AACAAAAGAC CGCGCCGACT CCAACGCCGC AGCCGCAAGC CGCGCAACCT TCCGCGCCCG CACCTGCTCC CAGCGCCCCC GTCGCCGCCA CACCAAACAA ATTCGCGGGC ACTCCCGGTG CCATCGAGCC CATGTCGGTC ATGCGCAAGA AAATCGCCGA CCATATGGTC ATGTCGAAGC GCACCAGCGC TCACGTGCAT GGCGTCTTCG AGGTCGACTT CACCAAAATC GTGAAGCTCC GCGAGAAAAA CAAAAACAGC TTCCAGGAAA AGACAGGCCT CAAGCTTACC TACACGCCGT TCTACGCGCG TGCAGTGGCC CACGCGTTGC GCGCATGGCC CATCATCAAC GCCTCCGTCG AAGGCGAGAA CATTCACTAT AAAAAGGACA TCAACCTCGG CATCGCCGTA GCTCTCGACT GGGGCCTCAT CGTCCCGGTG GTCAAGCAGG CCGACGGCCT AAGCTTCGTC GGGCTCCAGC GCGCCATCAC CGACCTCGGC GAACGCGCCC GCGCCAAAAA GCTCAAACCC GAAGACGTCC AGGGCGGCAC CTTCACCATC ACCAACCCCG GCATCTTCGG AGCAAAGTTC GGCATGCCGA TCATCAGCCA GCCCCAGCTC GCGATCTTAG GCATCGGCGC GATCACGAAA GTCCCCATGG TCGTCACCGA CAAAGACGGC AACGACAGCA TCGCCATCCG CTCGCGCTGC CACATCTCTA TCGGCTACGA CCACCGAGTA ATCGACGGCG CAGTCGCCGA CCAGTTCATG GTCGTAGTAC GAGACTACCT CCAGAACTGG AACGAACCGT TAATCTAA
|
Protein sequence | MPTDVIMPQM GESIFEGTIT KWLKQPGDQV QRDEPLFEIS TDKVDAEIPA PAAGILKEIK AQAGQTVQVN TVVAIIDAAG SATTSAPKPA AAAPPKSAPQ PDGVSSSAPS TSAPSVPAAG PKTDVVMPQM GESIFEGTIT KWLKNVGDTV QRDEPLFEIS TDKVDAEIPA PVAGVLSEIK VQAGATVQVN TVVATIGGAA GASARAPQAA APAPSAPAPA APAPQAPAAA EPEEEEISAS GDRVRTSPLV RKMAKEANVD LGKVRGTGMG GRITKEDIQA FVEKQKTAPT PTPQPQAAQP SAPAPAPSAP VAATPNKFAG TPGAIEPMSV MRKKIADHMV MSKRTSAHVH GVFEVDFTKI VKLREKNKNS FQEKTGLKLT YTPFYARAVA HALRAWPIIN ASVEGENIHY KKDINLGIAV ALDWGLIVPV VKQADGLSFV GLQRAITDLG ERARAKKLKP EDVQGGTFTI TNPGIFGAKF GMPIISQPQL AILGIGAITK VPMVVTDKDG NDSIAIRSRC HISIGYDHRV IDGAVADQFM VVVRDYLQNW NEPLI
|
| |