Gene Information |
Locus tag | ECH_0098 |
Symbol | pdhC |
ID | 3927745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 87593 |
End bp | 88843 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901222 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_506926 |
Protein GI | 88657701 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
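The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes the terminal stop codon; the record states neither convention explicitly. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 87593, End bp 88843.
length_bp = gene_length(87593, 88843)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the coordinates give 1251 bp and 416 aa, which agrees with the Gene Length and Protein Length rows.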
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.66929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATAG AAGTATTGAT GCCTGCTCTT TCTCCAACTA TGAAAAGTGG CACTATAAGA AAGTGGTATA AAGCAGAAGG GGATGTAGTA AAATCAGGTG ATGTAATAGC TGATATAGAA ACTGATAAAG CTGTTATGGA ATGTGAGTAT ACAGATGAAG ATGGTATCAT GGGTAAAATT TTTTTTGCTG AAGGAAGCAA AAATATAGAG GTAAATCAAT TGATAGCATT GATTGCTGTA GATGAACAGG ATTTAGCTAA AGTTCATTCA TATGAAAAAG GTGATAATGT TGTAAAAAAT GAGTTAGTTG CGTTACAAGA TAGTCAACCT GCTCAAGATG AGTCAGTTGT ATTGCAAATG AATCAACAAA TTGTGAATGC TAGCGAAGTA TTGGTTAATT CATCTAATTC TTCTGAAAGA GTTAAGGTGA GTCCTTTAGC AAAAAAAATT GCTTCGAACC TTGGTGTTGA TGTAAATTTG GTAAAAGGAA CAGGCCCGTA CGGTAGGATT ATAAAAGCTG ATATATTGGA TGTTATAAAT CAGCATGGTC ACATTGCTAA CTCTCCTGAG GATGCTTCTT TTACTGAAAT TAGTAGTATG CGTAGAGTAA TAGCAGAACG TTTAGTGTAT TCAAAACAAA CAATTCCTCA CTTTTACGTT TCAATAGATT GTCTTGTGGA TAGTTTGTTA AAATTAAGGT TAGAAATAAA TGCTGAAAAT CCTGATACTA AGGTTACAGT GAATGATTTC ATCATTAAAG CTGTTGCCAT GAGTATTAAG AAATTTCCTG AAATTAATGT ATCTTGGTCT GATGATAAGA TAGTTGTATT TCCTAGTATT GATATTTCTG TTGCTGTATC TATTGATAAT GGACTTATCA CACCAATTAT TTTTGGTGCA GATAAGAAAT CTTTGTTAGA AATATCAAGA GAAGTGAAAG CATTAGCTAG TAAGGCTAAA TCTGGAAAAT TGAAACCTGA AGAATTTCAA GGCGGAGGTT TTACTGTTTC TAATCTTGGT ATGTTTGGTA TTAAAGAATT CTACGCAATT GTTAATCCTC CACAGTCTTG TATAATGTCT GTTGGATGCT CTGAAAAACG AGCAATGGTT GTTAATGAGC AAATATGTAT TTCAAATGTT GTGACAGTAA CATTATCTGT AGACCATAGA GTTATTGATG GTGTACTAGC GGCAAAATTT TTAAATTGCT TTAAATCTTA TCTAGAGAAA CCATTTTTAA TGTTGATATA A
|
Protein sequence | MPIEVLMPAL SPTMKSGTIR KWYKAEGDVV KSGDVIADIE TDKAVMECEY TDEDGIMGKI FFAEGSKNIE VNQLIALIAV DEQDLAKVHS YEKGDNVVKN ELVALQDSQP AQDESVVLQM NQQIVNASEV LVNSSNSSER VKVSPLAKKI ASNLGVDVNL VKGTGPYGRI IKADILDVIN QHGHIANSPE DASFTEISSM RRVIAERLVY SKQTIPHFYV SIDCLVDSLL KLRLEINAEN PDTKVTVNDF IIKAVAMSIK KFPEINVSWS DDKIVVFPSI DISVAVSIDN GLITPIIFGA DKKSLLEISR EVKALASKAK SGKLKPEEFQ GGGFTVSNLG MFGIKEFYAI VNPPQSCIMS VGCSEKRAMV VNEQICISNV VTVTLSVDHR VIDGVLAAKF LNCFKSYLEK PFLMLI
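The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a record could be processed follows: it strips the spacing, computes the GC fraction, and translates the gene. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ in their permitted start codons), so a standard codon table is used. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon table, built in the conventional TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO)
)


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGCCAATAG AAGTATTGAT GCCTGCTCTT"))  # MPIEVLMPAL
```

The first three blocks of the gene translate to MPIEVLMPAL, the first block of the listed protein sequence. Applying `gc_fraction` and `translate` to the full gene sequence would allow the same comparison against the GC content and Protein sequence rows.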