Gene Information |
Locus tag | Rleg_1798 |
Symbol | |
ID | 8012856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1791985 |
End bp | 1793349 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644824389 |
Product | pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase |
Protein accession | YP_002975622 |
Protein GI | 241204526 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
|
|
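The coordinate and length fields above are related by simple arithmetic. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the figures in this record, not something the record states) and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values are copied from the record above. The function names are hypothetical,
# and 1-based inclusive coordinates are an assumption.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1791985, 1793349)
print(length, protein_length_aa(length))  # prints: 1365 454
```

Both results agree with the Gene Length and Protein Length fields of the record.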
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.408526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00150502 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGATCA ATATCACGAT GCCCGCCCTC TCTCCGACCA TGGAGGAAGG CAACCTTTCC AAATGGCTGG TCAAGGAAGG CGACAAGGTC AAGTCTGGCG ATGTGATCGC CGAGATCGAG ACCGACAAGG CGACGATGGA AGTCGAAGCC GTCGATGAAG GCACGGTCGC CAAGCTCGTC GTTGCCGCCG GCACCGAAGG CGTCAAGGTC AATGCGCTGA TTGCGGTTCT CGCCGCCGAT GGCGAGGATG TCTCCGCTGC CGCAAGCAGT GCGGGTTCCG CTGCTCCGGC ACCGAAAGCT GACGGTGCAG CCGCGCCGAA GGCCGAAGCT GCACCGGCTC CGGCCCAGTC TACTCCGGCT GCGGCACCTG TAGCCGCCGC TGCACCCGCA TCGGTGTCAT CTGATGGCAG CCGCGCCTTC TCTTCGCCGC TTGCCCGCAG GCTGGCCAAG GAAGCCGGTA TCGACCTTTC GGCAGTCGCA GGCTCCGGCC CGCACGGCCG CGTCGTCAAG AGCGACATCG AAGCCGCCCT TGCCGGCGGC GGCGCCAAGG CCGCAGCCCC CGCCGCTGCT GCTTCCGCTC CGCAAGCCTC CGCAGCTCCG GCTCCGGCCG CCGCTGCCCC GAAGGGCGCT TCCGAAGAAG CCGTGCTCAA GCTCTTCGAA CCGGGCTCCT ACGAGCTCGT GCCGCATGAC GGCATGCGCA AGACGATCGC CAGGCGCCTG GTCGAATCCA AGCAGACGAT CCCGCATTTC TACGTCAGCG TCGATTGCGA ACTCGATGCG CTTCTGGCGC TGCGTGCCCA GCTGAACGAT GCGGCTCCGC GCAAGGATAA CGCTCCGGCC TACAAGCTCT CGGTCAACGA CATGGTCATC AAGGCCATGG CGCTGTCGCT GCGCGACGTT CCGGATGCGA ACGTCTCCTG GACCGACAAC AACATGATCA AGCACAAGCA TGCCGATGTC GGCGTTGCTG TCTCGATCCC CGGCGGCCTG ATCACGCCGA TCATCCGCAA GGCCGAGGAA AAGACCCTGT CGACGATCTC CAACGAGATG CGCGATCTCG GCAAGCGGGC CAAGGACCGC AAGCTGAAGC CTGAGGAATA TCAGGGCGGC ACCAGTTCGG TCTCGAACAT GGGCATGATG GGCGTGAAGA ACTTCGCAGC CGTGGTCAAC CCGCCGCATG CGACGATCCT CGCGGTCGGC GCCGGCGAAC AGCGGGTCGT CGTCAAGAAG GGCGAGATGG CGATTGCGAC CGTGATGTCC GTCACGCTCT CGACGGACCA TCGCTGCGTC GATGGCGCGC TCGGCGCCGA GCTGCTCCAG GCCTTCAAGG GCTACATCGA AAACCCGATG GGCATGCTTG TCTGA
|
Protein sequence | MPINITMPAL SPTMEEGNLS KWLVKEGDKV KSGDVIAEIE TDKATMEVEA VDEGTVAKLV VAAGTEGVKV NALIAVLAAD GEDVSAAASS AGSAAPAPKA DGAAAPKAEA APAPAQSTPA AAPVAAAAPA SVSSDGSRAF SSPLARRLAK EAGIDLSAVA GSGPHGRVVK SDIEAALAGG GAKAAAPAAA ASAPQASAAP APAAAAPKGA SEEAVLKLFE PGSYELVPHD GMRKTIARRL VESKQTIPHF YVSVDCELDA LLALRAQLND AAPRKDNAPA YKLSVNDMVI KAMALSLRDV PDANVSWTDN NMIKHKHADV GVAVSIPGGL ITPIIRKAEE KTLSTISNEM RDLGKRAKDR KLKPEEYQGG TSSVSNMGMM GVKNFAAVVN PPHATILAVG AGEQRVVVKK GEMAIATVMS VTLSTDHRCV DGALGAELLQ AFKGYIENPM GMLV
|
| |
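The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below strips the spacing, translates codons until the first stop codon, and computes GC content. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons, which does not matter here because the gene begins with ATG). All function names are illustrative, and only the first three blocks of the gene sequence are used as input.

```python
# Codon table laid out in the conventional T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an unambiguous DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * first + 4 * second + third]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGCCGATCA ATATCACGAT GCCCGCCCTC")
print(translate(prefix))  # prints: MPINITMPAL
```

The translated prefix matches the first block of the protein sequence in the record. Passing the complete gene sequence to `gc_percent` would give the value to compare against the GC content field.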