Gene Rleg_1798 details


Gene Information

Locus tag: Rleg_1798
Symbol:
ID: 8012856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1791985
End bp: 1793349
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 65%
IMG OID: 644824389
Product: pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase
Protein accession: YP_002975622
Protein GI: 241204526
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form
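
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1791985, 1793349)
print(length, protein_length(length))
```

Under these assumptions the values agree with the Gene Length (1365 bp) and Protein Length (454 aa) fields in this record.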


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.408526
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00150502
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGATCA ATATCACGAT GCCCGCCCTC TCTCCGACCA TGGAGGAAGG CAACCTTTCC 
AAATGGCTGG TCAAGGAAGG CGACAAGGTC AAGTCTGGCG ATGTGATCGC CGAGATCGAG
ACCGACAAGG CGACGATGGA AGTCGAAGCC GTCGATGAAG GCACGGTCGC CAAGCTCGTC
GTTGCCGCCG GCACCGAAGG CGTCAAGGTC AATGCGCTGA TTGCGGTTCT CGCCGCCGAT
GGCGAGGATG TCTCCGCTGC CGCAAGCAGT GCGGGTTCCG CTGCTCCGGC ACCGAAAGCT
GACGGTGCAG CCGCGCCGAA GGCCGAAGCT GCACCGGCTC CGGCCCAGTC TACTCCGGCT
GCGGCACCTG TAGCCGCCGC TGCACCCGCA TCGGTGTCAT CTGATGGCAG CCGCGCCTTC
TCTTCGCCGC TTGCCCGCAG GCTGGCCAAG GAAGCCGGTA TCGACCTTTC GGCAGTCGCA
GGCTCCGGCC CGCACGGCCG CGTCGTCAAG AGCGACATCG AAGCCGCCCT TGCCGGCGGC
GGCGCCAAGG CCGCAGCCCC CGCCGCTGCT GCTTCCGCTC CGCAAGCCTC CGCAGCTCCG
GCTCCGGCCG CCGCTGCCCC GAAGGGCGCT TCCGAAGAAG CCGTGCTCAA GCTCTTCGAA
CCGGGCTCCT ACGAGCTCGT GCCGCATGAC GGCATGCGCA AGACGATCGC CAGGCGCCTG
GTCGAATCCA AGCAGACGAT CCCGCATTTC TACGTCAGCG TCGATTGCGA ACTCGATGCG
CTTCTGGCGC TGCGTGCCCA GCTGAACGAT GCGGCTCCGC GCAAGGATAA CGCTCCGGCC
TACAAGCTCT CGGTCAACGA CATGGTCATC AAGGCCATGG CGCTGTCGCT GCGCGACGTT
CCGGATGCGA ACGTCTCCTG GACCGACAAC AACATGATCA AGCACAAGCA TGCCGATGTC
GGCGTTGCTG TCTCGATCCC CGGCGGCCTG ATCACGCCGA TCATCCGCAA GGCCGAGGAA
AAGACCCTGT CGACGATCTC CAACGAGATG CGCGATCTCG GCAAGCGGGC CAAGGACCGC
AAGCTGAAGC CTGAGGAATA TCAGGGCGGC ACCAGTTCGG TCTCGAACAT GGGCATGATG
GGCGTGAAGA ACTTCGCAGC CGTGGTCAAC CCGCCGCATG CGACGATCCT CGCGGTCGGC
GCCGGCGAAC AGCGGGTCGT CGTCAAGAAG GGCGAGATGG CGATTGCGAC CGTGATGTCC
GTCACGCTCT CGACGGACCA TCGCTGCGTC GATGGCGCGC TCGGCGCCGA GCTGCTCCAG
GCCTTCAAGG GCTACATCGA AAACCCGATG GGCATGCTTG TCTGA
 
Protein sequence
MPINITMPAL SPTMEEGNLS KWLVKEGDKV KSGDVIAEIE TDKATMEVEA VDEGTVAKLV 
VAAGTEGVKV NALIAVLAAD GEDVSAAASS AGSAAPAPKA DGAAAPKAEA APAPAQSTPA
AAPVAAAAPA SVSSDGSRAF SSPLARRLAK EAGIDLSAVA GSGPHGRVVK SDIEAALAGG
GAKAAAPAAA ASAPQASAAP APAAAAPKGA SEEAVLKLFE PGSYELVPHD GMRKTIARRL
VESKQTIPHF YVSVDCELDA LLALRAQLND AAPRKDNAPA YKLSVNDMVI KAMALSLRDV
PDANVSWTDN NMIKHKHADV GVAVSIPGGL ITPIIRKAEE KTLSTISNEM RDLGKRAKDR
KLKPEEYQGG TSSVSNMGMM GVKNFAAVVN PPHATILAVG AGEQRVVVKK GEMAIATVMS
VTLSTDHRCV DGALGAELLQ AFKGYIENPM GMLV
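
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table by hand rather than using a bioinformatics library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts, so alternative start codons are not handled here; this gene begins with ATG, so that does not matter for this record. The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCCGATCA ATATCACGAT GCCCGCCCTC"))
print(translate("GGCATGCTTG TCTGA"))
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above (MPINITMPAL and GMLV).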