Gene Rleg_3968 details


Gene Information

Locus tag: Rleg_3968
Symbol:
ID: 8014782
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4043488
End bp: 4044750
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 64%
IMG OID: 644826537
Product: dihydrolipoamide succinyltransferase
Protein accession: YP_002977748
Protein GI: 241206652
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01347] 2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component)
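
The start, end, gene length, and protein length values in this record are consistent with one another, and the relationship can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4043488, 4044750)
print(length, protein_length_aa(length))  # prints "1263 420"
```

Both values match the Gene Length and Protein Length fields above.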


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0489345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCAG AAATCCGCGT TCCAACTCTC GGTGAATCCG TCAGCGAGGC AACCGTCGGC 
ACCTGGTTCA AGAAGGTCGG CGACGCCATC AAGGCCGACG AGCCGATTCT CGAGCTTGAA
ACCGACAAGG TGACCATCGA AGTTCCAGCA CCCGCCTCCG GCACGCTTTC GGAAATCGTC
GTTGCCGCCG GCGAGACCGT CGGCCTCGGC GCGCTGCTCG GCCAGATCGC TGAAGGTGCT
GCCGCTGCTG CCGCGCCGGC TGCCGCTGCA CCGGCTGCCG CGCCTGCCCA GCCAGCCCCG
GCGGCGGCTG CCCAGCCAGC CCCGGTTGCC GCTGCTGCGT CGTCATCGAG CGCCTCCGTC
TCCACCATGC CGCCTGCACC GGCAGCTTCG AAGATGCTTG CCGAAAACAA CCTTTCCGCC
GATCAGGTCG ACGGTAGCGG CAAGCGCGGC CAGGTGCTGA AGGGCGACGT CATCGCTGCC
GTCGCCAAGG GCATTTCCGC CCCGGCCGCC GCACCCGCAG CAACGCCTGC CGCCGCGCGT
GGTCCGTCGA CGGTCGAGGA TGCCTCGCGC GAAGAGCGCG TGAAGATGAC GCGCCTGCGC
CAGACGATCG CCAAGCGCCT CAAGGATGCG CAGAACACCG CCGCCATGCT GACCACCTAC
AACGAGGTGG ACATGAAGGC GGTCATGGAT CTGCGCAACA AGTACAAGGA CATTTTCGAG
AAGAAGCACG GCGTCAAGCT CGGCTTCATG GGCTTCTTTA CCAAGGCGGT GACGCATGCG
CTGAAGGAAC TGCCGGCCGT CAATGCCGAA ATCGACGGCA CCGACGTCAT CTACAAGAAC
TACTGCCATG TCGGCATGGC CGTAGGTACG GACAAAGGCC TCGTCGTTCC CGTCATCCGC
GACGCCGACC AGATGTCGAT CGCCGAAATC GAGAAGGAAC TCGGCCGTCT TGCCAAGGCA
GCCCGTGATG GCTCGCTCTC CATGGCCGAC ATGCAGGGCG GCACCTTCAC CATCACCAAT
GGCGGCGTCT ACGGGTCGCT GATGTCTTCG CCGATCCTCA ACGCGCCGCA GTCCGGCATT
CTCGGCATGC ACAAGATCCA GGAGCGGCCG GTTGCGATCG GCGGCCAGGT CGTCATCCGT
CCGATGATGT ATCTGGCGCT GTCCTACGAT CACCGCATCG TCGACGGCAA GGAAGCGGTC
ACCTTCCTCG TGCGCGTCAA GGAAAGCCTG GAAGATCCGG AACGTCTGGT TCTCGATCTC
TAA
 
Protein sequence
MASEIRVPTL GESVSEATVG TWFKKVGDAI KADEPILELE TDKVTIEVPA PASGTLSEIV 
VAAGETVGLG ALLGQIAEGA AAAAAPAAAA PAAAPAQPAP AAAAQPAPVA AAASSSSASV
STMPPAPAAS KMLAENNLSA DQVDGSGKRG QVLKGDVIAA VAKGISAPAA APAATPAAAR
GPSTVEDASR EERVKMTRLR QTIAKRLKDA QNTAAMLTTY NEVDMKAVMD LRNKYKDIFE
KKHGVKLGFM GFFTKAVTHA LKELPAVNAE IDGTDVIYKN YCHVGMAVGT DKGLVVPVIR
DADQMSIAEI EKELGRLAKA ARDGSLSMAD MQGGTFTITN GGVYGSLMSS PILNAPQSGI
LGMHKIQERP VAIGGQVVIR PMMYLALSYD HRIVDGKEAV TFLVRVKESL EDPERLVLDL
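
The gene and protein sequences above are related by translation, and the GC content field is a property of the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. It assumes that sense codons in translation table 11 are assigned the same amino acids as in the standard genetic code, so the standard 64-codon table is used. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The spaces and line breaks used to lay out the sequence in blocks of ten are stripped before processing.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in T, C, A, G order (standard code; "*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove layout whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as laid out in the record.
first_block = "ATGGCCTCAG AAATCCGCGT TCCAACTCTC"
print(translate(first_block))  # prints "MASEIRVPTL"
```

The ten residues produced match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` gives values to compare against the GC content and protein sequence listed in the record.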