Gene Rleg2_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1603 
Symbol 
ID6980339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1630116 
End bp1631162 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content62% 
IMG OID643396328 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_002281119 
Protein GI209549202 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0112007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCGC GAAAGACCGC GACCGTTTCC AGCCGCAAAA CTGCAGCGAA ACCGGCAGCC 
AAGGCATCGA ATGGAGGCCC GGTAGCCGAC TTCGATCGCG ATGAGGAGCT CAAGGCCTAT
CGCGAGATGC TGCTGATCCG CCGCTTCGAG GAGAAGGCCG GCCAGCTTTA CGGCATGGGC
TTCATCGGCG GCTTTTGCCA CCTTTACATC GGTCAGGAAG CTGTCGTCGT CGGCATGCAG
ATGGCGCAGA AGGAAGGCGA CCAGGTCATC ACCGCCTATC GCGATCACGG CCACATGCTG
GCAACCGGCA TGGAAGCGCG CGGCGTCATG GCGGAGTTGA CCGGCCGCCG CAGCGGCTAT
TCCCACGGGA AGGGCGGCTC GATGCACATG TTCTCGAAAG AGAAGCATTT CTACGGCGGC
CACGGCATCG TCGGCGCCCA GGTTTCGCTC GGAACCGGTC TTGCCTTTGC AAACCATTAC
CGCGGCAACG GCAATGTCTC GATTGCCTAT TTCGGCGATG GCGCCGCCAA CCAGGGCCAG
GTCTACGAGA GCTTCAACAT GGCGGCTCTC TGGAAGCTGC CGATCGTCTA TATCGTCGAA
AACAACCGTT ACGCCATGGG CACCTCGACG GCACGCGCCA CCGCGCAGTC GAACTACTCG
CTGCGCGGCT CCGGCTTCGG CATTCCCGGC ATCCAGGTCG ACGGCATGGA CGTTCGCGCC
GTCAAGGCGG CCGCTGACGA GGCGCTCGAA CATTGCCGCT CCGGCAAGGG TCCGATCATC
CTCGAAATGC TGACCTATCG TTATCGCGGT CACTCCATGT CGGATCCGGC GAAATATCGC
TCGAAGGAAG AAGTGCAGAA GATGCGCTCC GAGCAGGACC CGATCGAGCA GGTCAAGGCG
CGCCTCATCG AAAAGGGTTG GGCCTCGGAA GACGATCTGA AGGCGATCGA CAAGGATATC
CGCGACATCG TCGCCGACAG CGCCGACTTC GCCCAGGCCG ATCCGGAGCC GGATGCATCC
GCGCTCTACA CCGACATTCT GCTCTAA
 
Protein sequence
MAPRKTATVS SRKTAAKPAA KASNGGPVAD FDRDEELKAY REMLLIRRFE EKAGQLYGMG 
FIGGFCHLYI GQEAVVVGMQ MAQKEGDQVI TAYRDHGHML ATGMEARGVM AELTGRRSGY
SHGKGGSMHM FSKEKHFYGG HGIVGAQVSL GTGLAFANHY RGNGNVSIAY FGDGAANQGQ
VYESFNMAAL WKLPIVYIVE NNRYAMGTST ARATAQSNYS LRGSGFGIPG IQVDGMDVRA
VKAAADEALE HCRSGKGPII LEMLTYRYRG HSMSDPAKYR SKEEVQKMRS EQDPIEQVKA
RLIEKGWASE DDLKAIDKDI RDIVADSADF AQADPEPDAS ALYTDILL