Gene Rleg_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1796 
Symbol 
ID8012855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1789516 
End bp1790562 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content61% 
IMG OID644824387 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_002975620 
Protein GI241204524 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.281194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0011937 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCCGC GAAAGACCGC GACCGTTTCC AGCCGCAAAA CTGCAGCAAA ACCGGCAGCC 
AAAGCATCGA ATGGAGGCCC GGTAGCCGAC TTCGATCGCA ATGAAGAGCT GAAGGCCTAT
CGCGAGATGC TGCTGATCCG CCGCTTCGAG GAGAAGGCCG GCCAGCTTTA CGGCATGGGG
TTCATCGGCG GCTTTTGTCA CCTCTACATC GGTCAGGAAG CTGTCGTCGT CGGCATGCAG
ATGGCGCAGA AGGATGGCGA CCAGGTCATC ACCGCCTATC GCGACCACGG TCATATGCTG
GCAACCGGCA TGGAAGCGCG TGGCGTCATG GCGGAACTGA CCGGGCGCCG CAGCGGCTAT
TCCCACGGCA AGGGCGGCTC GATGCACATG TTCTCGAAAG AGAAGCATTT CTACGGCGGT
CACGGCATCG TCGGTGCCCA GGTCTCGCTC GGAACGGGTC TTGCCTTCGC AAACCGCTAC
CGCGGCAATG ACAATGTCTC CATCGCCTAT TTCGGCGACG GCGCTGCCAA CCAGGGCCAG
GTCTACGAGA GCTTCAACAT GGCTGCTCTC TGGAAGCTGC CGATCGTCTA CATCGTCGAG
AACAACCGTT ACGCCATGGG CACCTCGACT GCCCGCGCCA CCGCGCAGTC GAATTACTCG
CTTCGCGGAT CCGGTTTCGG CATCCCCGGC ATTCAGGTCG ATGGCATGGA CGTCCGCGCC
GTCAAGGCGG CCGCCGACGA GGCGCTCGAG CATTGCCGTT CCGGCAAGGG TCCGATCATT
CTCGAAATGC TGACCTATCG TTATCGCGGT CACTCGATGT CCGACCCGGC GAAGTATCGC
TCCAAGGACG AAGTGCAGAA GATGCGCTCC GAGCATGACC CGATCGAACA GGTCAAGGCA
CGCCTCGTCG AAAAGGGCTG GGCTTCCGAA GACGATCTGA AGGCGATCGA CAAGGATGTT
CGCGACATCG TCGCCGATAG CGCCGATTTC GCCCAGGCCG ATCCGGAGCC GGATGCATCC
GAGCTCTATA CCGACATTCT GCTCTAA
 
Protein sequence
MAPRKTATVS SRKTAAKPAA KASNGGPVAD FDRNEELKAY REMLLIRRFE EKAGQLYGMG 
FIGGFCHLYI GQEAVVVGMQ MAQKDGDQVI TAYRDHGHML ATGMEARGVM AELTGRRSGY
SHGKGGSMHM FSKEKHFYGG HGIVGAQVSL GTGLAFANRY RGNDNVSIAY FGDGAANQGQ
VYESFNMAAL WKLPIVYIVE NNRYAMGTST ARATAQSNYS LRGSGFGIPG IQVDGMDVRA
VKAAADEALE HCRSGKGPII LEMLTYRYRG HSMSDPAKYR SKDEVQKMRS EHDPIEQVKA
RLVEKGWASE DDLKAIDKDV RDIVADSADF AQADPEPDAS ELYTDILL