Gene Rleg2_4387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4387 
Symbol 
ID6977481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp16995 
End bp18224 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content65% 
IMG OID643393567 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_002278385 
Protein GI209546467 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAAT TCATCATCAA GATGCCCGAC GTCGGGGAAG GCGTCGCCGA GGCCGAGCTT 
GTCGAATGGC ATGTGAAGGC GGGAGATCCG GTTCGCGAGG ACATGGTGAT CGCCGCCGTC
ATGACCGACA AGGCGACGGT GGAAATTCCC TCTCCCGTCA ACGGCACCGT TATCTGGCTT
GCGGGCGAGG TCGGAGACCG TATCGCGGTC AAGGCGCCGC TGGTGCGGAT TGAGACGGCG
GGCGATGCCG GCGAGGCTCA GCCCGTGCAG ATCTCGCAGG GGCCGGTTGC CGAGACAACG
AAAGTCGAGA CTGCGAAGGC TGCCCCGGCA GCGCCAGCTC CTGCGGCTGC ACCGGCTGAA
AAGCCGCTCG CCTCGCCTTC CGTGCGGCTC TTCGCCAGGG AAAACGGCGT CGATCTCAGG
CAAGTGCAGG GGACGGGACC GGCCGGGCGC ATCCTGCGTG AGGATATCGA GCAGTTCCTG
GCTCAGGGAA CCGCGCCTGT GACGGCCAAG AACGGTTTTG CCAGGAAGAC GGCGACCGAG
GAGATCAAGC TGACCGGCCT GCGCCGCCGC ATCGCCGAGA AAATGGTGCT CTCCACCTCG
CGCATCCCCC ACATCACCTA TGTGGAGGAA GTCGATATGA CTGCGCTGGA AGAATTGCGC
GCCACCATGA ACGGCGATCG CAGGGAAGGT CATCCGAAGC TGACGGTTCT GCCCTTCCTG
ATGCGGGCGC TGGTCAAAGC CATTGCCGAG CAGCCGGAGG TCAACGCCAC CTTCGACGAC
GATGCCGGCC TCATCACGCG TTATAGCGCC GTCCATATCG GCATCGCCAC GCAGACGCCG
GCCGGCCTGA CCGTGCCGGT GGTGCGGCAT GCGGAAGCCC GCGGCATCTG GGATTGCGCC
GCCGAGATGA ACCGGTTGGC GGAAGCGGCG CGCTCGGGCA CTGCGACGCG CGACGAGCTT
TTGGGCTCGA CGATCACCAT CAGCTCGCTC GGCGCGCTCG GCGGCATCGT CTCGACGCCG
GTCATCAACC ATCCTGAAGT GGCAATCATC GGCGTCAACA AGATTGCGAC GCGGCCGGTC
TGGGACGGTG CGCAATTCGT GCCGCGCAAG ATGATGAACC TCTCCTCGAG CTTCGATCAT
CGCATCATCG ACGGCTGGGA TGCGGCCACC TTCGTGCAGC GCATCCGCGC GCTGCTCGAA
ACCCCGGCGC TCATTTTCAT CGAAGGCTGA
 
Protein sequence
MGEFIIKMPD VGEGVAEAEL VEWHVKAGDP VREDMVIAAV MTDKATVEIP SPVNGTVIWL 
AGEVGDRIAV KAPLVRIETA GDAGEAQPVQ ISQGPVAETT KVETAKAAPA APAPAAAPAE
KPLASPSVRL FARENGVDLR QVQGTGPAGR ILREDIEQFL AQGTAPVTAK NGFARKTATE
EIKLTGLRRR IAEKMVLSTS RIPHITYVEE VDMTALEELR ATMNGDRREG HPKLTVLPFL
MRALVKAIAE QPEVNATFDD DAGLITRYSA VHIGIATQTP AGLTVPVVRH AEARGIWDCA
AEMNRLAEAA RSGTATRDEL LGSTITISSL GALGGIVSTP VINHPEVAII GVNKIATRPV
WDGAQFVPRK MMNLSSSFDH RIIDGWDAAT FVQRIRALLE TPALIFIEG