Gene Rleg_2271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2271 
Symbol 
ID8013272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2276756 
End bp2277745 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content62% 
IMG OID644824856 
Productaldo/keto reductase 
Protein accessionYP_002976086 
Protein GI241204990 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0397783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.307626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAGC GTGAACTTGG AAAGAGCGGA CTTCAAGTCT CGGCCGTCGG TCTCGGCTGC 
ATGGGGCTGA GTTACGGGTA TGGCCCGGCG ACAGATATTC AGGAAGCGAC CGTACTGATC
CGGCGGGCAT TTGAACGCGG CGTGACCTTC TTCGACACGG CCGAGGCCTA TGGCCCCTAT
AAGAACGAAG AGCTTCTGGG AGAGGCGCTC GCCCCCTTCC GCAACGAGGT GGTGATCGCC
ACGAAATTCG GTTTCAACTT CGATGCCAAT GGCGGCCAGA GCGGCATGAA CAGCCGGCCC
AAGCAGATCC GCGCGGTGGC CGACCAGGCG CTGAAGCGTT TGAAGACTGA TGTCATCGAT
CTCTTTTACC AGCATCGCGT CGATCCCGAT GTTCCGATCG AGGATGTCGC CGGCACGGTC
AAGGCGCTGA TCGCGGAAGG CAAGGTCAGG CATTTCGGCC TCTCGGAAGC GGGCGCCCGG
ACGATCCGCC GCGCCCATGC CGTCCAGCCG GTGGCGGCGT TGCAGAGCGA ATATTCGCTG
TGGTGGCGCG AACCAGAGCA GGAAATCCTG CCGACGCTTG AAGAACTCGG CATCGGCTTC
GTGCCCTTCA GCCCGCTCGG TAAGGGCTTT CTGACTGGCG CGATCAGCGA AACGACCACC
TTCGACAGCA AGGATTTCCG CAACGTCGTG CCCCGCTTTT CTCAGGAGGC GCGAAAAGCC
AACCAAGCGC TCGTAGATCG TCTCGGAGAA ATCGCCGCCC GCAAGAAGGC TACCTCCGCC
CAAGTGGCTC TCGCATGGCT GCTGGCGCAG AAGCCCTGGA TCGTGCCGAT CCCCGGCACC
ACCAAGCTGC ACCGCCTCGA GGAGAACATC CAGGCCGCCG AGGTCGAACT GACGGCCGAG
GATCTTGCCA GCATCGAAAG CGCGCTGGCC ACGATCAAGG TGGAAGGCGA TCGTTATCCC
GCGCACCTGC AAGCCAGGGT CAACCGCTAA
 
Protein sequence
MQKRELGKSG LQVSAVGLGC MGLSYGYGPA TDIQEATVLI RRAFERGVTF FDTAEAYGPY 
KNEELLGEAL APFRNEVVIA TKFGFNFDAN GGQSGMNSRP KQIRAVADQA LKRLKTDVID
LFYQHRVDPD VPIEDVAGTV KALIAEGKVR HFGLSEAGAR TIRRAHAVQP VAALQSEYSL
WWREPEQEIL PTLEELGIGF VPFSPLGKGF LTGAISETTT FDSKDFRNVV PRFSQEARKA
NQALVDRLGE IAARKKATSA QVALAWLLAQ KPWIVPIPGT TKLHRLEENI QAAEVELTAE
DLASIESALA TIKVEGDRYP AHLQARVNR