Gene Rleg2_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2047 
Symbol 
ID6980786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2110274 
End bp2111263 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content62% 
IMG OID643396769 
Productaldo/keto reductase 
Protein accessionYP_002281557 
Protein GI209549640 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.128714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAAGC GTGAACTCGG AAAGAGCGGT CTCGAAGTCT CGGCCATCGG TCTCGGCTGC 
ATGGGGCTAA GTTATGGATA TGGCCCAGCG ACAGATATCC AGGAAGCCGT CGCGCTGATC
CGGCAGGCGG TCGAACGTGG CGTGACCTTC TTCGACACCG CGGAAGCCTA CGGCCCCTAT
AGAAACGAAG AGCTTTTGGG AGAAGCACTT GCTCCCTTTC GCAGCGAGGT AGTGATCGCC
ACCAAATTCG GCTTCAACTT CGATGCCAAT GGCGGCCAGA GTGGCATGAA CAGCCGGCCC
GAGCAGATCC GGGCAGTTGC CGACCAGGCG TTGAAGCGCT TGAAGACCGA TGTCATCGAT
CTGTTCTACC AGCATCGCGT CGATCCGGAT GTTCCGATCG AGGACGTCGC TGGCACGGTC
AAGGCGCTGA TTTCAGAAGG CAAGGTGAAG CATTTCGGCC TCTCGGAAGC CGGCTCCAAG
ACGATCCGCC GCGCCCATGC CGTTCAGCCG GTGGCGGCGC TGCAGAGCGA ATATTCGCTC
TGGTGGCGCG AGCCCGAGCA GGATATCCTG CCGGTGCTCG AAGAGCTCGG CATCGGCTTC
GTGCCGTTCA GCCCGCTCGG CAAGGGCTTC CTCACCGGCG CGATCAGCGA AACCACAACC
TTCGACAGCA AGGACTTCCG CAACATCGTG CCGCGCTTTT CACCGGAAGC GCGAAAGGCC
AACCAGGCGC TCGTCGATCT CCTCGCAGAG ATCGCCGCGC GCAAGCAGGC GACCTCCGCC
CAGGTGGCGC TCGCCTGGCT GCTGGCGCAA AAACCCTGGA TCGTGCCGAT CCCCGGCACC
ACCAAGCTGC ATCGCCTGGA GGAGAATATC CGGGCCGCCG AGGTCGAACT GACGGCGGAG
GATCTCGGCA ATATCGAAAG CGCGCTCGCC ACCATCAAGG TGGAAGGCGA TCGATATCCC
GCGCATCTGC AGGCAAGGGT CAATCGCTGA
 
Protein sequence
MHKRELGKSG LEVSAIGLGC MGLSYGYGPA TDIQEAVALI RQAVERGVTF FDTAEAYGPY 
RNEELLGEAL APFRSEVVIA TKFGFNFDAN GGQSGMNSRP EQIRAVADQA LKRLKTDVID
LFYQHRVDPD VPIEDVAGTV KALISEGKVK HFGLSEAGSK TIRRAHAVQP VAALQSEYSL
WWREPEQDIL PVLEELGIGF VPFSPLGKGF LTGAISETTT FDSKDFRNIV PRFSPEARKA
NQALVDLLAE IAARKQATSA QVALAWLLAQ KPWIVPIPGT TKLHRLEENI RAAEVELTAE
DLGNIESALA TIKVEGDRYP AHLQARVNR