Gene Rleg2_5658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5658 
Symbol 
ID6977049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp46640 
End bp47671 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content65% 
IMG OID643393115 
Productaldo/keto reductase 
Protein accessionYP_002277933 
Protein GI209546043 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.590758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTATC GCAAGCTCGG TCCCAGCGGG ACCGTCGTCA CCGCCTATTG CCTGGGCACC 
ATGACCTTCG GCGCGGAGGC CGACGAAGCG GCCTCGCACA AGCTGCTCGA CGATTATTTC
GCCTGGGGCG GCAATTTCAT CGATACCGCC GATGTCTACA GCGCCGGCAA GTCGGAAGAG
ATCATCGGAC GCTGGCTGAA GGCGCGCCCG ACCGAAGCCC GCCAGGCGAT CGTCGCCACC
AAGGGCCGTT TTCCGATGGG CAACGGTCCC AACGACATCG GCCTGTCGCG CCGCCATCTC
GGCCAGGCGC TCGACGATTC TCTGCGCCGC CTCGGCCTTG AGCAGATCGA CCTCTACCAG
ATGCATGCCT GGGACGCGCT GACTCCGATC GAGGAAACGC TGCGCTTCCT CGACGATGCG
GTTTCATCAG GCAAGATCGG CTATTACGGC TTCTCCAACT ATGTCGGCTG GCATATCGCC
AAGGCCTCCG AGATTGCCAA GGCGCGCGGT TATACCCGCC CGGTGACGCT GCAGCCGCAA
TATAACCTGC TGGTGCGCGA CATCGAGCTC GAGATCGTCG CGGCCTGCCA GGATGCCGGC
ATGGGGCTGT TGCCCTGGTC GCCGCTCGGG GGCGGCTGGC TGACCGGCAA ATACAAGCGC
GACGAGATGC CGACCGGCGC CACCCGCCTC GGCGAAAATC CCAATCGCGG CGGCGAATCC
TATGCGCCGC GCAATGCGAT GGAACGAACC TGGGCGATCA TCGCTGCTGT CGAGGAAATC
GCCAAGGCGC ACGGCGTCAG CATGGCGCAG GTGGCGCTCG CCTGGACGGC GGCGCAGCCG
GCAATCACCT CGGTCATCCT CGGCGCCCGC ACGCCGGAGC AACTGGCCGA CAATCTCGGC
GCCATGAAGC TCAAGCTCTC CGACGAAGAC ATGACGCGAC TGAATGAGGT CAGCGCCCCT
CAGCCCTTCG ACTATCCCTA CGGCAAGGGC GGCATCAACC AGCGCCACCG CAAGATCGAA
GGCGGCCGCT GA
 
Protein sequence
MDYRKLGPSG TVVTAYCLGT MTFGAEADEA ASHKLLDDYF AWGGNFIDTA DVYSAGKSEE 
IIGRWLKARP TEARQAIVAT KGRFPMGNGP NDIGLSRRHL GQALDDSLRR LGLEQIDLYQ
MHAWDALTPI EETLRFLDDA VSSGKIGYYG FSNYVGWHIA KASEIAKARG YTRPVTLQPQ
YNLLVRDIEL EIVAACQDAG MGLLPWSPLG GGWLTGKYKR DEMPTGATRL GENPNRGGES
YAPRNAMERT WAIIAAVEEI AKAHGVSMAQ VALAWTAAQP AITSVILGAR TPEQLADNLG
AMKLKLSDED MTRLNEVSAP QPFDYPYGKG GINQRHRKIE GGR