Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5658 |
Symbol | |
ID | 6977049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 46640 |
End bp | 47671 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643393115 |
Product | aldo/keto reductase |
Protein accession | YP_002277933 |
Protein GI | 209546043 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.590758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTATC GCAAGCTCGG TCCCAGCGGG ACCGTCGTCA CCGCCTATTG CCTGGGCACC ATGACCTTCG GCGCGGAGGC CGACGAAGCG GCCTCGCACA AGCTGCTCGA CGATTATTTC GCCTGGGGCG GCAATTTCAT CGATACCGCC GATGTCTACA GCGCCGGCAA GTCGGAAGAG ATCATCGGAC GCTGGCTGAA GGCGCGCCCG ACCGAAGCCC GCCAGGCGAT CGTCGCCACC AAGGGCCGTT TTCCGATGGG CAACGGTCCC AACGACATCG GCCTGTCGCG CCGCCATCTC GGCCAGGCGC TCGACGATTC TCTGCGCCGC CTCGGCCTTG AGCAGATCGA CCTCTACCAG ATGCATGCCT GGGACGCGCT GACTCCGATC GAGGAAACGC TGCGCTTCCT CGACGATGCG GTTTCATCAG GCAAGATCGG CTATTACGGC TTCTCCAACT ATGTCGGCTG GCATATCGCC AAGGCCTCCG AGATTGCCAA GGCGCGCGGT TATACCCGCC CGGTGACGCT GCAGCCGCAA TATAACCTGC TGGTGCGCGA CATCGAGCTC GAGATCGTCG CGGCCTGCCA GGATGCCGGC ATGGGGCTGT TGCCCTGGTC GCCGCTCGGG GGCGGCTGGC TGACCGGCAA ATACAAGCGC GACGAGATGC CGACCGGCGC CACCCGCCTC GGCGAAAATC CCAATCGCGG CGGCGAATCC TATGCGCCGC GCAATGCGAT GGAACGAACC TGGGCGATCA TCGCTGCTGT CGAGGAAATC GCCAAGGCGC ACGGCGTCAG CATGGCGCAG GTGGCGCTCG CCTGGACGGC GGCGCAGCCG GCAATCACCT CGGTCATCCT CGGCGCCCGC ACGCCGGAGC AACTGGCCGA CAATCTCGGC GCCATGAAGC TCAAGCTCTC CGACGAAGAC ATGACGCGAC TGAATGAGGT CAGCGCCCCT CAGCCCTTCG ACTATCCCTA CGGCAAGGGC GGCATCAACC AGCGCCACCG CAAGATCGAA GGCGGCCGCT GA
|
Protein sequence | MDYRKLGPSG TVVTAYCLGT MTFGAEADEA ASHKLLDDYF AWGGNFIDTA DVYSAGKSEE IIGRWLKARP TEARQAIVAT KGRFPMGNGP NDIGLSRRHL GQALDDSLRR LGLEQIDLYQ MHAWDALTPI EETLRFLDDA VSSGKIGYYG FSNYVGWHIA KASEIAKARG YTRPVTLQPQ YNLLVRDIEL EIVAACQDAG MGLLPWSPLG GGWLTGKYKR DEMPTGATRL GENPNRGGES YAPRNAMERT WAIIAAVEEI AKAHGVSMAQ VALAWTAAQP AITSVILGAR TPEQLADNLG AMKLKLSDED MTRLNEVSAP QPFDYPYGKG GINQRHRKIE GGR
|
| |