Gene Information |
Locus tag | Rleg_2271 |
Symbol | |
ID | 8013272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2276756 |
End bp | 2277745 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644824856 |
Product | aldo/keto reductase |
Protein accession | YP_002976086 |
Protein GI | 241204990 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
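The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the record states neither explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Rleg_2271.
length = gene_length(2276756, 2277745)
print(length)                  # 990
print(protein_length(length))  # 329
```

Under these assumptions the result matches the Gene Length (990 bp) and Protein Length (329 aa) rows.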
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0397783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.307626 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAGC GTGAACTTGG AAAGAGCGGA CTTCAAGTCT CGGCCGTCGG TCTCGGCTGC ATGGGGCTGA GTTACGGGTA TGGCCCGGCG ACAGATATTC AGGAAGCGAC CGTACTGATC CGGCGGGCAT TTGAACGCGG CGTGACCTTC TTCGACACGG CCGAGGCCTA TGGCCCCTAT AAGAACGAAG AGCTTCTGGG AGAGGCGCTC GCCCCCTTCC GCAACGAGGT GGTGATCGCC ACGAAATTCG GTTTCAACTT CGATGCCAAT GGCGGCCAGA GCGGCATGAA CAGCCGGCCC AAGCAGATCC GCGCGGTGGC CGACCAGGCG CTGAAGCGTT TGAAGACTGA TGTCATCGAT CTCTTTTACC AGCATCGCGT CGATCCCGAT GTTCCGATCG AGGATGTCGC CGGCACGGTC AAGGCGCTGA TCGCGGAAGG CAAGGTCAGG CATTTCGGCC TCTCGGAAGC GGGCGCCCGG ACGATCCGCC GCGCCCATGC CGTCCAGCCG GTGGCGGCGT TGCAGAGCGA ATATTCGCTG TGGTGGCGCG AACCAGAGCA GGAAATCCTG CCGACGCTTG AAGAACTCGG CATCGGCTTC GTGCCCTTCA GCCCGCTCGG TAAGGGCTTT CTGACTGGCG CGATCAGCGA AACGACCACC TTCGACAGCA AGGATTTCCG CAACGTCGTG CCCCGCTTTT CTCAGGAGGC GCGAAAAGCC AACCAAGCGC TCGTAGATCG TCTCGGAGAA ATCGCCGCCC GCAAGAAGGC TACCTCCGCC CAAGTGGCTC TCGCATGGCT GCTGGCGCAG AAGCCCTGGA TCGTGCCGAT CCCCGGCACC ACCAAGCTGC ACCGCCTCGA GGAGAACATC CAGGCCGCCG AGGTCGAACT GACGGCCGAG GATCTTGCCA GCATCGAAAG CGCGCTGGCC ACGATCAAGG TGGAAGGCGA TCGTTATCCC GCGCACCTGC AAGCCAGGGT CAACCGCTAA
|
Protein sequence | MQKRELGKSG LQVSAVGLGC MGLSYGYGPA TDIQEATVLI RRAFERGVTF FDTAEAYGPY KNEELLGEAL APFRNEVVIA TKFGFNFDAN GGQSGMNSRP KQIRAVADQA LKRLKTDVID LFYQHRVDPD VPIEDVAGTV KALIAEGKVR HFGLSEAGAR TIRRAHAVQP VAALQSEYSL WWREPEQEIL PTLEELGIGF VPFSPLGKGF LTGAISETTT FDSKDFRNVV PRFSQEARKA NQALVDRLGE IAARKKATSA QVALAWLLAQ KPWIVPIPGT TKLHRLEENI QAAEVELTAE DLASIESALA TIKVEGDRYP AHLQARVNR
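The sequences above are printed in space-separated blocks of ten characters, so they need normalizing before any analysis. The following is a minimal sketch, not the database's own method, of how one might strip the block spacing, compute GC content, and translate the gene sequence. The helper names are hypothetical. The codon-to-residue assignments are those of the standard genetic code, which translation table 11 shares for internal codons; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def normalize(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = normalize(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence plus one base, i.e. seven codons.
print(translate("ATGCAGAAGC GTGAACTTGG A"))  # MQKRELG
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row once its block spacing is removed, and `gc_fraction` can be compared against the GC content row.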