Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2047 |
Symbol | |
ID | 6980786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2110274 |
End bp | 2111263 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396769 |
Product | aldo/keto reductase |
Protein accession | YP_002281557 |
Protein GI | 209549640 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.128714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAAGC GTGAACTCGG AAAGAGCGGT CTCGAAGTCT CGGCCATCGG TCTCGGCTGC ATGGGGCTAA GTTATGGATA TGGCCCAGCG ACAGATATCC AGGAAGCCGT CGCGCTGATC CGGCAGGCGG TCGAACGTGG CGTGACCTTC TTCGACACCG CGGAAGCCTA CGGCCCCTAT AGAAACGAAG AGCTTTTGGG AGAAGCACTT GCTCCCTTTC GCAGCGAGGT AGTGATCGCC ACCAAATTCG GCTTCAACTT CGATGCCAAT GGCGGCCAGA GTGGCATGAA CAGCCGGCCC GAGCAGATCC GGGCAGTTGC CGACCAGGCG TTGAAGCGCT TGAAGACCGA TGTCATCGAT CTGTTCTACC AGCATCGCGT CGATCCGGAT GTTCCGATCG AGGACGTCGC TGGCACGGTC AAGGCGCTGA TTTCAGAAGG CAAGGTGAAG CATTTCGGCC TCTCGGAAGC CGGCTCCAAG ACGATCCGCC GCGCCCATGC CGTTCAGCCG GTGGCGGCGC TGCAGAGCGA ATATTCGCTC TGGTGGCGCG AGCCCGAGCA GGATATCCTG CCGGTGCTCG AAGAGCTCGG CATCGGCTTC GTGCCGTTCA GCCCGCTCGG CAAGGGCTTC CTCACCGGCG CGATCAGCGA AACCACAACC TTCGACAGCA AGGACTTCCG CAACATCGTG CCGCGCTTTT CACCGGAAGC GCGAAAGGCC AACCAGGCGC TCGTCGATCT CCTCGCAGAG ATCGCCGCGC GCAAGCAGGC GACCTCCGCC CAGGTGGCGC TCGCCTGGCT GCTGGCGCAA AAACCCTGGA TCGTGCCGAT CCCCGGCACC ACCAAGCTGC ATCGCCTGGA GGAGAATATC CGGGCCGCCG AGGTCGAACT GACGGCGGAG GATCTCGGCA ATATCGAAAG CGCGCTCGCC ACCATCAAGG TGGAAGGCGA TCGATATCCC GCGCATCTGC AGGCAAGGGT CAATCGCTGA
|
Protein sequence | MHKRELGKSG LEVSAIGLGC MGLSYGYGPA TDIQEAVALI RQAVERGVTF FDTAEAYGPY RNEELLGEAL APFRSEVVIA TKFGFNFDAN GGQSGMNSRP EQIRAVADQA LKRLKTDVID LFYQHRVDPD VPIEDVAGTV KALISEGKVK HFGLSEAGSK TIRRAHAVQP VAALQSEYSL WWREPEQDIL PVLEELGIGF VPFSPLGKGF LTGAISETTT FDSKDFRNIV PRFSPEARKA NQALVDLLAE IAARKQATSA QVALAWLLAQ KPWIVPIPGT TKLHRLEENI RAAEVELTAE DLGNIESALA TIKVEGDRYP AHLQARVNR
|
| |