Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2472 |
Symbol | |
ID | 8013449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2471010 |
End bp | 2471999 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644825053 |
Product | aldo/keto reductase |
Protein accession | YP_002976283 |
Protein GI | 241205187 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC GCAATATCGG CGGCCTCGAA GTCTCGGCAT TTGGTCTCGG CTGCATGAGC ATGAGTGCCG CTTACGGCCC GCCCGCCGCC GAGGGCGACA TGATCAAATT GATGCGCACC GCCCATCAGC AGGGCGTCAC CTTGTTCGAC ACGGCCGAAG CCTATGGCCC TTTCGTCAAT GAAGAGCTTG TCGGCAAAGC GCTCGCACCG ATCCGCGACC AGGTGGTCAT CGCCACCAAA TTCGGCTTTG ATATCGATCA GCAAACAGGT GAACGCCGCG GCGGCACCAA CAGCCGCCCC GAACATGTCA AGGCGGTTGC CGATGCCTGC CTGCGCCGTC TGAAGACGGA CCACATCGAC CTGTTCTACC AGCACCGCGT CGATCCCGAC GTGCCGATCG AAGACGTGGC CGGTGCCGTC AAGGACCTGA TCGCAGCCGG CAAGGTCAAA CATTTCGGTC TTTCCGAAGC CGGCGTCCAG ACGATCCGCC GCGCCCATGC CGTTCAAAAA GTCACCGCCG TCCAGAGTGA ATATTCGCTC TTCTGGCGCG GCCCCGAGGC GGAACTGCTG CCCACCCTTG AGGAACTCGG CATCGGCTTC GTGCCCTTCA GCCCACTCGG CGCAGGCTTC CTGACCGGCA AGATCGACGA GAACACCAAG TTCGATCCAA GCGATTTCCG CAACAGCGTG CCGCGTTTTT CGCTCGAGGC GCGCAAGGCC AATTTTGCAC TCGTCGACCT GATCAGGCGT ATCGGCGACC GCAAGGGCGC AACGCCCGCG CAAATCGCCC TCTCCTGGCT GCTGGCCCAA AAGCCATGGA TCGTCCCGAT CCCGGGTACA ACGAAGCAGC ACCGGCTGGA AGAAAATCTC GGGGCGATAG ACGTCGACCT GCTGCCCGAG GACCTCGCCG AAATCGATGC CGCCCTCTCC GGCATCGAGG TTCACGGCGA GCGGCTTCCC GAGGCAGCGC TCAAGATGAC CGGCCGATAG
|
Protein sequence | MKRRNIGGLE VSAFGLGCMS MSAAYGPPAA EGDMIKLMRT AHQQGVTLFD TAEAYGPFVN EELVGKALAP IRDQVVIATK FGFDIDQQTG ERRGGTNSRP EHVKAVADAC LRRLKTDHID LFYQHRVDPD VPIEDVAGAV KDLIAAGKVK HFGLSEAGVQ TIRRAHAVQK VTAVQSEYSL FWRGPEAELL PTLEELGIGF VPFSPLGAGF LTGKIDENTK FDPSDFRNSV PRFSLEARKA NFALVDLIRR IGDRKGATPA QIALSWLLAQ KPWIVPIPGT TKQHRLEENL GAIDVDLLPE DLAEIDAALS GIEVHGERLP EAALKMTGR
|
| |