Gene Information |
Locus tag | Rleg2_2484 |
Symbol | |
ID | 6981226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2517310 |
End bp | 2518368 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643397199 |
Product | aldo/keto reductase |
Protein accession | YP_002281984 |
Protein GI | 209550067 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
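The coordinate, length, and protein-length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon. Neither assumption is stated in the record, but both agree with the values it reports. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2517310
end_bp = 2518368

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1059 0 352
```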
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.797379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.441954 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATATC GTTCGCTCGG CCGTTCCGGC CTGAAGATCT CCACATTGAC CATGGGCACC ATGACCTTCG GCGGCGTCGG ATGGGCCAAG ACCGTCGGGG ACCTCGGCGT CTCCGAGGCG CGGAAGATGA TCGATATCTG CATCGATGCC GGCATCAACC TCATCGACAC CGCCAATGCC TATTCCAACG GCGAATGCGA AAACATCATC GGCGACGTGC TCTCCGGCAA GCGGCCGCAA GGCGTGCTGC TTGCCACCAA GGCCCGTTTC GGCATGGGTG ACGGCCCGAA CGACCAAGGA CTGTCGCGCT ACCACCTGAT CCGGGAATGC GAAGCGAGCC TCAAACGCCT GAAGACCGAT GTCATCGACC TGTACCAGGT GCATGAATGG GACGGCCAGA CGCCGCTCGA AGAGACGATG GAAGCCCTCG ACACGCTGAT CAAGCAGGGC AAGGTGCGTT ACGTCGGCTG CTCGAACTAT TCCGGCTGGC ACATCATGAA GGCGCTTGGC ATCGCCAACG AGCACCGCTA CCAGCGCTTC ATCAGCCAGC AGATCCACTA TACGCTGGAA GCGCGCGACG CCGAATACGA GCTGCTGCCG ATTTCGATCG ACCAGGGCCT CGGCGTGCTG GTCTGGAGCC CGCTTGCCGG CGGCCTGCTC TCCGGCAAAC ACCGCCGCAA CCAAGCCGCT CCCGAAGGCA CGCGGCAGTT CGCCGGCTGG ACCGAACCGC CGATCCGTGA CGAGAACCGT CTCTGGAACA TCGTCGAGAC GCTGGTCGCG ATCGGGCAGG AGCGCGGCGT CTCGGCCGCA CAAGTGGCGC TCGCCTGGCT GATCGGCCGC AAGGCGGTCA CCTCGGTCAT CATCGGCGGG CGCACAGAAG CCCAGTTCCG CGACAACATC GCCGCCGCCG AACTGAAGCT CACGAACGAG GAGCGCAAGC GCCTGGATGC CGTCAGCCTG CCGCCGGTGA TCTATCCCTA TTGGCACCAG CTGAACACGG TGAGCGACAG GCTTGGTGAA GCCGATCTCG AGCTGTTCGG CCCGCATCTT CAGCAATAG
|
Protein sequence | MEYRSLGRSG LKISTLTMGT MTFGGVGWAK TVGDLGVSEA RKMIDICIDA GINLIDTANA YSNGECENII GDVLSGKRPQ GVLLATKARF GMGDGPNDQG LSRYHLIREC EASLKRLKTD VIDLYQVHEW DGQTPLEETM EALDTLIKQG KVRYVGCSNY SGWHIMKALG IANEHRYQRF ISQQIHYTLE ARDAEYELLP ISIDQGLGVL VWSPLAGGLL SGKHRRNQAA PEGTRQFAGW TEPPIRDENR LWNIVETLVA IGQERGVSAA QVALAWLIGR KAVTSVIIGG RTEAQFRDNI AAAELKLTNE ERKRLDAVSL PPVIYPYWHQ LNTVSDRLGE ADLELFGPHL QQ
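The gene and protein sequences can be cross-checked by translating the nucleotide sequence and by recomputing GC content. The following is a minimal sketch, not the database's own method. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as the standard code for sense and stop codons, and it ignores the table's alternative start codons because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGGAATATC GTTCGCTCGG CCGTTCCGGC"))  # MEYRSLGRSG
print(translate("CAGCAATAG"))                          # QQ
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the reported GC content.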