Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2969 |
Symbol | |
ID | 8013892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2961957 |
End bp | 2962952 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644825539 |
Product | aldo/keto reductase |
Protein accession | YP_002976767 |
Protein GI | 241205671 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.253493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0420926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCT ATCGTTTGGG AAAGACCGGA CCCGATGTCT CGGCCATCGG CCTCGGCTGC ATGGGCATGT CGGGCATGTA CGGCCCCTCC GATCGCGCAG AAAGCATCGC GACGATCCAT GCCGCGCTCG ATGCCGGCAT CAACCTGCTC GACACCGGCG ATTTCTACGG GATGGGCCAT AACGAAATGC TGATCGGCGA GGCATTGAAG GGTCGCAGGC GCGAAAACGC CGTCATCAGC GTCAAATTCG GCGGCCTGCG CGATCCCGTC GGCGGCTGGA GCGGCATCGA CGCCCGCCCG GTCGCGGTGA AGAACTTCCT CAGCTATACG CTGCAGCGCC TCGGCGTCGA CTATATCGAC ATCTACCGCC CCGCTCGCCT CGATCCGAAC GTGCCGATCG AGGACACGGT CGGCGCCATC GCCGACATGG TCAAGGCAGG TTATGTCAGG CATATCGGCC TGTCGGAAGT CGGCGCCGAC ACGATCCGCC GCGCCGCCAC TGTCGCCCCG ATCGTCGACC TGCAGATCGA ATATTCGCTG ATCTCGCGCG GCATTGAGAA GAAGATCCTG CCGACGACAC GCGAACTCGG CATTTCGATC ACCGCCTACG GCGTGCTCTC GCGCGGCCTG ATCAGCGGCC ACTGGCAAAA GGGCCAGGGT GGGACGGCCG GCGATTTTCG CGCCTACAGC CCGCGCTTCC AGGAAGGCAA TATCGAGCAG AACCTCGCTC TGGTGGAGAA GCTGCGCGAG ATCGCCCAGG CAAAGAGCGT CTCGGTCGCC CAGATCGCCA TCGCCTGGGT CGCCGCCAAG GGCAAGGACA TCGTCCCGAT CATCGGCGCC CGCCGCCGCG ACCGGCTGAC CGAGGCGCTC GGCTCACGCG CCATCGACCT GTCGCCAGAA GATTTCGCCA TCATCGAACA CGCCGTGCCG AAAGACGCGG CCGTGGGTGG GCGTTATCCC GAGCATATGC TGCAGCATAT GGACAGCGAG AAGTAA
|
Protein sequence | MQTYRLGKTG PDVSAIGLGC MGMSGMYGPS DRAESIATIH AALDAGINLL DTGDFYGMGH NEMLIGEALK GRRRENAVIS VKFGGLRDPV GGWSGIDARP VAVKNFLSYT LQRLGVDYID IYRPARLDPN VPIEDTVGAI ADMVKAGYVR HIGLSEVGAD TIRRAATVAP IVDLQIEYSL ISRGIEKKIL PTTRELGISI TAYGVLSRGL ISGHWQKGQG GTAGDFRAYS PRFQEGNIEQ NLALVEKLRE IAQAKSVSVA QIAIAWVAAK GKDIVPIIGA RRRDRLTEAL GSRAIDLSPE DFAIIEHAVP KDAAVGGRYP EHMLQHMDSE K
|
| |