Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3544 |
Symbol | |
ID | 6982304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3674869 |
End bp | 3675849 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643398268 |
Product | aldo/keto reductase |
Protein accession | YP_002283037 |
Protein GI | 209551120 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.889794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATATG CAAAATTTGG GAAGACCGGC CTCGAAGTCT CGAAAATCTG CCTCGGCTGC ATGACTTTCG GCGATCCCGG CCGCGGCAAT CATACCTGGA GCCTGCGGGA AGAAGAAAGC CGGGCGATGA TTAGGCAAGC GATCGACCTC GGCATCAATT TCCTCGACAC CGCCAACACC TATTCCAACG GCTCCTCGGA GGAGATCGTC GGCCGCGCCA TAAAAGATTT CGCCAAGCGC GAAGACATCG TGCTGGCAAC GAAGGTGTTC AACCGCATGC GGCCGGGCCC GAATGGCGCC GGCCTGTCGC GCAAGGCGAT CTTCGACGAA ATCGACAACA GCCTGCGCCG CCTCGGCACC GACTATGTCG ACCTCTACCA GATCCACCGT TTCGACTATA CGACGCCGAT CGAGGAAACG CTTGAGGCGC TGCACGACGT CGTCAAATCG GGCAAGGCGC GTTATATCGG CGCCTCCTCC ATGTATGCTT GGCAATTTGC CAAGGCGCTC TACGTTTCCA GGCTGAACGG CTGGACAGAA TTCGTCAGCA TGCAGGACCA TCTGAACCTG CTTTACCGCG AGGAAGAGCG CGAAATGCTG CCGCTCTGCG AGGATCAGAA GATCGCCGTC ATCCCCTGGA GCCCGCTTGC CCGCGGCCGC CTGACCCGCG ACTGGGACGA GGCGACGGCG CGCAGCGAAA CCGACGAATT CGGCAAGACG CTTTACACCC AGTCCGTCGA CGCCGACCGC AGAATAGTCG AGGCGGTGGC CGATATCGCC AAGGCCCGCG GCATCTCCCG CGCCCAGGTC GCAACCGCTT GGATCCTGCA GAAGAGCGCC GTGACCGCCC CGATCATCGG CGCTTCCAAG CCGAACCACC TGACCGACGC CGTTGCCTCG CTCTCGGTCA AGCTCACCAC CGAAGAAGTC GCCGCATTGC AAGCACCCTA TATCCCGCAC GCCGTCGCCG GATTCAAGTA G
|
Protein sequence | MEYAKFGKTG LEVSKICLGC MTFGDPGRGN HTWSLREEES RAMIRQAIDL GINFLDTANT YSNGSSEEIV GRAIKDFAKR EDIVLATKVF NRMRPGPNGA GLSRKAIFDE IDNSLRRLGT DYVDLYQIHR FDYTTPIEET LEALHDVVKS GKARYIGASS MYAWQFAKAL YVSRLNGWTE FVSMQDHLNL LYREEEREML PLCEDQKIAV IPWSPLARGR LTRDWDEATA RSETDEFGKT LYTQSVDADR RIVEAVADIA KARGISRAQV ATAWILQKSA VTAPIIGASK PNHLTDAVAS LSVKLTTEEV AALQAPYIPH AVAGFK
|
| |