Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5073 |
Symbol | |
ID | 8007666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 456610 |
End bp | 457770 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644821988 |
Product | Salicylate 1-monooxygenase |
Protein accession | YP_002973248 |
Protein GI | 241113413 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.926874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.172716 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGAA GTAAACCGAA AATCGCGATC GTTGGTGCCG GCATGGGTGG TCTCGCCGCC GCGGCGACCC TTCGCCAGGT CGGTATCGAC GTGAATGTCT ACGAGCAGGC ACCGAAATTT GCCCGCATCG GCGCCGGCAT CCAGATGCTG CCGAATTCGT CGCGCGTCCT GCGCGGTATC GGCGTTCTCG ACAGGCTTCA GAAACTTGCG TTCGAGCCCT ATTCTCATCT CAACCGCGTC TGGGATACCG GTGAGATCAA GCGCGAGCTT CCGATGCCGG AAAGCCTTTA CGGCGCGCCC TTTCTCTGCA TGCACCGGGC CGACCTGCAT GAAGCGCTTT ATTCCGTGCT GCCGCCGGAG ATCGTTCACC TCGGCAAGAA GCTCGTCGGC CTGGATCAGA CGAAGGGCGG CGTGACGCTC TCTTTCGCCG ACGGCACGAA GGCGGATGCC GATGCGGTGA TCGGCGCTGA TGGCGTGCAT TCGCTGGTTC GCGACATCGT CGTCGGCCCT GACAAACCGA TCCACAAGGG CCGGATCGCC TACCGCGCGG TCTTCGACGC GAGCCTGATG AACGGCGGCG AGATCCAGGC GTCCAGAACG AAGTGGTGGG GTGTCGATCG CCACATCGTC ATCTACTACA CCGCCGCAGA CCGCAGCTCG CTCTACTTCG TCACCAGCGT GCCTGAGCCT GCTGACTGGC TGACCTCGGA ATCCTGGTCC GCCAAGGGCG ACGTGAAGGA ATTGCGCACC GCCTATGAAG GCTTCCATCC GGAAGTGCAG ATGGTTCTGA ATGCATGCCC GGACTGTCAC AAGTGGGCAA TCCTCGAACG TGAACCTCTG GCGCGCTGGA GCGACGGACG CGTGGTGCTT CTCGGCGACG CCTGCCACCC GATGACGCCC TATATGGCGC AAGGAGCTGC GACCTCGATC GAGGACGCGG CAGTGCTGGC GCGGTGCCTT GCCGGCGTCG ACAATGACGA CATCGAAGGC GCGTTCCGCC GCTACGAGGC AAACCGCAAG CCGCGCACCT CACGCATCCA GGCGATTTCG AGCGCCAATA CCTGGATGTC GGGGGGCAAC GAAGACACCT CCTGGCTCTA TGGCTACGAT GCGTGGAACG TGCCGCTCGT GGGCGAAAAC GATATGGCGC TTGCCGGATA A
|
Protein sequence | MAGSKPKIAI VGAGMGGLAA AATLRQVGID VNVYEQAPKF ARIGAGIQML PNSSRVLRGI GVLDRLQKLA FEPYSHLNRV WDTGEIKREL PMPESLYGAP FLCMHRADLH EALYSVLPPE IVHLGKKLVG LDQTKGGVTL SFADGTKADA DAVIGADGVH SLVRDIVVGP DKPIHKGRIA YRAVFDASLM NGGEIQASRT KWWGVDRHIV IYYTAADRSS LYFVTSVPEP ADWLTSESWS AKGDVKELRT AYEGFHPEVQ MVLNACPDCH KWAILEREPL ARWSDGRVVL LGDACHPMTP YMAQGAATSI EDAAVLARCL AGVDNDDIEG AFRRYEANRK PRTSRIQAIS SANTWMSGGN EDTSWLYGYD AWNVPLVGEN DMALAG
|
| |