Gene Rleg_5073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5073 
Symbol 
ID8007666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp456610 
End bp457770 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content63% 
IMG OID644821988 
ProductSalicylate 1-monooxygenase 
Protein accessionYP_002973248 
Protein GI241113413 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.926874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.172716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAA GTAAACCGAA AATCGCGATC GTTGGTGCCG GCATGGGTGG TCTCGCCGCC 
GCGGCGACCC TTCGCCAGGT CGGTATCGAC GTGAATGTCT ACGAGCAGGC ACCGAAATTT
GCCCGCATCG GCGCCGGCAT CCAGATGCTG CCGAATTCGT CGCGCGTCCT GCGCGGTATC
GGCGTTCTCG ACAGGCTTCA GAAACTTGCG TTCGAGCCCT ATTCTCATCT CAACCGCGTC
TGGGATACCG GTGAGATCAA GCGCGAGCTT CCGATGCCGG AAAGCCTTTA CGGCGCGCCC
TTTCTCTGCA TGCACCGGGC CGACCTGCAT GAAGCGCTTT ATTCCGTGCT GCCGCCGGAG
ATCGTTCACC TCGGCAAGAA GCTCGTCGGC CTGGATCAGA CGAAGGGCGG CGTGACGCTC
TCTTTCGCCG ACGGCACGAA GGCGGATGCC GATGCGGTGA TCGGCGCTGA TGGCGTGCAT
TCGCTGGTTC GCGACATCGT CGTCGGCCCT GACAAACCGA TCCACAAGGG CCGGATCGCC
TACCGCGCGG TCTTCGACGC GAGCCTGATG AACGGCGGCG AGATCCAGGC GTCCAGAACG
AAGTGGTGGG GTGTCGATCG CCACATCGTC ATCTACTACA CCGCCGCAGA CCGCAGCTCG
CTCTACTTCG TCACCAGCGT GCCTGAGCCT GCTGACTGGC TGACCTCGGA ATCCTGGTCC
GCCAAGGGCG ACGTGAAGGA ATTGCGCACC GCCTATGAAG GCTTCCATCC GGAAGTGCAG
ATGGTTCTGA ATGCATGCCC GGACTGTCAC AAGTGGGCAA TCCTCGAACG TGAACCTCTG
GCGCGCTGGA GCGACGGACG CGTGGTGCTT CTCGGCGACG CCTGCCACCC GATGACGCCC
TATATGGCGC AAGGAGCTGC GACCTCGATC GAGGACGCGG CAGTGCTGGC GCGGTGCCTT
GCCGGCGTCG ACAATGACGA CATCGAAGGC GCGTTCCGCC GCTACGAGGC AAACCGCAAG
CCGCGCACCT CACGCATCCA GGCGATTTCG AGCGCCAATA CCTGGATGTC GGGGGGCAAC
GAAGACACCT CCTGGCTCTA TGGCTACGAT GCGTGGAACG TGCCGCTCGT GGGCGAAAAC
GATATGGCGC TTGCCGGATA A
 
Protein sequence
MAGSKPKIAI VGAGMGGLAA AATLRQVGID VNVYEQAPKF ARIGAGIQML PNSSRVLRGI 
GVLDRLQKLA FEPYSHLNRV WDTGEIKREL PMPESLYGAP FLCMHRADLH EALYSVLPPE
IVHLGKKLVG LDQTKGGVTL SFADGTKADA DAVIGADGVH SLVRDIVVGP DKPIHKGRIA
YRAVFDASLM NGGEIQASRT KWWGVDRHIV IYYTAADRSS LYFVTSVPEP ADWLTSESWS
AKGDVKELRT AYEGFHPEVQ MVLNACPDCH KWAILEREPL ARWSDGRVVL LGDACHPMTP
YMAQGAATSI EDAAVLARCL AGVDNDDIEG AFRRYEANRK PRTSRIQAIS SANTWMSGGN
EDTSWLYGYD AWNVPLVGEN DMALAG