Gene Rleg_3474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3474 
Symbol 
ID8014345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3507733 
End bp3508905 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content61% 
IMG OID644826038 
Product4-hydroxybenzoate 3-monooxygenase 
Protein accessionYP_002977259 
Protein GI241206163 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR02360] 4-hydroxybenzoate 3-monooxygenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.119469 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGAACTC AGGTCGCCAT CATCGGTTCG GGACCATCCG GCCTGCTGCT CGGCCAGCTT 
CTGACCGAAG CCGGCGTCGA CAATGTCATT CTCGATCGTG TGAACAAGGA TTACATCCTC
AGCCGGGTTC GCGCCGGCGT TCTGGAGGAA GGCACCGTCG GGCTGCTGGA TCAGGCCAAA
TCAGGCGCGA GGCTGCATTC CGAAGGCCTG CCGCATGACG GCTTCTCACT GACCTTCGAC
GGACGCGACC ATCGCATCGA CCTTCACGAA TTGACCGGCG GCAGGCGTGT CACCGTCTAC
GGACAGACCG AAGTGACGCG CGATCTCATG GAGCGGCGCG AAGAAAGCGG CTCCCCGTCG
ATCTACGATG CCGTCGATGT CGCGCCGCAT GACTTCGACG GCCATTCTCC TTTCGTCACC
TATGTGAAAG ACGGCGTTGC CAAGCGCATC GATTGTGACT TCATCGCCGG CTGCGACGGG
TTTCACGGCG CCAGCCGCAA GACCGTTCCG GAGCGGGCGA TCAGGAGTTT CGAGAAGGTC
TATCCCTTCG GCTGGCTGGG GGTCCTTGCC GACGTGGCGC CTGTCAGCCA TGAGCTGATC
TACGCCAACC ATCCAAGGGG CTTTGCGCTT TGTTCGATGC GCTCGGCCAC CCGCAGCCGC
TACTACATCC AATGTGCGCT CGACGAGAAG ATCGGGGACT GGAGCGACGA CCGTTTCTGG
GACGAGTTGA GACGGCGGCT GCCGACGCAT CATGCCGAAG CATTGGCGAC CGCGCCGTCC
TTCGAGAAAT CGATTGCGCC GCTGCGCTCC TTCGTCGCCG AACCGATGCG TTTCGGCCGG
CTTTTCCTGG TCGGCGACGC CGCCCATATC GTCCCGCCGA CCGGCGCCAA GGGATTGAAC
CTCGCCGCCA GCGACGTCCA TTATCTTTTC TCCGGGCTGA TCGAGCATTA CCGTGAAGGC
TCGAATAGTG GCATCGACGC TTACTCGCAG AAGGCGCTCG CGCGTGTATG GAAAGCCGTG
CGGTTTTCCT GGTGGATGAC GACGATGATG CATCGTTTTC CGGATACCGG TGATTTCGAC
CAGAAGATCC AGGAGGCGGA ACTCGACTAT CTCACCCATT CCCGCGCCGC CTCGACAGCG
CTCGCGGAGA ATTATGTGGG ATTGCCATTC TGA
 
Protein sequence
MRTQVAIIGS GPSGLLLGQL LTEAGVDNVI LDRVNKDYIL SRVRAGVLEE GTVGLLDQAK 
SGARLHSEGL PHDGFSLTFD GRDHRIDLHE LTGGRRVTVY GQTEVTRDLM ERREESGSPS
IYDAVDVAPH DFDGHSPFVT YVKDGVAKRI DCDFIAGCDG FHGASRKTVP ERAIRSFEKV
YPFGWLGVLA DVAPVSHELI YANHPRGFAL CSMRSATRSR YYIQCALDEK IGDWSDDRFW
DELRRRLPTH HAEALATAPS FEKSIAPLRS FVAEPMRFGR LFLVGDAAHI VPPTGAKGLN
LAASDVHYLF SGLIEHYREG SNSGIDAYSQ KALARVWKAV RFSWWMTTMM HRFPDTGDFD
QKIQEAELDY LTHSRAASTA LAENYVGLPF