Gene Rleg_5533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5533 
Symbol 
ID8016424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp119672 
End bp120559 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content59% 
IMG OID644827700 
Productintradiol ring-cleavage dioxygenase 
Protein accessionYP_002978900 
Protein GI241518272 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3485] Protocatechuate 3,4-dioxygenase beta subunit 
TIGRFAM ID[TIGR02439] catechol 1,2-dioxygenase, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.247277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00000648247 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTGATG CACATGAAAA GGGTTTCTTC ACGGAAGAGA ACTCCGTTGA GGTGGTCACG 
AGCCGCAACG CCACCACCAA GGACCAGCGC CTGAAGCGCG TGATGGAGGT CGTGACACGC
AAGCTGCATG AGGCGGTGAA GGAGCTTGAG CCGACGCAGG ACGAATGGAT GGAGGCAATT
CTCTTTCTGA CCCGCACAGG ACATACATGC AACGAATGGC GACAGGAATT TATCCTGCTG
TCGGACGTGC TCGGCGTGTC GATGCTGGTC GACGCCATTA ATAACCGCAA GCCCTCAGGC
GCCTCCGAAA GCACTGTTCT TGGCCCGTTT CACGTTGCCG ACGCGCCGGA ACTGCCGATG
GGCACCAATA TCTGCCTCGA TCACAAGGGC GAGGACATGG TGATCGGCGG CAGCATCCGT
AGCACGGATG GCAGACCGAT TGCCGGCGCT GTCATCGACG TCTGGCAGGC CAACGACGAA
GGCTTCTACG ACGTGCAGCA GAAGGGGATC CAACCAGACT TCAACCTTCG CGGCATCTTT
CGCAGCGGCG CGGATGGCCG CTATTGGTTT CGCGCAGTCA AGCCCAAGTA TTACCCGATC
CCGGACGATG GACCGGTCGG CAAGCTGCTC GGCGCGCTCG GTCGTCACCC CTACAGGCCC
GCTCACCTGC ACTACATCAT CAAGGCCGAC GGCTTCGAGA CGCTCACGAC GCACATCTTT
GATCCGGACG ATCCGTACAT CCACTCCGAC GCAGTCTTTG GCGTGAAGGA GAGCTTGCTT
GCCAAGTTCC AGCAAGTCGA GGATTCGGTA CGCGCTGACG AGCTTGGTTT CTCTGGCAAG
TTTTGGCAGA TAGAGCACGA TTTCGTGCTG GCTCGGCCCG AGGAGTAG
 
Protein sequence
MIDAHEKGFF TEENSVEVVT SRNATTKDQR LKRVMEVVTR KLHEAVKELE PTQDEWMEAI 
LFLTRTGHTC NEWRQEFILL SDVLGVSMLV DAINNRKPSG ASESTVLGPF HVADAPELPM
GTNICLDHKG EDMVIGGSIR STDGRPIAGA VIDVWQANDE GFYDVQQKGI QPDFNLRGIF
RSGADGRYWF RAVKPKYYPI PDDGPVGKLL GALGRHPYRP AHLHYIIKAD GFETLTTHIF
DPDDPYIHSD AVFGVKESLL AKFQQVEDSV RADELGFSGK FWQIEHDFVL ARPEE