Gene Rleg_1511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1511 
Symbol 
ID8012595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1492829 
End bp1494190 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content60% 
IMG OID644824099 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_002975341 
Protein GI241204245 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000621512 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.470006 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGA CATCGATCCA GACGTCGGAT GGTGAAGCCG CGACAGCAAA CACGCTGAAA 
TATATGCCGG GGTTCGGCAA TGACTTCGAA ACCGAGTCGC TTCCCGGCGC CTTGCCGCAA
GGCCAGAACA GTCCGCAGAA ATGCAACTAT GGTCTCTATG CGGAGCAGCT TTCCGGCTCG
CCGTTCACCG CGCCGCGCGG GACCAACGAA AGGTCCTGGC TTTACCGCAT CCGCCCGAGC
GTGCGTCACA CCCGTCGCTT CTCCAACGCG TCCTATCCGC TCTGGAAAAC CGCACCTTGC
CTGGACGAAC ATTCGCTTCC TCTCGGCCAG CTTCGCTGGG ATCCCATCCC CGCACCCTCG
GAGAAGCTGA CGTTTCTCGA GGGGGTGCGG ACCATCACCA CGGCAGGCGA TGCCACCACC
CAGGTGGGCA TGTCAGCCCA TGCCTATGTC TTCAATGAGG ACATGGTCGA CGATTACTTC
TTCAACGCCG ATGGTGAATT GCTGATCGTG CCGCAGCTCG GCGCCATCAG AGTGTTCACC
GAAATGGGCA TCATGGATGT CGAGCCCCTG GAAATATGCC TGATCCCGCG CGGCATGATG
TTCAAGATCA TGAGGGGTGG CGACCAGACG GTCTGGCGTG GCTACATCTG CGAGAACTAC
GGCGCGAAAT TCACCCTGCC GGACCGCGGA CCGATCGGCG CCAACTGCCT GGCAAACCCG
CGTGACTTCA AGACGCCTGT CGCCGCATTC GAGGATAAGG AAACGCCGTG CCGCGTGCAT
GTGAAGTGGT GCGGAAAATT CTATGTCACC GACATCGGCC ATTCGCCGCT GGATGTGGTG
GCCTGGCACG GCAACTACGC CCCGTTCAAA TACGACTTGC GGACGTTCTC GCCGGTCGGC
GCTATCCGCT TCGATCATCC CGATCCGTCG ATCTTTTCGG TGCTGACCGC GCCGACCGAA
GATGCGGGTA CGGCGAACGT CGATTTCGTG ATCTTTCCGC CGCGCTGGCT GGTCGCCGAA
CATACGTTTC GACCGCCTTG GTACCACCGC AACATCATGA GCGAATTCAT GGGCCTGATC
CATGGCCAAT ATGACGCCAA GGAGGAGGGC TTCGTGCCGG GCGGCATGAG CCTGCACAAC
ATGATGCTTC CCCACGGGCC GGACGCGCTC GCCTTCGAAA AGGCATCCAA TACCGAGCTC
AAACCCGTGA AGCTCGATCA CACCATGGCC TTCATGTTCG AGACCCGGTA CCCGCAGCAA
CTGACGAAAT ACGCAGCCGA GCTCGAAACG CTGCAGGATA ATTACCTGGA ATGCTGGGAC
GGCCTGGAAC GCAAGTTCGA CGGAACCCCC GGCATCAAGT GA
 
Protein sequence
MDQTSIQTSD GEAATANTLK YMPGFGNDFE TESLPGALPQ GQNSPQKCNY GLYAEQLSGS 
PFTAPRGTNE RSWLYRIRPS VRHTRRFSNA SYPLWKTAPC LDEHSLPLGQ LRWDPIPAPS
EKLTFLEGVR TITTAGDATT QVGMSAHAYV FNEDMVDDYF FNADGELLIV PQLGAIRVFT
EMGIMDVEPL EICLIPRGMM FKIMRGGDQT VWRGYICENY GAKFTLPDRG PIGANCLANP
RDFKTPVAAF EDKETPCRVH VKWCGKFYVT DIGHSPLDVV AWHGNYAPFK YDLRTFSPVG
AIRFDHPDPS IFSVLTAPTE DAGTANVDFV IFPPRWLVAE HTFRPPWYHR NIMSEFMGLI
HGQYDAKEEG FVPGGMSLHN MMLPHGPDAL AFEKASNTEL KPVKLDHTMA FMFETRYPQQ
LTKYAAELET LQDNYLECWD GLERKFDGTP GIK