Gene Rleg2_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0003 
Symbol 
ID6978712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3481 
End bp4686 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content64% 
IMG OID643394714 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_002279532 
Protein GI209547615 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.045882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACAATT TCGACACGCC AGGCCGCCCC TCGCGCGATG CGGCGCTGCT GCCCGATACC 
GGCGAGCCCG GCTTCGGCGT CTATGTGCAT TGGCCCTTCT GCGCGGCGAA GTGTCCCTAT
TGCGATTTCA ACAGTCATGT GCGCCATCAG CCGGTCGATC AGGAACGCTT TGCATCAGCC
TTCCTGAAGG AGATGGCGGC GGTCCGGGCA TTGAGCGGGC CGAAGACGGT GACGAGCATC
TTCCTCGGCG GCGGCACGCC CTCGCTGATG AAACCGGAAA CGGTCTCCGC CATTCTCGAC
GGCATTGCCC GGCACTGGCA CGTGCCTGCC GGCATCGAGA TCACCATGGA GGCCAATCCG
TCCAGCGTCG AGGCCGAGCG CTTCCGCGGC TACCGGGCGG CCGGCGTCAA CCGCGTCTCG
CTCGGCGTGC AGGCGCTTGA TGATCGGGAC CTGAAATTCC TCGGCCGGCT GCATGATGTC
GCCGACGCGC TGAAGGCGAT CCGGCTGGCG CGCGACATTT TTCCGCGCAT GTCTTTCGAC
CTCATCTATG CCCGGCCGGA CCAGACGGTC GAGCAATGGG AAAGGGAGCT GAAGCAAGCG
ATCTCTTACG CGGTCGATCA TCTTTCGCTC TATCAACTCA CCATCGAGGA AGGCACGCCG
TTTTATGGCC TGCACAAAGC AGGCAAGCTG ATCGTGCCGG ATGGCGAGCA ATCGGCCGTG
CTCTACGAGG CGACGCAGGA AATCACCGCG CGCGAGGGCA TGCCGGCCTA CGAGGTTTCC
AACCACGCCC GGCCGGGTGC TGAAAGCCGG CATAACCTGA CCTATTGGCG TTACGGCGAT
TATGCCGGTA TCGGCCCTGG CGCGCACGGC CGGCTGACGC GCGGCCCCGA GAAGATCGCG
ACGGCGACCG AGCGCAAGCC GGAGTCCTGG CTCGACATGG TCGAGCGCGA CGGCCACGGC
ATTCTCGACG AGGAGCGGCT CGGCTATGAG GAACAATCCG ACGAATTGCT GCTGATGGGG
CTGCGGCTCC GGGAAGGCGT CGATCTTGCC CGCTGGCAGC AGCTTTCCGG CCGCGACCTC
GACCCGAAAC GCGAAGAGTT TCTGCTCGAA CACAAATTCA TCGAGCGGAT CGGCAATTCA
CGCCTGCGCT GCACGCCCTC AGGAATGCTG ATCCTCGATT CCGTCGTCGC CGATCTCGCC
TGCTGA
 
Protein sequence
MDNFDTPGRP SRDAALLPDT GEPGFGVYVH WPFCAAKCPY CDFNSHVRHQ PVDQERFASA 
FLKEMAAVRA LSGPKTVTSI FLGGGTPSLM KPETVSAILD GIARHWHVPA GIEITMEANP
SSVEAERFRG YRAAGVNRVS LGVQALDDRD LKFLGRLHDV ADALKAIRLA RDIFPRMSFD
LIYARPDQTV EQWERELKQA ISYAVDHLSL YQLTIEEGTP FYGLHKAGKL IVPDGEQSAV
LYEATQEITA REGMPAYEVS NHARPGAESR HNLTYWRYGD YAGIGPGAHG RLTRGPEKIA
TATERKPESW LDMVERDGHG ILDEERLGYE EQSDELLLMG LRLREGVDLA RWQQLSGRDL
DPKREEFLLE HKFIERIGNS RLRCTPSGML ILDSVVADLA C