Gene Rleg_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0003 
Symbol 
ID8011255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2893 
End bp4095 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content64% 
IMG OID644822594 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_002973854 
Protein GI241202758 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.242877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00414143 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGACAATT TCGACACGCC AAGCACCTCG CGCGACGCGG CTCTGCTGCC CGATACCGGC 
GAGCCGGGCT TCGGCGTCTA TGTGCACTGG CCCTTCTGCG CGGCGAAGTG TCCCTATTGC
GACTTCAACA GCCATGTGCG CCACCAGCCG GTGGATCAGG AGCGCTTTAC ATCAGCCTTC
CTGACGGAGA TGGCGGCGGT CCGGGCGATG AGTGGGCCGA AGACGGTGAC GAGCATCTTC
CTCGGCGGCG GCACGCCCTC GCTGATGAAG CCGGAAGCGG TTTCCGCCAT TCTCGACGGC
ATTGCGCGGC ACTGGCATGT GCCAGATGGC ATCGAGATCA CCATGGAGGC CAATCCTTCG
AGCGTCGAGG CCGAACGCTT CCGCGGCTAC CGGGCAGCCG GCGTCAATCG CGTCTCGCTC
GGCGTGCAGG CGCTGAACGA CCGGGATCTG AAATTCCTCG GCCGGCTGCA TGATGTCGCC
GACGCGCTGA AGGCGATAAG GCTGGCGCGC GATATCTTTC CGCGCATGTC CTTCGACCTG
ATCTATGCCC GGCCCGACCA GACCGTCGAG GAATGGGAAA AGGAATTGAA GGAGGCGATC
TCCTATGCGG TCGACCATCT TTCGCTTTAT CAGCTGACCA TCGAGGAAGG CACGCCCTTC
TACGGCCTGC ACAAGGCCGG CAAGCTGATC GTGCCGGATG GCGAGCAATC GGCAGTGCTC
TACGAGGCGA CGCAAGAGAT CACCGCGCGC GAGGGCATGC CGGCCTATGA GGTTTCCAAT
CATGCCCGGC CGGGGGCTGA AAGCCGGCAT AACCTGACCT ACTGGCGTTA TGGTGATTAT
GCCGGCATCG GCCCGGGCGC CCATGGCCGG CTGACGCGTG GCCCCGAGAA GCTCGCGACG
GCGACCGAGC GCAAGCCGGA AACCTGGCTC GACATGGTCG AGCGTGACGG CCACGGCATT
CTCGACGAGG AGCGGCTCGG CTTCGAGGAA CAGTCCGACG AGCTGCTGCT GATGGGGCTG
CGGCTCAGGG AAGGCGTCGA TCTTGCCCGC TGGCAGCAAC TTTCCGGCCG CGATCTCGAC
CCGAAACGCG AGGAATTCCT GCTCGAACAC AAATTCATCG AGCGGATCGG CAATTCACGG
CTGCGCTGCA CGCCATCCGG CATGCTGATC CTCGATTCCG TCGTCGCCGA TCTCGCTTGC
TGA
 
Protein sequence
MDNFDTPSTS RDAALLPDTG EPGFGVYVHW PFCAAKCPYC DFNSHVRHQP VDQERFTSAF 
LTEMAAVRAM SGPKTVTSIF LGGGTPSLMK PEAVSAILDG IARHWHVPDG IEITMEANPS
SVEAERFRGY RAAGVNRVSL GVQALNDRDL KFLGRLHDVA DALKAIRLAR DIFPRMSFDL
IYARPDQTVE EWEKELKEAI SYAVDHLSLY QLTIEEGTPF YGLHKAGKLI VPDGEQSAVL
YEATQEITAR EGMPAYEVSN HARPGAESRH NLTYWRYGDY AGIGPGAHGR LTRGPEKLAT
ATERKPETWL DMVERDGHGI LDEERLGFEE QSDELLLMGL RLREGVDLAR WQQLSGRDLD
PKREEFLLEH KFIERIGNSR LRCTPSGMLI LDSVVADLAC