Gene Information |
Locus tag | Rleg_0003 |
Symbol | |
ID | 8011255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2893 |
End bp | 4095 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644822594 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002973854 |
Protein GI | 241202758 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
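The coordinate, length, and strand fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database: it assumes the start and end positions are 1-based and inclusive (which matches the listed 1203 bp), that the final codon is a stop codon, and that a minus-strand feature is read as the reverse complement of the replicon. The function names are hypothetical.

```python
# Sketch only: helper names are hypothetical, not a database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def extract_cds(replicon: str, start_bp: int, end_bp: int, strand: str) -> str:
    """Slice a feature out of a replicon and orient it 5'->3' for its strand."""
    region = replicon[start_bp - 1:end_bp].upper()
    if strand == "-":
        # Minus-strand genes are the reverse complement of the reference strand.
        return region.translate(_COMPLEMENT)[::-1]
    return region


print(gene_length(2893, 4095))  # 1203
print(protein_length(1203))     # 400
```

With the replicon sequence for NC_012850 loaded as a string, `extract_cds(replicon, 2893, 4095, "-")` would be expected to return the gene sequence in coding orientation, under the assumptions stated above.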
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.242877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00414143 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGACAATT TCGACACGCC AAGCACCTCG CGCGACGCGG CTCTGCTGCC CGATACCGGC GAGCCGGGCT TCGGCGTCTA TGTGCACTGG CCCTTCTGCG CGGCGAAGTG TCCCTATTGC GACTTCAACA GCCATGTGCG CCACCAGCCG GTGGATCAGG AGCGCTTTAC ATCAGCCTTC CTGACGGAGA TGGCGGCGGT CCGGGCGATG AGTGGGCCGA AGACGGTGAC GAGCATCTTC CTCGGCGGCG GCACGCCCTC GCTGATGAAG CCGGAAGCGG TTTCCGCCAT TCTCGACGGC ATTGCGCGGC ACTGGCATGT GCCAGATGGC ATCGAGATCA CCATGGAGGC CAATCCTTCG AGCGTCGAGG CCGAACGCTT CCGCGGCTAC CGGGCAGCCG GCGTCAATCG CGTCTCGCTC GGCGTGCAGG CGCTGAACGA CCGGGATCTG AAATTCCTCG GCCGGCTGCA TGATGTCGCC GACGCGCTGA AGGCGATAAG GCTGGCGCGC GATATCTTTC CGCGCATGTC CTTCGACCTG ATCTATGCCC GGCCCGACCA GACCGTCGAG GAATGGGAAA AGGAATTGAA GGAGGCGATC TCCTATGCGG TCGACCATCT TTCGCTTTAT CAGCTGACCA TCGAGGAAGG CACGCCCTTC TACGGCCTGC ACAAGGCCGG CAAGCTGATC GTGCCGGATG GCGAGCAATC GGCAGTGCTC TACGAGGCGA CGCAAGAGAT CACCGCGCGC GAGGGCATGC CGGCCTATGA GGTTTCCAAT CATGCCCGGC CGGGGGCTGA AAGCCGGCAT AACCTGACCT ACTGGCGTTA TGGTGATTAT GCCGGCATCG GCCCGGGCGC CCATGGCCGG CTGACGCGTG GCCCCGAGAA GCTCGCGACG GCGACCGAGC GCAAGCCGGA AACCTGGCTC GACATGGTCG AGCGTGACGG CCACGGCATT CTCGACGAGG AGCGGCTCGG CTTCGAGGAA CAGTCCGACG AGCTGCTGCT GATGGGGCTG CGGCTCAGGG AAGGCGTCGA TCTTGCCCGC TGGCAGCAAC TTTCCGGCCG CGATCTCGAC CCGAAACGCG AGGAATTCCT GCTCGAACAC AAATTCATCG AGCGGATCGG CAATTCACGG CTGCGCTGCA CGCCATCCGG CATGCTGATC CTCGATTCCG TCGTCGCCGA TCTCGCTTGC TGA
|
Protein sequence | MDNFDTPSTS RDAALLPDTG EPGFGVYVHW PFCAAKCPYC DFNSHVRHQP VDQERFTSAF LTEMAAVRAM SGPKTVTSIF LGGGTPSLMK PEAVSAILDG IARHWHVPDG IEITMEANPS SVEAERFRGY RAAGVNRVSL GVQALNDRDL KFLGRLHDVA DALKAIRLAR DIFPRMSFDL IYARPDQTVE EWEKELKEAI SYAVDHLSLY QLTIEEGTPF YGLHKAGKLI VPDGEQSAVL YEATQEITAR EGMPAYEVSN HARPGAESRH NLTYWRYGDY AGIGPGAHGR LTRGPEKLAT ATERKPETWL DMVERDGHGI LDEERLGFEE QSDELLLMGL RLREGVDLAR WQQLSGRDLD PKREEFLLEH KFIERIGNSR LRCTPSGMLI LDSVVADLAC
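The relationship between the gene sequence and the protein sequence can be checked with a short sketch like the one below. It builds the codon-to-amino-acid mapping used by translation table 11 (the amino-acid assignments are the same as the standard code), strips the 10-base grouping spaces, and reads the initiator codon as methionine, which is why a gene starting with GTG yields a protein starting with M. The set of start codons here is a simplification limited to the common bacterial initiators, and all names are illustrative rather than taken from any library.

```python
# Sketch only: names are illustrative; START_CODONS is a simplified subset.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # common initiators; others are rarer


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, dropping the terminal stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # the initiator codon is read as methionine
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First six codons of the gene sequence above, followed by its TGA stop codon.
print(translate_cds("GTGGACAATT TCGACACG TGA"))  # MDNFDT
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after removing its grouping spaces) is one way to confirm that the two fields agree. `gc_content` can be compared against the GC content field in the same way.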