Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0003 |
Symbol | |
ID | 6978712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3481 |
End bp | 4686 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643394714 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002279532 |
Protein GI | 209547615 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.045882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACAATT TCGACACGCC AGGCCGCCCC TCGCGCGATG CGGCGCTGCT GCCCGATACC GGCGAGCCCG GCTTCGGCGT CTATGTGCAT TGGCCCTTCT GCGCGGCGAA GTGTCCCTAT TGCGATTTCA ACAGTCATGT GCGCCATCAG CCGGTCGATC AGGAACGCTT TGCATCAGCC TTCCTGAAGG AGATGGCGGC GGTCCGGGCA TTGAGCGGGC CGAAGACGGT GACGAGCATC TTCCTCGGCG GCGGCACGCC CTCGCTGATG AAACCGGAAA CGGTCTCCGC CATTCTCGAC GGCATTGCCC GGCACTGGCA CGTGCCTGCC GGCATCGAGA TCACCATGGA GGCCAATCCG TCCAGCGTCG AGGCCGAGCG CTTCCGCGGC TACCGGGCGG CCGGCGTCAA CCGCGTCTCG CTCGGCGTGC AGGCGCTTGA TGATCGGGAC CTGAAATTCC TCGGCCGGCT GCATGATGTC GCCGACGCGC TGAAGGCGAT CCGGCTGGCG CGCGACATTT TTCCGCGCAT GTCTTTCGAC CTCATCTATG CCCGGCCGGA CCAGACGGTC GAGCAATGGG AAAGGGAGCT GAAGCAAGCG ATCTCTTACG CGGTCGATCA TCTTTCGCTC TATCAACTCA CCATCGAGGA AGGCACGCCG TTTTATGGCC TGCACAAAGC AGGCAAGCTG ATCGTGCCGG ATGGCGAGCA ATCGGCCGTG CTCTACGAGG CGACGCAGGA AATCACCGCG CGCGAGGGCA TGCCGGCCTA CGAGGTTTCC AACCACGCCC GGCCGGGTGC TGAAAGCCGG CATAACCTGA CCTATTGGCG TTACGGCGAT TATGCCGGTA TCGGCCCTGG CGCGCACGGC CGGCTGACGC GCGGCCCCGA GAAGATCGCG ACGGCGACCG AGCGCAAGCC GGAGTCCTGG CTCGACATGG TCGAGCGCGA CGGCCACGGC ATTCTCGACG AGGAGCGGCT CGGCTATGAG GAACAATCCG ACGAATTGCT GCTGATGGGG CTGCGGCTCC GGGAAGGCGT CGATCTTGCC CGCTGGCAGC AGCTTTCCGG CCGCGACCTC GACCCGAAAC GCGAAGAGTT TCTGCTCGAA CACAAATTCA TCGAGCGGAT CGGCAATTCA CGCCTGCGCT GCACGCCCTC AGGAATGCTG ATCCTCGATT CCGTCGTCGC CGATCTCGCC TGCTGA
|
Protein sequence | MDNFDTPGRP SRDAALLPDT GEPGFGVYVH WPFCAAKCPY CDFNSHVRHQ PVDQERFASA FLKEMAAVRA LSGPKTVTSI FLGGGTPSLM KPETVSAILD GIARHWHVPA GIEITMEANP SSVEAERFRG YRAAGVNRVS LGVQALDDRD LKFLGRLHDV ADALKAIRLA RDIFPRMSFD LIYARPDQTV EQWERELKQA ISYAVDHLSL YQLTIEEGTP FYGLHKAGKL IVPDGEQSAV LYEATQEITA REGMPAYEVS NHARPGAESR HNLTYWRYGD YAGIGPGAHG RLTRGPEKIA TATERKPESW LDMVERDGHG ILDEERLGYE EQSDELLLMG LRLREGVDLA RWQQLSGRDL DPKREEFLLE HKFIERIGNS RLRCTPSGML ILDSVVADLA C
|
| |