Gene Information |
Locus tag | Dshi_0541 |
Symbol | hemN1 |
ID | 5711994 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 526456 |
End bp | 527820 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641266443 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_001531888 |
Protein GI | 159043094 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00538] oxygen-independent coproporphyrinogen III oxidase |
|
|
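The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Coordinates copied from the Gene Information table (assumed 1-based, inclusive).
START_BP = 526456
END_BP = 527820


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1365, matching "Gene Length"
print(protein_length(length_bp))  # 454, matching "Protein Length"
```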
Plasmid Coverage Information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAACA AGACCGTGGA TGAAATCGTC GAGAAATATG CGCGCGTGGC GACCCCGCGC TACACCAGCT ACCCCACGGC GCCGCATTTC GAGCCCGCGT TTCCCGAGGT GACCTATCGC GGATGGCTGA GCGCGCTCGA TGCGTCGGAG CCGATCTCTC TTTATGTCCA CATCCCCTTC TGTCGCGAAA TGTGCTGGTA TTGCGGCTGC AACATGAAAC TCGTCAAGCG CGAAGGCCCG CTTGCCGAAT ATGTCGAGAC GCTGCTGAAG GAAATCGCTC TTGTGCGGGC GGCGATGCCG GGGCGCGTAC CGGTGGCGCA TCTGCATTGG GGCGGTGGGA CGCCAACCGC ATTGTCGCCG GACCAGATTG CCCGGATCAT GGACGCCTTG CGGGCGTCCT TCGACATCCT GCCGGATGCC GAGATCGCCA TCGAGAGCGA CCCGCGTACC CTGACGGAGC CGATGGTGGC GCGGCTGGCA GAACTCGGTT TCAATCGGGC CAGTTTCGGT GTGCAGGAAT TCGACCCGAA GGTGCAGCGC GCCATCAACC GTATCCAGCC GCCGGAAATG GTTGCAAAGG CGGTCGGCAT GTTCCGGCGG CACGGCATCG CGGCAGTCAA TTTCGATCTG ATCTACGGGC TCCCGTACCA GAGTGTCGAG ACGCTGCTGC ACACGGTGGA CCTGGTTTCC GAAATGGGCC CGGACCGGAT TGCGCTCTTC GGGTACGCCC ATGTCCCCTG GGTCGCCAAG GCGCAGCGGA TGATCCCGGA GGAGTCCTTG CCGGACGCAC GGGCGCGGGC GGCGCAAGCG TCCGCTGCGG CGCGTGCGCT GACCGAGGCC GGGTATCGCG CCATCGGCAT CGATCATTTT GCCAAGCCCG AAGACGGGCT CGCCCGGGCG CAGGCCGAAG GGCGGCTCTA TCGCAATTTC CAAGGCTACA CGGACGATCC TGCGGCCACC CTGGTTGGTC TGGGCGCGAC CTCTATCGGG CGGACGCCGC AGGGCTATAT CCAGAACCAA CCTGAGACCC GCGCTTGGGC GCGCGCCATC GAAAACGGTG CGCTACCGGT TGCCAAGGGT CGCGCCTTCG CAGGCGAGGA CCTGATGCGT GCCGCGGTGA TCGAACGGAT CATGTGCGAC GGTTACGTGG ACCCGGATGC TATTGGTGCG CGCTACGGGG TGCCTGCCGG ATGGTGGGGA CCCGAGCGTG ATTCACTGCG GGATATGGAG CGGGATGGAC TTGTCCAATG CGGCGCGACG GGCCTGCGCG TGACCTCCAA AGGCGCCCCG CTGGCACGGG TCGTGGCAGC GGCATTCGAT AGCTATTTCG CTGCTTCCAA GGCCCGCCAT TCCGTGGCGT TGTAA
|
Protein sequence | MRNKTVDEIV EKYARVATPR YTSYPTAPHF EPAFPEVTYR GWLSALDASE PISLYVHIPF CREMCWYCGC NMKLVKREGP LAEYVETLLK EIALVRAAMP GRVPVAHLHW GGGTPTALSP DQIARIMDAL RASFDILPDA EIAIESDPRT LTEPMVARLA ELGFNRASFG VQEFDPKVQR AINRIQPPEM VAKAVGMFRR HGIAAVNFDL IYGLPYQSVE TLLHTVDLVS EMGPDRIALF GYAHVPWVAK AQRMIPEESL PDARARAAQA SAAARALTEA GYRAIGIDHF AKPEDGLARA QAEGRLYRNF QGYTDDPAAT LVGLGATSIG RTPQGYIQNQ PETRAWARAI ENGALPVAKG RAFAGEDLMR AAVIERIMCD GYVDPDAIGA RYGVPAGWWG PERDSLRDME RDGLVQCGAT GLRVTSKGAP LARVVAAAFD SYFAASKARH SVAL
|
| |
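The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be normalized and checked against the record's fields (GC content, start codon, stop codon). The helper names are hypothetical, and the stop-codon set is the standard one for translation table 11 (TAA, TAG, TGA). Only the first three blocks of the gene sequence are used here as a short demonstration; pasting the full "Gene sequence" value into `clean_sequence` works the same way.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends in a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First three ten-base blocks of the gene sequence above (demonstration only).
prefix = clean_sequence("ATGAGGAACA AGACCGTGGA TGAAATCGTC")
print(len(prefix))         # 30
print(gc_percent(prefix))  # GC percentage of this prefix only, not of the whole gene
```

Applied to the full gene sequence, `gc_percent` can be compared with the "GC content" field and `looks_like_complete_cds` with the "Is pseudogene" field.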