Gene Dshi_0541 details


Gene Information

Locus tag: Dshi_0541
Symbol: hemN1
ID: 5711994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 526456
End bp: 527820
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 65%
IMG OID: 641266443
Product: coproporphyrinogen III oxidase
Protein accession: YP_001531888
Protein GI: 159043094
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
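
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 526456
end_bp = 527820
gene_length_bp = 1365
protein_length_aa = 454


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1365
print(expected_protein_length(gene_length_bp))  # 454
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.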


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAACA AGACCGTGGA TGAAATCGTC GAGAAATATG CGCGCGTGGC GACCCCGCGC 
TACACCAGCT ACCCCACGGC GCCGCATTTC GAGCCCGCGT TTCCCGAGGT GACCTATCGC
GGATGGCTGA GCGCGCTCGA TGCGTCGGAG CCGATCTCTC TTTATGTCCA CATCCCCTTC
TGTCGCGAAA TGTGCTGGTA TTGCGGCTGC AACATGAAAC TCGTCAAGCG CGAAGGCCCG
CTTGCCGAAT ATGTCGAGAC GCTGCTGAAG GAAATCGCTC TTGTGCGGGC GGCGATGCCG
GGGCGCGTAC CGGTGGCGCA TCTGCATTGG GGCGGTGGGA CGCCAACCGC ATTGTCGCCG
GACCAGATTG CCCGGATCAT GGACGCCTTG CGGGCGTCCT TCGACATCCT GCCGGATGCC
GAGATCGCCA TCGAGAGCGA CCCGCGTACC CTGACGGAGC CGATGGTGGC GCGGCTGGCA
GAACTCGGTT TCAATCGGGC CAGTTTCGGT GTGCAGGAAT TCGACCCGAA GGTGCAGCGC
GCCATCAACC GTATCCAGCC GCCGGAAATG GTTGCAAAGG CGGTCGGCAT GTTCCGGCGG
CACGGCATCG CGGCAGTCAA TTTCGATCTG ATCTACGGGC TCCCGTACCA GAGTGTCGAG
ACGCTGCTGC ACACGGTGGA CCTGGTTTCC GAAATGGGCC CGGACCGGAT TGCGCTCTTC
GGGTACGCCC ATGTCCCCTG GGTCGCCAAG GCGCAGCGGA TGATCCCGGA GGAGTCCTTG
CCGGACGCAC GGGCGCGGGC GGCGCAAGCG TCCGCTGCGG CGCGTGCGCT GACCGAGGCC
GGGTATCGCG CCATCGGCAT CGATCATTTT GCCAAGCCCG AAGACGGGCT CGCCCGGGCG
CAGGCCGAAG GGCGGCTCTA TCGCAATTTC CAAGGCTACA CGGACGATCC TGCGGCCACC
CTGGTTGGTC TGGGCGCGAC CTCTATCGGG CGGACGCCGC AGGGCTATAT CCAGAACCAA
CCTGAGACCC GCGCTTGGGC GCGCGCCATC GAAAACGGTG CGCTACCGGT TGCCAAGGGT
CGCGCCTTCG CAGGCGAGGA CCTGATGCGT GCCGCGGTGA TCGAACGGAT CATGTGCGAC
GGTTACGTGG ACCCGGATGC TATTGGTGCG CGCTACGGGG TGCCTGCCGG ATGGTGGGGA
CCCGAGCGTG ATTCACTGCG GGATATGGAG CGGGATGGAC TTGTCCAATG CGGCGCGACG
GGCCTGCGCG TGACCTCCAA AGGCGCCCCG CTGGCACGGG TCGTGGCAGC GGCATTCGAT
AGCTATTTCG CTGCTTCCAA GGCCCGCCAT TCCGTGGCGT TGTAA
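
The gene sequence is printed in space-separated blocks of 10 bases, so the spaces and line breaks have to be removed before the sequence is measured. The following is a minimal sketch of how the length and GC content could be recomputed from the pasted text. The helper names are hypothetical, and only a short fragment of the first sequence line is used as input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
fragment = "ATGAGGAACA AGACCGTGGA"
print(len(clean_sequence(fragment)))  # 20
print(gc_content(fragment))           # 50.0
```

Passing the complete gene sequence text to the same functions gives values that can be compared with the Gene Length and GC content fields.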
 
Protein sequence
MRNKTVDEIV EKYARVATPR YTSYPTAPHF EPAFPEVTYR GWLSALDASE PISLYVHIPF 
CREMCWYCGC NMKLVKREGP LAEYVETLLK EIALVRAAMP GRVPVAHLHW GGGTPTALSP
DQIARIMDAL RASFDILPDA EIAIESDPRT LTEPMVARLA ELGFNRASFG VQEFDPKVQR
AINRIQPPEM VAKAVGMFRR HGIAAVNFDL IYGLPYQSVE TLLHTVDLVS EMGPDRIALF
GYAHVPWVAK AQRMIPEESL PDARARAAQA SAAARALTEA GYRAIGIDHF AKPEDGLARA
QAEGRLYRNF QGYTDDPAAT LVGLGATSIG RTPQGYIQNQ PETRAWARAI ENGALPVAKG
RAFAGEDLMR AAVIERIMCD GYVDPDAIGA RYGVPAGWWG PERDSLRDME RDGLVQCGAT
GLRVTSKGAP LARVVAAAFD SYFAASKARH SVAL
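
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a hedged illustration, the sketch below builds a codon table in plain Python and translates the first 30 bases of the gene. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The function name and the choice to stop at the first stop codon are assumptions of this example, not something the record specifies.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGGAACA AGACCGTGGA TGAAATCGTC"))  # MRNKTVDEIV
```

The output matches the first block of the protein sequence, MRNKTVDEIV.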