Gene Dshi_0659 details


Gene Information

Locus tag: Dshi_0659
Symbol: hemN2
ID: 5711495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 662948
End bp: 664309
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 67%
IMG OID: 641266568
Product: coproporphyrinogen III oxidase
Protein accession: YP_001532006
Protein GI: 159043212
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
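
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the assumption reproduces the listed gene length) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 662948
end_bp = 664309
protein_length_aa = 453

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not add an amino acid to the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1362
print(gene_length_bp % 3)       # 0, i.e. a whole number of codons
print(expected_protein_length)  # 453
```

Under these assumptions the gene length and protein length reported in the record agree with each other.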


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCCA AAACGCAACT TGCGCGCTAC GGGCTTTTTG ACACTCGCGT TCCACGGTAC 
ACGAGCTATC CCACGGCGAC CCATTTCTCG CAGGCCACCC GGCCAAGGGA TTTCACGGCC
TGGATCCAGG CGATCCCCCC GGGGAGCGAG ATTTCCCTCT ACGCGCATGT GCCGTTTTGT
CGCCGCCTGT GCTGGTTCTG CGCCTGCCGC ACCCAAGGCA CCCAGAGCGA CGCGCCGGTG
CGCGCCTATG TCGATACCCT GCTGGCTGAA ATCGCCCTGC TGCGCGCGGC GCTGCCCGAA
GGTGTCCGCC TGTCGCGCAT CCATTGGGGG GGCGGCACAC CGACCCTGCT CGCGCCGGAC
CTGGTGACGC GGCTGGCCGA GGCGATGTTC GCCCTGGCGC CCACCACCGA CCGGGCGGAA
TTCTCGGTCG AGATCGACCC GAACGAGATC GACGCCGCCC GCCTCGACGC GCTGGCCGCC
GCGGGCATGA ACCGCGCCTC GATCGGGGTG CAGGATTTCG ACCCGGACAT CCAGAAGGCG
ATCGGACGCG AACAGCGATT CGAAGTGACC GAGGCGGTGG TGATGGACCT GCGCGACCGC
GGCATCCGAA GCCTGAACAC CGACATCCTT TATGGCCTGC CCTTCCAGAC ACCGGTGAAG
ATCACCGAGT CGGTACAGAA GCTGCTGTCG CTGCAACCGG ACCGGGTCGC CCTCTATGGC
TACGCCCATG TGCCCTGGAT GGCCCGGCGC CAGAACCTGA TCCCGAACGA GGCGCTGCCC
ACCCCGGAGG CCCGGCTCGA CCTGTTCGAA ACCGCCCGCC GCCTGTTCCG CTGGGACAAC
TACGCCGAGA TCGGTATCGA CCATTTCGCG CACCAGGGAG ACGGGCTCGC CGTGGCCGCC
GCCGAACGGC GCCTGCATCG GAATTTCCAG GGCTACACGG ACGATTCCGC AACCGTTCTG
ATCGGCCTCG GCGCCTCGGC GATCTCGCGC TTCCCGCAAG GCTACGCCCA GAACGCCAGC
GGCACCGCCC AGTACCAGAA GGCCATCCGC GCAGGCGGCT TCGCCACGGT GCGGGGCCAC
GACTTTGCCG GGGACGATGC GATGCGGGCG CGAATGATCG AGATGATCAT GTGCGATTTC
GCCGTGGATG GCCGGGAACT GGTCCGCGTG TTCAAGGTGC CCGAGGCCCG GATCACCGCC
CTGTTCCGCG CCGCACAAGA GCAATTCGGC GGTATGGTCG AGCTCGACAG CGGGGCACTC
AGCTTCCGTA TCCCCCCCGA TGCACGCCCC CTGACCCGGA TGATTGCAAG GGCCTTCGAC
CGCTACGAGG CCCCGGCAGG CAGCCACTCG GTCGCCACCT GA
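
The sequence above is displayed in space-separated groups of 10 bases, so the grouping has to be stripped before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line (60 of the 1362 bp) is used, so the result is not expected to match the 67% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spaces and line breaks, and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence only, copied from the record.
first_line = "ATGACACCCA AAACGCAACT TGCGCGCTAC GGGCTTTTTG ACACTCGCGT TCCACGGTAC"

seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.55
```

Passing the full 1362 bp sequence through the same two functions would give the gene-wide value to compare with the GC content field.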
 
Protein sequence
MTPKTQLARY GLFDTRVPRY TSYPTATHFS QATRPRDFTA WIQAIPPGSE ISLYAHVPFC 
RRLCWFCACR TQGTQSDAPV RAYVDTLLAE IALLRAALPE GVRLSRIHWG GGTPTLLAPD
LVTRLAEAMF ALAPTTDRAE FSVEIDPNEI DAARLDALAA AGMNRASIGV QDFDPDIQKA
IGREQRFEVT EAVVMDLRDR GIRSLNTDIL YGLPFQTPVK ITESVQKLLS LQPDRVALYG
YAHVPWMARR QNLIPNEALP TPEARLDLFE TARRLFRWDN YAEIGIDHFA HQGDGLAVAA
AERRLHRNFQ GYTDDSATVL IGLGASAISR FPQGYAQNAS GTAQYQKAIR AGGFATVRGH
DFAGDDAMRA RMIEMIMCDF AVDGRELVRV FKVPEARITA LFRAAQEQFG GMVELDSGAL
SFRIPPDARP LTRMIARAFD RYEAPAGSHS VAT
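
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method: it builds the standard codon table, which has the same amino acid assignments as translation table 11 (table 11 differs in which codons may act as start codons, which is not relevant here because this gene starts with ATG). The function name `translate` is hypothetical, and only the first and last displayed lines of the gene sequence are translated. Both lines begin on a codon boundary because every full line holds 60 bases.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spaces and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGACACCCA AAACGCAACT TGCGCGCTAC GGGCTTTTTG ACACTCGCGT TCCACGGTAC"
last_line = "CGCTACGAGG CCCCGGCAGG CAGCCACTCG GTCGCCACCT GA"

print(translate(first_line))  # MTPKTQLARYGLFDTRVPRY
print(translate(last_line))   # RYEAPAGSHSVAT
```

The two outputs correspond to the first 20 and last 13 residues of the protein sequence listed above.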