Gene Dshi_3498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3498 
SymbolhemH 
ID5713729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3678932 
End bp3680020 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content61% 
IMG OID641269427 
Productferrochelatase 
Protein accessionYP_001534832 
Protein GI159046038 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.440479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.808064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACA GAGGCACAGA ACCGATGAAC ATGATGTCCA AGACGACCTC GTCCCACCTG 
CCCGGTGATC ACCCGCCGGT CAATTTCGGC AAGGTCGGCG TACTGCTGGC CAATCTCGGG
ACGCCGGACA ATTATGACTA TTGGTCGATG CGGCGGTATC TGAATGAATT CCTGTCCGAC
AAGCGGGTGA TCGATTACAG CCCCTGGATC TGGCAACCGC TGTTGCAGCT GGTGATCCTG
ACCAAGCGGC CCTTCAGCTC GGGTGCGGCC TACAAGTCGA TCTGGAACGA AGAGGCGGGG
GAGAGCCCGC TGATGACCAT CACCAAGGAT CAGACCGCCA AGATGAAGGC GGCGATGCAG
GCTCGGTTCG GCGATGACGT GGTGGTGGAT TTCTGCATGC GCTACGGCAA TCCGTCCACC
AAGTCGAAGG TGGAGGAATT GCAGAAGCAG GGCTGCCAGA AGATCCTGTT CTTCCCGCTC
TATCCGCAAT ATGCGGGCGC GACCTCGGCC ACGGCCTGCG ACCAGTTCTT CCGGTCGCTG
GAGCATATCA AGTGGCAGCC GATCGTGCGC ACGGTGGAGC CGTATTTCGA GCATCCGATG
TATATCGAGG CGCTGGCCCA GTCCGTGGAG CGCGCCTATG CGGACATGGA AACCCGCCCC
GACGTGCTGG TCGCGTCCTA TCACGGGGTG CCGAAGCGGT ACCTGATGGA GGGTGACCCG
TACCACTGCC AGTGTCAGAA GACCTCGCGC CTGCTCAAGG AACGGCTGGG CTGGCCGGAG
GGCGAGATCG TGACCACCTT CCAGAGCCGG TTCGGCCCGG AGGAATGGCT CAAGCCCTAC
ACGGTCGAAG AGGTCGCGCG CCTGGCCGAG ACCGGCAAGA AGAAGATCGC GGTGATCGCG
CCGGCATTTT CCGCCGACTG CATCGAAACG CTCGAAGAGA TCAACGAAGA GATCAAGGAG
AGCTTCGAGG AGGCGGGCGG CGAAGAGTTC ACCTATATCC CCTGCCTGAA TGACGACGAC
GCCCATATCG CGGCGCTGGC CAAGGTCGTG GAAGAAAACC TTGCGGGCTG GATCGCGCCG
AAGGGCTGA
 
Protein sequence
MADRGTEPMN MMSKTTSSHL PGDHPPVNFG KVGVLLANLG TPDNYDYWSM RRYLNEFLSD 
KRVIDYSPWI WQPLLQLVIL TKRPFSSGAA YKSIWNEEAG ESPLMTITKD QTAKMKAAMQ
ARFGDDVVVD FCMRYGNPST KSKVEELQKQ GCQKILFFPL YPQYAGATSA TACDQFFRSL
EHIKWQPIVR TVEPYFEHPM YIEALAQSVE RAYADMETRP DVLVASYHGV PKRYLMEGDP
YHCQCQKTSR LLKERLGWPE GEIVTTFQSR FGPEEWLKPY TVEEVARLAE TGKKKIAVIA
PAFSADCIET LEEINEEIKE SFEEAGGEEF TYIPCLNDDD AHIAALAKVV EENLAGWIAP
KG