Gene Information |
Locus tag | ECH74115_5313 |
Symbol | hemN |
ID | 6968245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4951110 |
End bp | 4952483 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388974 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002273383 |
Protein GI | 209396387 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00538] oxygen-independent coproporphyrinogen III oxidase |
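
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4951110
end_bp = 4952483

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1374
print(protein_length)  # 457
```

Under these assumptions the results agree with the listed Gene Length (1374 bp) and Protein Length (457 aa).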
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00391733 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.998982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAC AGCAAATCGA CTGGGATCTG GCCCTGATCC AGAAATATAA CTATTCCGGG CCACGATACA CCTCGTACCC GACCGCGCTG GAGTTTTCAG AAGACTTCGG CGAACAGGCG TTTTTACAAG CCGTGGCGCG CTATCCTGAG CGTCCATTAT CTCTCTACGT ACATATCCCG TTCTGCCATA AGCTTTGTTA CTTCTGCGGC TGCAATAAGA TTGTTACTCG CCAGCAGCAC AAGGCCGATC AGTATCTGGA CGCGCTGGAG CAAGAAATCG TCCATCGTGC ACCGCTGTTC GCCGGGCGTC ACGTCAGCCA GCTGCACTGG GGCGGCGGAA CGCCGACGTA TCTTAATAAA GCGCAAATCA GCCGCCTGAT GAAGCTGCTG CGCGAAAACT TCCAGTTCAA TGCCGATGCG GAGATTTCGA TCGAAGTCGA TCCGCGGGAA ATCGAACTGG ATGTACTCGA TCATTTACGC GCCGAGGGCT TTAATCGCCT GAGTATGGGC GTGCAGGACT TCAACAAAGA AGTGCAACGT CTGGTTAACC GCGAGCAGGA TGAAGAGTTT ATCTTTGCGC TGCTTAACCA TGCGCGTGAG ATTGGCTTTA CCTCCACCAA CATCGACCTG ATTTACGGCC TGCCGAAACA GACGCCGGAA AGTTTCGCCT TTACCCTGAA ACGCGTGGCG GAGCTAAACC CTGACCGTCT GAGCGTCTTT AACTATGCGC ATCTGCCGAC CATTTTTGCT GCTCAGCGCA AAATCAAAGA TGCTGACCTG CCGAGTCCGC AGCAAAAACT CGATATCCTG CAGGAAACCA TCGCCTTCCT GACGCAATCG GGCTATCAGT TTATCGGTAT GGATCACTTT GCCCGTCCGG ATGACGAGCT GGCGGTGGCC CAGCGTGAAG GCGTGCTGCA TCGTAATTTC CAGGGCTACA CCACTCAGGG CGATACCGAT CTGCTGGGGA TGGGCGTTTC CGCCATCAGC ATGATTGGCG ACTGCTACGC GCAGAACCAG AAAGAGTTGA AGCAGTACTA TCAGCAAGTG GATGAACAAG GCAATGCGCT GTGGCGTGGT ATTGCGCTAA CGCGTGATGA CTGTATTCGC CGCGATGTGA TTAAGTCGCT CATCTGCAAC TTCCGTCTGG ATTACGCCCC TATTGAGAAA CAGTGGGATT TGCACTTCGC TGATTACTTT GCGGAAGATC TCAAGCTGCT CGCCCCGTTA GCAAAAGATG GGCTGGTGGA TGTGGATGAG AAGGGAATAC AGGTGACGGC GAAAGGTCGC TTGCTGATCC GCAACATTTG CATGTGCTTT GATACCTATC TGCGCCAGAA AGCGCGGATG CAGCAGTTCT CTCGGGTGAT TTAA
|
Protein sequence | MSVQQIDWDL ALIQKYNYSG PRYTSYPTAL EFSEDFGEQA FLQAVARYPE RPLSLYVHIP FCHKLCYFCG CNKIVTRQQH KADQYLDALE QEIVHRAPLF AGRHVSQLHW GGGTPTYLNK AQISRLMKLL RENFQFNADA EISIEVDPRE IELDVLDHLR AEGFNRLSMG VQDFNKEVQR LVNREQDEEF IFALLNHARE IGFTSTNIDL IYGLPKQTPE SFAFTLKRVA ELNPDRLSVF NYAHLPTIFA AQRKIKDADL PSPQQKLDIL QETIAFLTQS GYQFIGMDHF ARPDDELAVA QREGVLHRNF QGYTTQGDTD LLGMGVSAIS MIGDCYAQNQ KELKQYYQQV DEQGNALWRG IALTRDDCIR RDVIKSLICN FRLDYAPIEK QWDLHFADYF AEDLKLLAPL AKDGLVDVDE KGIQVTAKGR LLIRNICMCF DTYLRQKARM QQFSRVI
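
As a rough illustration of how the gene and protein sequences relate, the sketch below translates a nucleotide sequence written in 10-base blocks, as in this record. It uses the amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs mainly in which start codons it permits). The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names chosen for this example, and only the first 30 bases of the gene are used.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# for translation table 11 are the same as for the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)

# First three 10-base blocks of the gene sequence above.
fragment = "ATGTCTGTAC AGCAAATCGA CTGGGATCTG"
print(translate(fragment))  # MSVQQIDWDL
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.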