Gene Information |
Locus tag | EcSMS35_4251 |
Symbol | hemN |
ID | 6143182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4346828 |
End bp | 4348201 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619072 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_001746196 |
Protein GI | 170683331 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00538] oxygen-independent coproporphyrinogen III oxidase |
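The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record itself. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
START_BP = 4346828
END_BP = 4348201
GENE_LENGTH_BP = 1374
PROTEIN_LENGTH_AA = 457


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold under the assumptions stated above.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1374 bp span corresponds to 458 codons, of which 457 encode amino acids and one is the stop codon.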
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.172293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0658119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAC AGCAAATCGA CTGGGATCTG GCCCTGATCC AGAAATATAA CTATTCCGGG CCACGATACA CCTCGTACCC GACCGCGCTG GAGTTTTCAG AAGACTTCGG CGAACAGGCG TTTTTACAAG CCGTGGCGCG CTATCCTGAG CGTCCATTAT CTCTCTACGT ACATATTCCG TTCTGCCATA AGCTTTGTTA CTTCTGCGGT TGCAATAAGA TAGTTACTCG CCAGCAGCAC AAGGCCGATC AGTATCTGGA CGCGCTGGAG CAAGAAATCG TCCATCGTGC ACCGCTTTTT GCCGGGCGCA AGGTGAGCCA GCTGCACTGG GGCGGTGGTA CGCCGACGTA TCTGAACAAA GCGCAAATCA GCCGTCTGAT GAAGCTGCTG CGCGAAAACT TCCAGTTCAA TGCCGATGCG GAGATTTCGA TCGAAGTCGA TCCGCGCGAA ATCGAACTGG ATGTACTCGA TCATTTACGC GCCGAGGACT TTAATCGCCT GAGCATGGGC GTGCAGGACT TCAACAAAGA AGTACAGCGT CTGGTTAACC GCGAGCAGGA TGAAGAGTTC ATCTTTGCAC TGCTTAACCA TGCGCGTGAG ATTGGCTTTA CCTCCACCAA CATCGACCTG ATTTACGGTC TGCCGAAACA GACGCCGGAA AGTTTTGCCT TTACCCTGAA ACGTGTGGCG GAGCTGAACC CCGATCGTCT GAGCGTCTTT AACTACGCGC ATTTGCCGAC CATTTTTGCT GCTCAGCGCA AAATCAAAGA TGCTGACCTG CCGAGTCCGC AGCAAAAACT CGATATCCTG CAGGAAACCA TTGCCTTCCT GACGCAATCG GGCTATCAGT TTATCGGGAT GGATCACTTT GCCCGCCCGG ATGACGAGCT GGCGGTGGCC CAGCGTGAAG GCGTGCTGCA TCGTAACTTC CAGGGCTACA CCACTCAGGG CGATACCGAT CTGCTGGGGA TGGGCGTTTC CGCCATCAGT ATGATTGGCG ACTGCTACGC GCAGAACCAG AAAGAGTTGA AGCACTACTA TCAGCAAGTG GATGAACAAG GCAACGCGCT GTGGCGTGGT ATTGCGCTAA CGCGTGATGA CTGTATTCGC CGCGATGTGA TTAAGTCGCT CATCTGCAAC TTCCGTCTGG ATTACGCTCC CATTGAGCAA CAGTGGGATT TGCACTTCGC TGATTACTTT GCGGAAGATC TCAAGCTGCT CGCCCCGTTA GCAAAAGATG GGCTGGTGGA TGTGGATGAG AAGGGGATTC AGGTGACGGC GAAAGGTCGC TTGCTGATCC GCAACATTTG CATGTGCTTT GATACCTATC TGCGCCAGAA AGCGCGGATG CAGCAGTTCT CACGGGTGAT TTAA
|
Protein sequence | MSVQQIDWDL ALIQKYNYSG PRYTSYPTAL EFSEDFGEQA FLQAVARYPE RPLSLYVHIP FCHKLCYFCG CNKIVTRQQH KADQYLDALE QEIVHRAPLF AGRKVSQLHW GGGTPTYLNK AQISRLMKLL RENFQFNADA EISIEVDPRE IELDVLDHLR AEDFNRLSMG VQDFNKEVQR LVNREQDEEF IFALLNHARE IGFTSTNIDL IYGLPKQTPE SFAFTLKRVA ELNPDRLSVF NYAHLPTIFA AQRKIKDADL PSPQQKLDIL QETIAFLTQS GYQFIGMDHF ARPDDELAVA QREGVLHRNF QGYTTQGDTD LLGMGVSAIS MIGDCYAQNQ KELKHYYQQV DEQGNALWRG IALTRDDCIR RDVIKSLICN FRLDYAPIEQ QWDLHFADYF AEDLKLLAPL AKDGLVDVDE KGIQVTAKGR LLIRNICMCF DTYLRQKARM QQFSRVI
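The relationship between the two sequences can be illustrated by translating the first three 10-base groups of the gene sequence. This is a minimal sketch, not the database's own method: it strips the display spacing, builds the codon table from the conventional TCAG ordering, and relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The `translate` function is a hypothetical helper written for this example.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base display grouping) is ignored, and any
    trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied with their spacing.
    print(translate("ATGTCTGTAC AGCAAATCGA CTGGGATCTG"))
```

The result can be compared against the first 10-residue group of the protein sequence, `MSVQQIDWDL`.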