Gene EcSMS35_4251 details

Gene Information

Locus tag: EcSMS35_4251
Symbol: hemN
ID: 6143182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4346828
End bp: 4348201
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 53%
IMG OID: 641619072
Product: coproporphyrinogen III oxidase
Protein accession: YP_001746196
Protein GI: 170683331
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
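
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the record above.
start_bp = 4346828
end_bp = 4348201
gene_length_bp = 1374
protein_length_aa = 457


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)     # True
print(coding_codons(gene_length_bp) == protein_length_aa)  # True
```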


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.172293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0658119
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGTAC AGCAAATCGA CTGGGATCTG GCCCTGATCC AGAAATATAA CTATTCCGGG 
CCACGATACA CCTCGTACCC GACCGCGCTG GAGTTTTCAG AAGACTTCGG CGAACAGGCG
TTTTTACAAG CCGTGGCGCG CTATCCTGAG CGTCCATTAT CTCTCTACGT ACATATTCCG
TTCTGCCATA AGCTTTGTTA CTTCTGCGGT TGCAATAAGA TAGTTACTCG CCAGCAGCAC
AAGGCCGATC AGTATCTGGA CGCGCTGGAG CAAGAAATCG TCCATCGTGC ACCGCTTTTT
GCCGGGCGCA AGGTGAGCCA GCTGCACTGG GGCGGTGGTA CGCCGACGTA TCTGAACAAA
GCGCAAATCA GCCGTCTGAT GAAGCTGCTG CGCGAAAACT TCCAGTTCAA TGCCGATGCG
GAGATTTCGA TCGAAGTCGA TCCGCGCGAA ATCGAACTGG ATGTACTCGA TCATTTACGC
GCCGAGGACT TTAATCGCCT GAGCATGGGC GTGCAGGACT TCAACAAAGA AGTACAGCGT
CTGGTTAACC GCGAGCAGGA TGAAGAGTTC ATCTTTGCAC TGCTTAACCA TGCGCGTGAG
ATTGGCTTTA CCTCCACCAA CATCGACCTG ATTTACGGTC TGCCGAAACA GACGCCGGAA
AGTTTTGCCT TTACCCTGAA ACGTGTGGCG GAGCTGAACC CCGATCGTCT GAGCGTCTTT
AACTACGCGC ATTTGCCGAC CATTTTTGCT GCTCAGCGCA AAATCAAAGA TGCTGACCTG
CCGAGTCCGC AGCAAAAACT CGATATCCTG CAGGAAACCA TTGCCTTCCT GACGCAATCG
GGCTATCAGT TTATCGGGAT GGATCACTTT GCCCGCCCGG ATGACGAGCT GGCGGTGGCC
CAGCGTGAAG GCGTGCTGCA TCGTAACTTC CAGGGCTACA CCACTCAGGG CGATACCGAT
CTGCTGGGGA TGGGCGTTTC CGCCATCAGT ATGATTGGCG ACTGCTACGC GCAGAACCAG
AAAGAGTTGA AGCACTACTA TCAGCAAGTG GATGAACAAG GCAACGCGCT GTGGCGTGGT
ATTGCGCTAA CGCGTGATGA CTGTATTCGC CGCGATGTGA TTAAGTCGCT CATCTGCAAC
TTCCGTCTGG ATTACGCTCC CATTGAGCAA CAGTGGGATT TGCACTTCGC TGATTACTTT
GCGGAAGATC TCAAGCTGCT CGCCCCGTTA GCAAAAGATG GGCTGGTGGA TGTGGATGAG
AAGGGGATTC AGGTGACGGC GAAAGGTCGC TTGCTGATCC GCAACATTTG CATGTGCTTT
GATACCTATC TGCGCCAGAA AGCGCGGATG CAGCAGTTCT CACGGGTGAT TTAA
 
Protein sequence
MSVQQIDWDL ALIQKYNYSG PRYTSYPTAL EFSEDFGEQA FLQAVARYPE RPLSLYVHIP 
FCHKLCYFCG CNKIVTRQQH KADQYLDALE QEIVHRAPLF AGRKVSQLHW GGGTPTYLNK
AQISRLMKLL RENFQFNADA EISIEVDPRE IELDVLDHLR AEDFNRLSMG VQDFNKEVQR
LVNREQDEEF IFALLNHARE IGFTSTNIDL IYGLPKQTPE SFAFTLKRVA ELNPDRLSVF
NYAHLPTIFA AQRKIKDADL PSPQQKLDIL QETIAFLTQS GYQFIGMDHF ARPDDELAVA
QREGVLHRNF QGYTTQGDTD LLGMGVSAIS MIGDCYAQNQ KELKHYYQQV DEQGNALWRG
IALTRDDCIR RDVIKSLICN FRLDYAPIEQ QWDLHFADYF AEDLKLLAPL AKDGLVDVDE
KGIQVTAKGR LLIRNICMCF DTYLRQKARM QQFSRVI
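
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch rather than the database's own procedure: it builds the codon table by hand, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch ignores). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCTGTAC AGCAAATCGA CTGGGATCTG"))  # MSVQQIDWDL
```

Passing the full 1374 bp gene sequence to `translate` should yield the 457-residue protein listed above, provided the assumptions hold.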