Gene EcolC_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4149 
Symbol 
ID6066365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4578611 
End bp4579984 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID641603570 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001727073 
Protein GI170022119 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00538] oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.20297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0195614 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAC AGCAAATCGA CTGGGATCTG GCCCTGATCC AGAAATATAA CTATTCCGGG 
CCACGATACA CCTCGTACCC GACCGCGCTG GAGTTTTCAG AAGACTTCGG CGAACAGGCG
TTTTTACAAG CCGTGGCGCG CTATCCTGAG CGTCCATTAT CTCTCTACGT ACATATCCCG
TTCTGCCATA AACTTTGTTA CTTCTGCGGT TGCAATAAGA TTGTTACTCG CCAGCAGCAC
AAGGCCGATC AGTATCTGGA CGCGCTGGAG CAAGAAATCG TCCATCGTGC ACCGCTGTTC
GCCGGGCGTC ACGTCAGCCA ATTGCACTGG GGCGGCGGAA CGCCGACGTA TCTGAATAAA
GCGCAAATCA GCCGCCTGAT GAAGCTGCTG CGCGAAAACT TCCAGTTCAA TGCCGATGCG
GAGATTTCGA TCGAAGTCGA TCCGCGGGAA ATCGAACTGG ATGTACTCGA TCATTTACGC
GCCGAAGGCT TTAATCGCCT GAGCATGGGC GTGCAGGACT TCAACAAAGA AGTGCAACGT
CTGGTTAACC GCGAGCAGGA TGAAGAGTTC ATCTTTGCAC TGCTTAACCA TGCGCGTGAG
ATTGGTTTTA CCTCCACCAA CATCGACCTG ATTTACGGCC TGCCGAAACA GACGCCGGAG
AGTTTCGCCT TTACCCTGAA ACGTGTGGCG GAGCTGAACC CCGATCGTCT GAGTGTCTTT
AACTACGCGC ATCTGCCGAC CATTTTTGCT GCTCAGCGCA AAATCAAAGA TGCTGACCTG
CCGAGTCCGC AGCAAAAACT CGATATCCTG CAGGAAACCA TCGCCTTCCT GACGCAATCG
GGCTATCAGT TTATCGGTAT GGATCACTTT GCCCGTCCGG ATGACGAGCT GGCGGTGGCC
CAGCGTGAAG GCGTGCTGCA TCGTAACTTC CAGGGCTACA CCACTCAGGG CGATACCGAT
CTGCTGGGGA TGGGCGTTTC CGCCATCAGC ATGATTGGCG ACTGCTACGC GCAGAACCAG
AAAGAGTTGA AGCAGTACTA TCAGCAAGTG GATGAACAAG GCAATGCGCT GTGGCGTGGT
ATTGCGCTAA CGCGTGATGA CTGTATTCGC CGCGATGTGA TTAAGTCGCT CATCTGCAAC
TTCCGTCTGG ATTACGCCCC TATTGAGAAA CAGTGGGATT TGCACTTCGC TGATTACTTT
GCGGAAGATC TCAAGCTGCT CGCCCCGTTA GCAAAAGATG GGCTGGTGGA TGTGGATGAG
AAGGGAATAC AGGTGACGGC GAAAGGTCGC TTGCTGATCC GCAACATTTG CATGTGCTTT
GATACCTATC TGCGCCAGAA AGCGCGGATG CAGCAGTTCT CACGGGTGAT TTAA
 
Protein sequence
MSVQQIDWDL ALIQKYNYSG PRYTSYPTAL EFSEDFGEQA FLQAVARYPE RPLSLYVHIP 
FCHKLCYFCG CNKIVTRQQH KADQYLDALE QEIVHRAPLF AGRHVSQLHW GGGTPTYLNK
AQISRLMKLL RENFQFNADA EISIEVDPRE IELDVLDHLR AEGFNRLSMG VQDFNKEVQR
LVNREQDEEF IFALLNHARE IGFTSTNIDL IYGLPKQTPE SFAFTLKRVA ELNPDRLSVF
NYAHLPTIFA AQRKIKDADL PSPQQKLDIL QETIAFLTQS GYQFIGMDHF ARPDDELAVA
QREGVLHRNF QGYTTQGDTD LLGMGVSAIS MIGDCYAQNQ KELKQYYQQV DEQGNALWRG
IALTRDDCIR RDVIKSLICN FRLDYAPIEK QWDLHFADYF AEDLKLLAPL AKDGLVDVDE
KGIQVTAKGR LLIRNICMCF DTYLRQKARM QQFSRVI