Gene ECD_03752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03752 
SymbolhemN 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3958782 
End bp3960155 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID 
Productcoproporphyrinogen III oxidase 
Protein accessionACT45545 
Protein GI253979875 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0523071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTAC AGCAAATCGA CTGGGATCTG GCCCTGATCC AGAAATATAA CTATTCCGGG 
CCACGATACA CCTCGTACCC GACCGCGCTG GAGTTTTCAG AAGACTTCGG CGAACAGGCG
TTTTTACAAG CCGTGGCGCG CTATCCTGAG CGTCCATTAT CTCTCTACGT ACATATCCCG
TTCTGCCATA AGCTTTGTTA CTTCTGCGGT TGCAATAAGA TTGTTACTCG CCAGCAGCAC
AAGGCCGATC AGTATCTGGA CGCGCTGGAG CAAGAAATCG TCCATCGTGC ACCGCTGTTC
GCCGGGCGTC ACGTCAGCCA ATTGCACTGG GGCGGCGGAA CGCCGACGTA TCTGAATAAA
GCGCAAATCA GCCGCCTGAT GAAGCTGCTG CGCGAAAACT TCCAGTTCAA TGCCGATGCG
GAGATTTCGA TCGAAGTCGA TCCGCGGGAA ATCGAACTGG ATGTACTCGA TCATTTACGC
GCCGAAGGCT TTAATCGCCT GAGCATGGGC GTGCAGGACT TCAACAAAGA AGTGCAACGT
CTGGTTAACC GCGAGCAGGA TGAAGAGTTC ATCTTTGCAC TGCTTAACCA TGCGCGTGAG
ATTGGTTTTA CCTCCACCAA CATCGACCTG ATTTACGGCC TGCCGAAACA GACGCCGGAG
AGTTTCGCCT TTACCCTGAA ACGTGTGGCG GAGCTGAACC CCGATCGTCT GAGTGTCTTT
AACTACGCGC ATCTGCCGAC CATTTTTGCT GCTCAGCGCA AAATCAAAGA TGCTGACCTG
CCGAGTCCGC AGCAAAAACT CGATATCCTG CAGGAAACCA TCGCCTTCCT GACGCAATCG
GGCTATCAGT TTATCGGTAT GGATCACTTT GCCCGTCCGG ATGACGAGCT GGCGGTGGCC
CAGCGTGAAG GCGTGCTGCA TCGTAACTTC CAGGGCTACA CCACTCAGGG CGATACCGAT
CTGCTGGGGA TGGGCGTTTC CGCCATCAGC ATGATTGGCG ACTGCTACGC GCAGAACCAG
AAAGAGTTGA AGCAGTACTA TCAGCAAGTG GATGAACAAG GCAATGCGCT GTGGCGTGGT
ATTGCGCTAA CGCGTGATGA CTGTATTCGC CGCGATGTGA TTAAGTCGCT CATCTGCAAC
TTCCGTCTGG ATTACGCCCC TATTGAGAAA CAGTGGGATT TGCACTTCGC TGATTACTTT
GCGGAAGATC TCAAGCTGCT CGCCCCGTTA GCAAAAGATG GGCTGGTGGA TGTGGATGAG
AAGGGAATAC AGGTGACGGC GAAAGGTCGC TTGCTGATCC GCAACATTTG CATGTGCTTT
GATACCTATC TGCGCCAGAA AGCGCGGATG CAGCAGTTCT CACGGGTGAT TTAA
 
Protein sequence
MSVQQIDWDL ALIQKYNYSG PRYTSYPTAL EFSEDFGEQA FLQAVARYPE RPLSLYVHIP 
FCHKLCYFCG CNKIVTRQQH KADQYLDALE QEIVHRAPLF AGRHVSQLHW GGGTPTYLNK
AQISRLMKLL RENFQFNADA EISIEVDPRE IELDVLDHLR AEGFNRLSMG VQDFNKEVQR
LVNREQDEEF IFALLNHARE IGFTSTNIDL IYGLPKQTPE SFAFTLKRVA ELNPDRLSVF
NYAHLPTIFA AQRKIKDADL PSPQQKLDIL QETIAFLTQS GYQFIGMDHF ARPDDELAVA
QREGVLHRNF QGYTTQGDTD LLGMGVSAIS MIGDCYAQNQ KELKQYYQQV DEQGNALWRG
IALTRDDCIR RDVIKSLICN FRLDYAPIEK QWDLHFADYF AEDLKLLAPL AKDGLVDVDE
KGIQVTAKGR LLIRNICMCF DTYLRQKARM QQFSRVI