Gene EcDH1_4120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4120 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4461052 
End bp4462425 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID 
Productoxygen-independent coproporphyrinogen III oxidase 
Protein accessionACX41720 
Protein GI260451298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTAC AGCAAATCGA CTGGGATCTG GCCCTGATCC AGAAATATAA CTATTCCGGG 
CCACGATACA CCTCGTACCC GACCGCGCTG GAGTTTTCAG AAGACTTCGG CGAACAGGCG
TTTTTACAAG CCGTGGCGCG CTATCCTGAG CGTCCATTAT CTCTCTACGT ACATATCCCG
TTCTGCCATA AGCTTTGTTA CTTCTGCGGT TGCAATAAGA TTGTTACTCG CCAGCAGCAC
AAGGCCGATC AGTATCTGGA CGCGCTGGAG CAAGAAATCG TCCATCGTGC ACCGCTGTTC
GCCGGGCGTC ACGTCAGCCA ATTGCACTGG GGCGGCGGAA CGCCGACGTA TCTGAATAAA
GCGCAAATCA GCCGCCTGAT GAAGCTGCTG CGCGAAAACT TCCAGTTCAA TGCCGATGCG
GAGATTTCGA TCGAAGTCGA TCCGCGGGAA ATCGAACTGG ATGTACTCGA TCATTTACGC
GCCGAAGGCT TTAATCGCCT GAGCATGGGC GTGCAGGACT TCAACAAAGA AGTGCAACGT
CTGGTTAACC GCGAGCAGGA TGAAGAGTTC ATCTTTGCAC TGCTTAACCA TGCGCGTGAG
ATTGGTTTTA CCTCCACCAA CATCGACCTG ATTTACGGCC TGCCGAAACA GACGCCGGAG
AGTTTCGCCT TTACCCTGAA ACGTGTGGCG GAGCTGAACC CCGATCGTCT GAGTGTCTTT
AACTACGCGC ATCTGCCGAC CATTTTTGCT GCTCAGCGCA AAATCAAAGA TGCTGACCTG
CCGAGTCCGC AGCAAAAACT CGATATCCTG CAGGAAACCA TCGCCTTCCT GACGCAATCG
GGCTATCAGT TTATCGGTAT GGATCACTTT GCCCGTCCGG ATGACGAGCT GGCGGTGGCC
CAGCGTGAAG GCGTGCTGCA TCGTAACTTC CAGGGCTACA CCACTCAGGG CGATACCGAT
CTGCTGGGGA TGGGCGTTTC CGCCATCAGC ATGATTGGCG ACTGCTACGC GCAGAACCAG
AAAGAGTTGA AGCAGTACTA TCAGCAAGTG GATGAACAAG GCAATGCGCT GTGGCGTGGT
ATTGCGCTAA CGCGTGATGA CTGTATTCGC CGCGATGTGA TTAAGTCGCT CATCTGCAAC
TTCCGTCTGG ATTACGCCCC TATTGAGAAA CAGTGGGATT TGCACTTCGC TGATTACTTT
GCGGAAGATC TCAAGCTGCT CGCCCCGTTA GCAAAAGATG GGCTGGTGGA TGTGGATGAG
AAGGGAATAC AGGTGACGGC GAAAGGTCGC TTGCTGATCC GCAACATTTG CATGTGCTTT
GATACCTATC TGCGCCAGAA AGCGCGGATG CAGCAGTTCT CTCGGGTGAT TTAA
 
Protein sequence
MSVQQIDWDL ALIQKYNYSG PRYTSYPTAL EFSEDFGEQA FLQAVARYPE RPLSLYVHIP 
FCHKLCYFCG CNKIVTRQQH KADQYLDALE QEIVHRAPLF AGRHVSQLHW GGGTPTYLNK
AQISRLMKLL RENFQFNADA EISIEVDPRE IELDVLDHLR AEGFNRLSMG VQDFNKEVQR
LVNREQDEEF IFALLNHARE IGFTSTNIDL IYGLPKQTPE SFAFTLKRVA ELNPDRLSVF
NYAHLPTIFA AQRKIKDADL PSPQQKLDIL QETIAFLTQS GYQFIGMDHF ARPDDELAVA
QREGVLHRNF QGYTTQGDTD LLGMGVSAIS MIGDCYAQNQ KELKQYYQQV DEQGNALWRG
IALTRDDCIR RDVIKSLICN FRLDYAPIEK QWDLHFADYF AEDLKLLAPL AKDGLVDVDE
KGIQVTAKGR LLIRNICMCF DTYLRQKARM QQFSRVI