Gene ECH74115_4258 details

Gene Information

Locus tag: ECH74115_4258
Symbol:
ID: 6970781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3943170
End bp: 3944306
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 53%
IMG OID: 643387996
Product: coproporphyrinogen III oxidase
Protein accession: YP_002272435
Protein GI: 209398861
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
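
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record itself does not state either convention. The function names `gene_length_bp` and `protein_length_aa` are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3943170, 3944306)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both values agree with the Gene Length and Protein Length fields of this record.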


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAAAT TACCGCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC 
CCGTACTGCG ATTTCAACTC TCACGCGTTG AAAGGAGAAG TGCCGCACGA CGATTACGTT
CAGCATCTGC TTAACGATCT GGACAACGAT GTGGCTTACG CTCAGGGCCG TGAAGTAAAG
ACAATCTTTA TTGGCGGTGG TACGCCGAGC CTGCTTTCCG GCTCGGCGAT GCAAACGCTG
CTGGACGGCG TGCGTGCGCG TTTGCCGCTG GCAGCGGATG CAGAAATTAC CATGGAAGCG
AACCCCGGTA CGGTAGAAGC CGATCGCTTT GTTGATTATC AGCGTGCCGG TGTAAATCGT
ATCTCTATTG GGGTACAGAG TTTTAGCGAA GAAAAGCTGA AACGGCTGGG ACGTATTCAT
GGCCCGCAAG AAGCGAAACG AGCGGCAAAG CTGGCGAGCG GTTTAGGGTT ACGTAGCTTT
AACCTTGATT TGATGCATGG GCTACCGGAT CAATCGCTGG AAGAGGCGCT TGGCGATCTG
CGCCAGGCTA TTGAACTGAA TCCGCCGCAT CTTTCCTGGT ATCAACTGAC CATCGAACCT
AATACGCTGT TTGGTTCACG CCCGCCGGTA CTGCCGGACG ACGACGCGCT GTGGGATATT
TTCGAACAGG GGCATCAGTT ATTAACCGCA GCGGGTTATC AGCAATATGA AACTTCCGCT
TACGCCAAAC CCGGTTATCA GTGCCAGCAC AATCTCAACT ACTGGCGCTT TGGTGACTAC
ATCGGTATTG GCTGCGGCGC GCACGGCAAA GTGACCTTCC CGGATGGGCG CATTCTGCGT
ACCACCAAAA CGCGTCATCC GCGTGGTTTT ATGCAAGGAA GGTATCTGGA AAGCCAGCGT
GATGTCGAAG CCGCAGATAA GCCGTTTGAG TTCTTTATGA ATCGCTTCCG GTTGCTGGAA
CCTGCGCCGC GCGTGGAGTT TAGTGCGTAT ACCGGGCTTT GCGAAGATGT GATTCGCCCA
CAGTTAGACG AGGCGATTGC CCAGGGTTAT CTCACCGAAT GTGCGGATTA CTGGCAGATT
ACGGAACATG GGAAGCTGTT TTTAAATTCG CTGCTGGAGC TTTTTCTGGC TGAGTAA
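
The gene sequence is laid out in space-separated blocks of ten bases, so the spacing has to be removed before computing anything from it. A minimal sketch of recomputing a GC percentage from text in this layout follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used, so the result is not expected to match the 53% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full text for the whole gene.
first_line = "ATGGTTAAAT TACCGCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # 50.0
```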
 
Protein sequence
MVKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLNDLDND VAYAQGREVK 
TIFIGGGTPS LLSGSAMQTL LDGVRARLPL AADAEITMEA NPGTVEADRF VDYQRAGVNR
ISIGVQSFSE EKLKRLGRIH GPQEAKRAAK LASGLGLRSF NLDLMHGLPD QSLEEALGDL
RQAIELNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA
YAKPGYQCQH NLNYWRFGDY IGIGCGAHGK VTFPDGRILR TTKTRHPRGF MQGRYLESQR
DVEAADKPFE FFMNRFRLLE PAPRVEFSAY TGLCEDVIRP QLDEAIAQGY LTECADYWQI
TEHGKLFLNS LLELFLAE
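
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G). Translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as starts, so the sketch ignores start-codon handling; that is acceptable here because this gene begins with ATG. The function name `translate` is illustrative, and unrecognised codons are reported as `X` by assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, then its last three codons.
print(translate("ATGGTTAAAT TACCGCCGCT GAGTCTCTAC"))  # MVKLPPLSLY
print(translate("GCTGAGTAA"))                         # AE
```

The two outputs match the first ten residues and the last two residues of the protein sequence listed above.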