Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4258 |
Symbol | |
ID | 6970781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3943170 |
End bp | 3944306 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387996 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002272435 |
Protein GI | 209398861 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAAT TACCGCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC CCGTACTGCG ATTTCAACTC TCACGCGTTG AAAGGAGAAG TGCCGCACGA CGATTACGTT CAGCATCTGC TTAACGATCT GGACAACGAT GTGGCTTACG CTCAGGGCCG TGAAGTAAAG ACAATCTTTA TTGGCGGTGG TACGCCGAGC CTGCTTTCCG GCTCGGCGAT GCAAACGCTG CTGGACGGCG TGCGTGCGCG TTTGCCGCTG GCAGCGGATG CAGAAATTAC CATGGAAGCG AACCCCGGTA CGGTAGAAGC CGATCGCTTT GTTGATTATC AGCGTGCCGG TGTAAATCGT ATCTCTATTG GGGTACAGAG TTTTAGCGAA GAAAAGCTGA AACGGCTGGG ACGTATTCAT GGCCCGCAAG AAGCGAAACG AGCGGCAAAG CTGGCGAGCG GTTTAGGGTT ACGTAGCTTT AACCTTGATT TGATGCATGG GCTACCGGAT CAATCGCTGG AAGAGGCGCT TGGCGATCTG CGCCAGGCTA TTGAACTGAA TCCGCCGCAT CTTTCCTGGT ATCAACTGAC CATCGAACCT AATACGCTGT TTGGTTCACG CCCGCCGGTA CTGCCGGACG ACGACGCGCT GTGGGATATT TTCGAACAGG GGCATCAGTT ATTAACCGCA GCGGGTTATC AGCAATATGA AACTTCCGCT TACGCCAAAC CCGGTTATCA GTGCCAGCAC AATCTCAACT ACTGGCGCTT TGGTGACTAC ATCGGTATTG GCTGCGGCGC GCACGGCAAA GTGACCTTCC CGGATGGGCG CATTCTGCGT ACCACCAAAA CGCGTCATCC GCGTGGTTTT ATGCAAGGAA GGTATCTGGA AAGCCAGCGT GATGTCGAAG CCGCAGATAA GCCGTTTGAG TTCTTTATGA ATCGCTTCCG GTTGCTGGAA CCTGCGCCGC GCGTGGAGTT TAGTGCGTAT ACCGGGCTTT GCGAAGATGT GATTCGCCCA CAGTTAGACG AGGCGATTGC CCAGGGTTAT CTCACCGAAT GTGCGGATTA CTGGCAGATT ACGGAACATG GGAAGCTGTT TTTAAATTCG CTGCTGGAGC TTTTTCTGGC TGAGTAA
|
Protein sequence | MVKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLNDLDND VAYAQGREVK TIFIGGGTPS LLSGSAMQTL LDGVRARLPL AADAEITMEA NPGTVEADRF VDYQRAGVNR ISIGVQSFSE EKLKRLGRIH GPQEAKRAAK LASGLGLRSF NLDLMHGLPD QSLEEALGDL RQAIELNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA YAKPGYQCQH NLNYWRFGDY IGIGCGAHGK VTFPDGRILR TTKTRHPRGF MQGRYLESQR DVEAADKPFE FFMNRFRLLE PAPRVEFSAY TGLCEDVIRP QLDEAIAQGY LTECADYWQI TEHGKLFLNS LLELFLAE
|
| |