Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4880 |
Symbol | |
ID | 6967893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4513004 |
End bp | 4514401 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643388568 |
Product | di-haem cytochrome c peroxidase family protein |
Protein accession | YP_002272996 |
Protein GI | 209397635 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGG TCTCACGTAT TACCGCGATC GGCCTGGCTG GCGTCGCGAT TTGCTATTTA GGGTTATCTG GTTATGTGTG GTACCACGAT AATAAACGCA GTAAACAGGC CGATGTTCAG GCATCTGCTG TCAGTGAAAA TAATAAGGTT TTAGGCTTTC TCCGCGAAAA AGGATGCGAC TATTGCCACA CGCCTTCGGC AGAATTACCC GCCTATTATT ATATTCCTGG CGCGAAACAG TTGATGGATT ACGACATTAA GCTTGGATAT AAATCTTTTA ACCTCGAGGC CGTGCGTGCT GCTCTGCTGG CTGATAAACC CGTTTCGCAA AGCGATCTGA ATAAGATTGA ATGGGTGATG CAGTATGAAA CTATGCCGCC AACGCGTTAT ACCGCGCTAC ACTGGGCGGG TAAGGTGAGT GATGAAGAGC GGGCGGAAAT ACTGGCCTGG ATTGCAAAAC AGCGCGCGGA ATATTACGCC AGCAATGATA CTGCTCCGGA GCATCGCAAT GAACCGGTGC AGCCCATCCC GCAAAAACTG CCTACCGATG CGCAAAAAGT GGCGTTGGGC TTTGCGCTGT ATCACGATCC CCGTTTATCG GCTGATAGCA CCATTTCATG CGCTCATTGC CATGCGTTGA ATGCGGGGGG CGTCGATGGC AGAAAAACAT CGATTGGTGT TGGTGGCGCA GTTGGGCCGA TTAACGCGCC GACGGTATTT AACTCAGTAT TTAACGTTGA GCAGTTCTGG GATGGTCGTG CGGCAACATT GCAGGATCAG GCTGGTGGAC CGCCGTTGAA CCCGATTGAA ATGGCGTCAA AATCCTGGGA CGAAATTATT GCTAAGCTGG AAAAAGATCC GCAGCTTAAA GCGCAGTTCC TCGACGTCTA TCCGCAAGGT TTCAGTGGCG AAAATATTAC TGATGCCATT GCTGAATTTG AGAAAACATT AATTACGCCG GATTCCCCAT TTGATAAATG GTTGCGTGGG GATGAAAATG CGCTGACGGC GCAACAGAAA AAAGGCTATC AATTATTTAA AGATAATAAA TGTGCAACTT GTCATGGTGG AATTATTCTC GGCGGACGTT CCTTCGAACC GTTGGGGCTG AAAAAAGACT TTAACTTTGG GGAAATTACG GCGGCGGATA TTGGTCGTAT GAATGTGACT AAAGAAGAGC GTGATAAATT GCGTCAGAAA GTACCCGGTT TACGTAACGT TGCTTTAACG GCACCGTACT TCCATCGCGG TGACGTGCCG ACGCTGGACG GGGCGGTGAA ACTGATGTTG CGCTATCAGG TAGGCAAAGA GCTGCCGCAG GAGGATGTGG ATGATATCGT AGCTTTCCTG CACAGTCTGA ACGGGGTTTA CACGCCGTAT ATGCAGGATA AACAATAA
|
Protein sequence | MKMVSRITAI GLAGVAICYL GLSGYVWYHD NKRSKQADVQ ASAVSENNKV LGFLREKGCD YCHTPSAELP AYYYIPGAKQ LMDYDIKLGY KSFNLEAVRA ALLADKPVSQ SDLNKIEWVM QYETMPPTRY TALHWAGKVS DEERAEILAW IAKQRAEYYA SNDTAPEHRN EPVQPIPQKL PTDAQKVALG FALYHDPRLS ADSTISCAHC HALNAGGVDG RKTSIGVGGA VGPINAPTVF NSVFNVEQFW DGRAATLQDQ AGGPPLNPIE MASKSWDEII AKLEKDPQLK AQFLDVYPQG FSGENITDAI AEFEKTLITP DSPFDKWLRG DENALTAQQK KGYQLFKDNK CATCHGGIIL GGRSFEPLGL KKDFNFGEIT AADIGRMNVT KEERDKLRQK VPGLRNVALT APYFHRGDVP TLDGAVKLML RYQVGKELPQ EDVDDIVAFL HSLNGVYTPY MQDKQ
|
| |