Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_B0099 |
Symbol | katG |
ID | 6966455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 60441 |
End bp | 62651 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643383996 |
Product | catalase/peroxidase HPI |
Protein accession | YP_002268475 |
Protein GI | 209395537 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.311773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAAAA AAACTCTTCC TGTTCTGATT CTTCTGGCGC TATCGGGGAG CTTTTCTACC GCTGTAGCCG CTGATAAAAA AGAGACTCAA AATTTCTACT ATCCAGAAAC ACTGGATTTA ACTCCTCTGA GATTACACAG CCCTGAATCA AATCCCTGGG GGGCTGATTT TGATTATGCC ACCAGATTTC AACAGCTGGA TATGGAGGCT CTGAAAAAAG ATATCAAAGA TTTGCTGACA ACTTCCCAGG ATTGGTGGCC TGCGGATTAT GGTCATTATG GTCCTTTCTT TATTCGTATG GCTTGGCACG GTGCCGGAAC ATACAGGACA TATGATGGCC GGGGAGGCGC CAGTGGTGGT CAGCAACGTT TTGAACCGCT GAACAGCTGG CCGGATAACG TTAATCTGGA TAAAGCCCGT CGATTGCTGT GGCCAGTCAA GAAAAAATAC GGCTCCAGTA TTTCCTGGGG AGACCTGATG GTCCTGACTG GTAATGTTGC CCTTGAATCC ATGGGATTTA AAACGCTGGG ATTTGCTGGC GGAAGAGAAG ATGACTGGGA GTCGGACCTG GTATACTGGG GGCCTGACAA CAAGCCTCTT GCAGATAACC GGGATAAAAA CGGGAAACTT CAGAAACCTC TTGCCGCCAC GCAGATGGGA CTTATTTATG TCAATCCTGA AGGCCCCGGT GGAAAACCAG ATCCTCTGGC TTCCGCGAAA GATATCAGGG AAGCTTTTTC ACGTATGGCC ATGGATGATG AGGAGACTGT GGCCCTGATC GCGGGAGGGC ATACATTTGG TAAAGCACAT GGTGCAGCGT CTCCTGAAAA ATGTATTGGC GCAGGGCCTG ATGGTGCACC TGTGGAGGAG CAGGGACTGG GATGGAAAAA TAAATGTGGT ACAGGAAACG GCAAATATAC CATCACCAGT GGCCTGGAAG GAGCCTGGTC GACATCGCCA ACCCAGTTCA CAATGCAGTA TCTGAAGAAT TTATATAAAT ATGAATGGGA GCTGCACAAG AGTCCTGCCG GTGCTTATCA GTGGAAGCCT AAAAAAGCGG CAAATATAGT TCAGGACGCG CATGATCCGT CTGTCCTGCA TCCGTTGATG ATGTTTACGA CGGATATTGC TCTTAAAGTT GATCCTGAAT ATAAGAAAAT AACCACCCGT TTCCTGAATG ATCCAAAAGC TTTTGAGCAG GCATTCGCAA GAGCATGGTT TAAACTGACC CACCGGGATA TGGGACCGGC AGCCCGATAT CTTGGTAATG AAGTTCCTGC AGAATCATTT ATCTGGCAGG ATCCTCTTCC TGCGGCGGAT TATACAATGA TTGATGGTAA AGACATTAAG TCGCTGAAAG AGCAGGTTAT GGATTTGGGT ATCCCTGCAT CTGAGCTGAT AAAGACAGCC TGGGCTTCAG CTTCCACATT TCGTGTGACT GATTATCGTG GGGGAAATAA TGGTGCCCGC ATCAGGTTAC AGCCCGAAAT TAACTGGGAA GTTAATGAGC CTGAAAAACT GAAGAAAGTA CTGGCATCCC TGACCTCATT ACAGCGTGAA TTTAACAAAA AACAGTCTGA CGGAAAGAAA GTGTCGTTGG CTGATTTAAT TGTTCTTTCG GGTAATGCTG CAATCGAAGA TGCGGCCAGA AAAGCCGGGG TGGAACTTGA GATTCCCTTT ACTCCGGGAA GAACTGACGC CTCTCAGGAG CAGACGGATG TTGCCTCATT CAGTGTACTG GAGCCGACAG CAGATGGATT CAGAAATTAT TACTCAAAAA GCAGAAGTCA TATATCGCCG GTTGAAAGCC TCATTGATAA AGCCAGTCAG CTGGATCTCA CCGTTCCTGA AATGACGGCA TTACTGGGTG GTCTGCGGGT AATGGATATT AATACAAATA ATTCTTCGTT GGGAGTGTTT ACCGATACCC CTGGTGTTCT GGATAACAAG TTTTTTGTTA ATCTGCTGGA TATGTCAACA CGATGGAGTA AAGCAGATAA AGAAGATACA TACAATGGAT TCGATCGTAA AACGGGAGCA TTAAAATGGA AAGCATCCTC TGTTGATTTA ATCTTCAGTT CAAATCCTGA ATTACGTGCG GTGGCAGAAG TATATGCCTC GGATGATGCG AGAAATAAGT TTATTCATGA TTTTGTTAAA TCGTGGAATA AAGTTATGAA TAGCGATCGG TTTGATTTAA ACAATAAATG A
|
Protein sequence | MIKKTLPVLI LLALSGSFST AVAADKKETQ NFYYPETLDL TPLRLHSPES NPWGADFDYA TRFQQLDMEA LKKDIKDLLT TSQDWWPADY GHYGPFFIRM AWHGAGTYRT YDGRGGASGG QQRFEPLNSW PDNVNLDKAR RLLWPVKKKY GSSISWGDLM VLTGNVALES MGFKTLGFAG GREDDWESDL VYWGPDNKPL ADNRDKNGKL QKPLAATQMG LIYVNPEGPG GKPDPLASAK DIREAFSRMA MDDEETVALI AGGHTFGKAH GAASPEKCIG AGPDGAPVEE QGLGWKNKCG TGNGKYTITS GLEGAWSTSP TQFTMQYLKN LYKYEWELHK SPAGAYQWKP KKAANIVQDA HDPSVLHPLM MFTTDIALKV DPEYKKITTR FLNDPKAFEQ AFARAWFKLT HRDMGPAARY LGNEVPAESF IWQDPLPAAD YTMIDGKDIK SLKEQVMDLG IPASELIKTA WASASTFRVT DYRGGNNGAR IRLQPEINWE VNEPEKLKKV LASLTSLQRE FNKKQSDGKK VSLADLIVLS GNAAIEDAAR KAGVELEIPF TPGRTDASQE QTDVASFSVL EPTADGFRNY YSKSRSHISP VESLIDKASQ LDLTVPEMTA LLGGLRVMDI NTNNSSLGVF TDTPGVLDNK FFVNLLDMST RWSKADKEDT YNGFDRKTGA LKWKASSVDL IFSSNPELRA VAEVYASDDA RNKFIHDFVK SWNKVMNSDR FDLNNK
|
| |