Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5403 |
Symbol | katG |
ID | 6972248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5043830 |
End bp | 5046010 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643389056 |
Product | catalase/peroxidase HPI |
Protein accession | YP_002273465 |
Protein GI | 209395777 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.610703 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGT CAGACGATAT CCATAACACG ACAGCCACTG GCAAATGCCC GTTCCATCAG GGCGGTCACG ACCAGAGCGC GGGGGCGGGC ACAACCACTC GCGACTGGTG GCCAAATCAA CTCCGTGTTG ACCTGTTAAA CCAACATTCT AATCGTTCTA ACCCGCTGGG TGAGGACTTT GACTACCGCA AAGAATTCAG CAAATTAGAT TACTACGGCC TGAAAAAAGA TCTGAAAGCC CTGCTGACAG AATCTCAGCC GTGGTGGCCA GCCGACTGGG GCAGCTACGC CGGTCTGTTT ATTCGTATGG CCTGGCACGG CGCGGGGACT TACCGATCAA TCGATGGACG CGGTGGCGCG GGTCGTGGTC AGCAACGTTT TGCACCGCTG AACTCCTGGC CGGATAACGT AAGCCTCGAT AAAGCGCGTC GCTTGTTGTG GCCAATCAAA CAGAAATATG GTCAGAAAAT CTCCTGGGCC GACCTGTTTA TCCTCGCGGG TAACGTGGCG CTGGAAAACT CCGGCTTCCG TACCTTCGGT TTTGGTGCCG GTCGTGAAGA CGTCTGGGAA CCGGATCTGG ATGTTAACTG GGGTGATGAA AAAGCCTGGC TGACTCACCG TCATCCGGAA GCGCTGGCGA AAGCACCGCT GGGTGCAACC GAGATGGGTC TGATTTACGT TAACCCGGAA GGCCCGGATC ACAGCGGCGA ACCGCTTTCT GCGGCAGCAG CTATCCGCGC GACCTTCGGC AACATGGGTA TGAACGACGA AGAAACCGTG GCGCTGATTG CGGGTGGTCA TACGCTGGGT AAAACCCACG GTGCTGGTCC GACATCAAAT GTAGGTCCTG ATCCAGAAGC TGCGCCGATT GAAGAACAAG GTTTAGGTTG GGCGAGCACT TACGGCAGCG GCGTTGGCGC AGATGCCATT ACCTCTGGTC TGGAAGTGGT CTGGACCCAG ACGCCGACCC AGTGGAGCAA CTATTTCTTC GAGAACCTGT TCAAGTATGA GTGGGTACAG ACCCGCAGTC CGGCTGGCGC AATCCAGTTC GAAGCGGTAG ACGCACCGGA AATTATCCCG GATCCGTTCG ATCCGTCGAA GAAACGTAAA CCGACGATGC TGGTGACCGA CCTGACGCTG CGTTTTGATC CTGAGTTCGA GAAGATCTCT CGTCGTTTCC TCAACGATCC GCAGGCGTTC AACGAAGCCT TTGCCCGTGC CTGGTTCAAA CTGACGCACA GGGATATGGG GCCGAAATCT CGTTACATCG GGCCGGAAGT ACCGAAAGAA GATCTGATCT GGCAAGATCC GCTGCCGCAG CCGATCTACA ACCCGACCGA GCAGGACATT ATCGATCTGA AATTCGCGAT TGCGGATTCT GGTCTGTCTG TTAGTGAGCT GGTATCGGTG GCCTGGGCTT CTGCTTCTAC CTTCCGTGGT GGCGACAAAC GCGGTGGTGC CAACGGTGCG CGTCTGGCAT TAATGCCGCA GCGCGACTGG GATGTGAACG CCGCAGCCGT TCGTGCTCTG CCTGTTCTGG AGAAAATTCA GAAAGAGTCT GGTAAAGCCT CGCTGGCGGA TATCATCGTG CTGGCAGGTG TGGTTGGCGT TGAGAAAGCC GCAAGCGCCG CTGGTTTGAG CATTCATGTA CCGTTTGCGC CGGGTCGCGT TGATGCGCGT CAGGATCAGA CTGACATTGA GATGTTCGAA CTGCTGGAGC CAATTGCTGA CGGTTTCCGT AACTATCGCG CTCGTCTGGA TGTTTCCACC ACCGAGTCAC TGCTGATTGA TAAAGCCCAG CAACTGACCC TGACCGCGCC GGAAATGACC GCGCTGGTGG GCGGGATGCG TGTACTGGGT GCCAACTTCG ATGGCAGCAA AAACGGCGTC TTCACTGACC GCGTTGGCGT ATTGAGCAAT GACTTCTTCG TGAACTTGCT GGATATGCGT TACGAGTGGA AAGCGACCGA CGAATCGAAA GAGCTGTTCG AAGGCCGTGA CCGCGAAACC GGCGAAGTGA AATACACTGC CAGCCGTGCG GATCTGGTAT TTGGTTCTAA CTCTGTCCTG CGTGCGGTGG CTGAAGTTTA CGCCAGCAGC GATGCTCACG AGAAGTTTGT TAAAGACTTC GTGGCGGCAT GGGTGAAAGT GATGAACCTC GACCGTTTCG ACCTGCTGTA A
|
Protein sequence | MSTSDDIHNT TATGKCPFHQ GGHDQSAGAG TTTRDWWPNQ LRVDLLNQHS NRSNPLGEDF DYRKEFSKLD YYGLKKDLKA LLTESQPWWP ADWGSYAGLF IRMAWHGAGT YRSIDGRGGA GRGQQRFAPL NSWPDNVSLD KARRLLWPIK QKYGQKISWA DLFILAGNVA LENSGFRTFG FGAGREDVWE PDLDVNWGDE KAWLTHRHPE ALAKAPLGAT EMGLIYVNPE GPDHSGEPLS AAAAIRATFG NMGMNDEETV ALIAGGHTLG KTHGAGPTSN VGPDPEAAPI EEQGLGWAST YGSGVGADAI TSGLEVVWTQ TPTQWSNYFF ENLFKYEWVQ TRSPAGAIQF EAVDAPEIIP DPFDPSKKRK PTMLVTDLTL RFDPEFEKIS RRFLNDPQAF NEAFARAWFK LTHRDMGPKS RYIGPEVPKE DLIWQDPLPQ PIYNPTEQDI IDLKFAIADS GLSVSELVSV AWASASTFRG GDKRGGANGA RLALMPQRDW DVNAAAVRAL PVLEKIQKES GKASLADIIV LAGVVGVEKA ASAAGLSIHV PFAPGRVDAR QDQTDIEMFE LLEPIADGFR NYRARLDVST TESLLIDKAQ QLTLTAPEMT ALVGGMRVLG ANFDGSKNGV FTDRVGVLSN DFFVNLLDMR YEWKATDESK ELFEGRDRET GEVKYTASRA DLVFGSNSVL RAVAEVYASS DAHEKFVKDF VAAWVKVMNL DRFDLL
|
| |