Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4390 |
Symbol | katG |
ID | 6144291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4480020 |
End bp | 4482200 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619211 |
Product | catalase/peroxidase HPI |
Protein accession | YP_001746335 |
Protein GI | 170682728 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.75327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.428476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGT CAGACGATAT CCATAACACG ACAGCCACTG GCAAATGCCC GTTCCATCAG GGCGGTCACG ACCAGAGCGC GGGTGGGGGC ACAACCACGC GCGACTGGTG GCCAAATCAA CTCCGTGTTG ACCTGTTAAA CCAACATTCT AATCGTTCTA ACCCGCTGGG TGAGGACTTT GACTACCGCA AAGAATTCAG CAAATTAGAT TACTACGGCC TGAAAAAAGA TCTGAAAGCC CTGCTGACAG AATCTCAGCC GTGGTGGCCT GCCGACTGGG GCAGCTATGC CGGTCTGTTT ATTCGTATGG CCTGGCACGG CGCGGGTACT TACCGTTCCA TCGACGGACG CGGCGGCGCG GGTCGTGGTC AGCAACGTTT TGCACCGCTG AACTCCTGGC CGGATAACGT AAGCCTCGAT AAAGCGCGTC GCCTGCTGTG GCCAATCAAA CAGAAATATG GTCAGAAAAT CTCCTGGGCC GACCTGTTTA TCCTCGCGGG TAACGTGGCG CTGGAAAACT CCGGCTTCCG TACCTTCGGT TTTGGTGCCG GTCGTGAAGA CGTCTGGGAA CCGGATCTGG ATGTTAACTG GGGTGATGAA AAAGCCTGGC TGACCCACCG TCATCCGGAA GCACTGGCGA AAGCACCGCT GGGCGCAACC GAGATGGGCC TGATTTACGT TAACCCGGAA GGCCCGGATC ACAGCGGCGA ACCGCTTTCT GCGGCAGCAG CTATCCGCGC GACCTTCGGC AACATGGGCA TGAACGACGA AGAAACCGTG GCGCTGATTG CGGGTGGTCA TACGCTGGGT AAAACTCACG GTGCCGGTCC GACATCAAAT GTAGGTCCTG ATCCAGAAGC TGCACCGATT GAAGAACAAG GTTTAGGCTG GGCGAGCACT TACGGCAGCG GCGTTGGCGC AGATGCCATC ACCTCTGGTC TGGAAGTGGT CTGGACCCAG ACACCGACCC AGTGGAGCAA CTATTTCTTC GAGAACCTGT TCAAATATGA GTGGGTACAG ACCCGCAGCC CGGCTGGCGC AATCCAGTTT GAAGCGGTAG ACGCACCGGA AATTATCCCG GATCCGTTCG ATCCGTCGAA GAAACGTAAA CCGACAATGC TGGTGACCGA CCTGACGCTG CGTTTTGATC CTGAGTTCGA GAAGATTTCT CGTCGTTTCC TCAACGATCC GCAGGCGTTC AACGAAGCCT TTGCCCGTGC CTGGTTCAAA CTGACGCACA GGGATATGGG GCCGAAATCT CGCTACATCG GGCCGGAAGT ACCGAAAGAA GATCTGATCT GGCAAGATCC TCTGCCGCAG CCGATCTACA ACCCGACCGA GCAGGACATT ATCGATCTGA AATTCGCGAT TGCGGATTCT GGTCTGTCTG TCAGCGAGCT GGTATCGGTG GCCTGGGCGT CTGCTTCTAC CTTCCGTGGT GGCGACAAAC GTGGTGGTGC CAATGGTGCA CGTCTGGCAT TAATGCCGCA GCGCGACTGG GATGTGAACG CCGCAGCCGT TCGTGCTCTG CCAGTTCTGG AGAAAATTCA GAAAGAGTCT GGTAAAGCCT CGCTGGCAGA TATCATCGTG CTGGCTGGTG TGGTTGGTGT TGAGAAAGCC GCAAGCGCCG CAGGTTTGAG CATTCATGTA CCGTTTGCGC CGGGTCGCGT TGATGCGCGT CAGGATCAGA CTGACATTGA GATGTTCGAA CTGCTGGAGC CAATTGCTGA CGGTTTCCGT AACTATCGCG CTCGTCTGGA CGTTTCCACC ACCGAGTCAC TGCTGATTGA TAAAGCCCAG CAACTGACCC TGACCGCGCC GGAAATGACT GCGCTGGTGG GCGGGATGCG CGTACTGGGT GCCAACTTCG ATGGCAGCAA AAACGGCGTC TTCACTGACC GCGTTGGCGT ATTGAGCAAT GACTTCTTCG TGAACTTGCT GGATATGCGT TACGAGTGGA AAGCGACCGA CGAATCGAAA GAGCTGTTCG AAGGCCGTGA CCGCGAAACC GGCGAAGTGA AATACACAGC CAGCCGTGCG GATCTGGTCT TTGGTTCTAA CTCTGTCCTG CGTGCGGTGG CGGAAGTTTA CGCCAGCAGC GATGCCCACG AGAAGTTTGT TAAAGACTTC GTGGCGGCAT GGGTGAAAGT GATGAACCTC GACCGTTTCG ACCTGCTGTA A
|
Protein sequence | MSTSDDIHNT TATGKCPFHQ GGHDQSAGGG TTTRDWWPNQ LRVDLLNQHS NRSNPLGEDF DYRKEFSKLD YYGLKKDLKA LLTESQPWWP ADWGSYAGLF IRMAWHGAGT YRSIDGRGGA GRGQQRFAPL NSWPDNVSLD KARRLLWPIK QKYGQKISWA DLFILAGNVA LENSGFRTFG FGAGREDVWE PDLDVNWGDE KAWLTHRHPE ALAKAPLGAT EMGLIYVNPE GPDHSGEPLS AAAAIRATFG NMGMNDEETV ALIAGGHTLG KTHGAGPTSN VGPDPEAAPI EEQGLGWAST YGSGVGADAI TSGLEVVWTQ TPTQWSNYFF ENLFKYEWVQ TRSPAGAIQF EAVDAPEIIP DPFDPSKKRK PTMLVTDLTL RFDPEFEKIS RRFLNDPQAF NEAFARAWFK LTHRDMGPKS RYIGPEVPKE DLIWQDPLPQ PIYNPTEQDI IDLKFAIADS GLSVSELVSV AWASASTFRG GDKRGGANGA RLALMPQRDW DVNAAAVRAL PVLEKIQKES GKASLADIIV LAGVVGVEKA ASAAGLSIHV PFAPGRVDAR QDQTDIEMFE LLEPIADGFR NYRARLDVST TESLLIDKAQ QLTLTAPEMT ALVGGMRVLG ANFDGSKNGV FTDRVGVLSN DFFVNLLDMR YEWKATDESK ELFEGRDRET GEVKYTASRA DLVFGSNSVL RAVAEVYASS DAHEKFVKDF VAAWVKVMNL DRFDLL
|
| |