Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4441 |
Symbol | katG |
ID | 6491545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 4323966 |
End bp | 4326146 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642744523 |
Product | catalase/peroxidase HPI |
Protein accession | YP_002048112 |
Protein GI | 194451701 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.0233722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGA CCGACGATAC CCATAACACG TTATCCACTG GAAAATGTCC TTTCCATCAG GGGGGGCATG ACCGAAGCGC AGGCGCAGGG ACTGCCAGCC GCGACTGGTG GCCGAACCAG CTTCGCGTGG ATCTTTTGAA TCAACATTCC AACCGTTCTA ACCCGCTGGG TGAAGACTTT GACTACCGCA AAGAGTTTAG CAAGTTAGAC TACTCCGCCC TGAAAGGGGA TCTCAAGGCG CTGCTGACCG ATTCACAACC GTGGTGGCCC GCCGACTGGG GCAGCTATGT CGGTTTGTTT ATTCGTATGG CCTGGCATGG CGCTGGCACC TACCGTTCTA TTGATGGTCG TGGCGGCGCG GGTCGTGGTC AACAGCGTTT TGCGCCGCTT AACTCCTGGC CGGATAACGT CAGCCTGGAT AAGGCGCGTC GTTTGTTGTG GCCGATTAAG CAGAAATATG GCCAGAAAAT TTCCTGGGCT GACCTGTTTA TTCTGGCGGG TAACGTGGCG CTGGAAAACT CCGGCTTCCG TACCTTCGGT TTCGGCGCCG GGCGTGAAGA TGTCTGGGAA CCGGATCTGG ATGTGAACTG GGGCGATGAA AAAGCCTGGT TGACTCACCG ACACCCTGAA GCGCTGGCAA AAGCGCCGCT GGGCGCGACC GAGATGGGCC TTATCTACGT TAACCCGGAA GGGCCGGATC ACAGCGGCGA ACCACTTTCT GCCGCCGCCG CCATTCGCGC TACCTTTGGC AATATGGGGA TGAACGACGA AGAAACCGTG GCGTTGATCG CTGGCGGGCA TACCCTCGGT AAAACCCACG GCGCGGCAGC GGCATCCCAT GTAGGGGCCG ATCCGGAAGC CGCGCCGATT GAAGCGCAGG GCTTAGGTTG GGCCAGCAGC TATGGTAGCG GCGTTGGCGC GGATGCTATC ACCTCCGGGC TGGAAGTGGT CTGGACGCAG ACGCCGACCC AGTGGAGCAA CTATTTCTTC GAGAACCTGT TCAAATATGA GTGGGTACAA ACCCGTAGTC CGGCTGGCGC TATCCAGTTT GAAGCGGTAG ACGCGCCGGA TATCATCCCG GACCCGTTCG ATCCGTCGAA AAAACGTAAG CCAACCATGC TGGTCACCGA CCTGACGCTG CGTTTTGATC CGGAGTTCGA GAAGATTTCC CGTCGTTTCC TTAACGATCC GCAGGCCTTT AATGAAGCCT TTGCTCGTGC CTGGTTCAAA CTGACGCACA GAGATATGGG ACCAAAAGCG CGTTACATCG GACCGGAAGT CCCGAAAGAA GATCTGATCT GGCAGGACCC GTTGCCGCAA CCGCTCTATC AGCCAACGCA GGAAGACATT ATCAACCTGA AAGCGGCGAT CGCTGCATCC GGGCTTTCTA TTAGCGAGAT GGTTTCGGTT GCCTGGGCAT CCGCGTCTAC TTTCCGCGGC GGCGATAAGC GTGGCGGCGC TAACGGCGCG CGTCTGGCAT TAGCGCCTCA GCGCGACTGG GATGTCAACG CCGTTGCGGC TCGCGTTCTG CCGGTATTAG AAGAGATCCA GAAAACGACG AATAAAGCCT CGCTGGCCGA TATTATTGTG CTGGCGGGCG TGGTCGGTAT CGAGCAGGCG GCCGCTGCTG CGGGTGTCAG CATCAGCGTA CCTTTTGCGC CGGGCCGGGT GGATGCGCGT CAGGATCAGA CCGACATTGA GATGTTCTCG CTGCTTGAAC CGATTGCCGA TGGATTCCGT AACTATCGTG CGCGTCTGGA TGTGTCGACG ACCGAATCGC TGTTGATTGA TAAAGCGCAG CAGTTAACGT TGACCGCGCC GGAAATGACG GTACTGGTTG GCGGGATGCG TGTGCTGGGA ACCAACTTTG ACGGCAGCCA GAACGGTGTC TTTACCGACA GACCGGGCGT GCTCAGCACT GACTTCTTCG CTAATCTGCT GGATATGCGT TACGAGTGGA AGCCCACCGA CGACGCTAAT GAGCTGTTCG AAGGCCGGGA TCGTCTGACT GGCGAGGTAA AATACACGGC GACCCGCGCC GATCTGGTGT TTGGTTCCAA CTCCGTACTG CGCGCGCTGG CGGAAGTTTA CGCGTGTAGC GATGCGCACG AGAAGTTTGT GAAGGACTTC GTCGCGGCAT GGGTGAAAGT GATGAACCTG GACCGTTTCG ATCTGCAATA A
|
Protein sequence | MSTTDDTHNT LSTGKCPFHQ GGHDRSAGAG TASRDWWPNQ LRVDLLNQHS NRSNPLGEDF DYRKEFSKLD YSALKGDLKA LLTDSQPWWP ADWGSYVGLF IRMAWHGAGT YRSIDGRGGA GRGQQRFAPL NSWPDNVSLD KARRLLWPIK QKYGQKISWA DLFILAGNVA LENSGFRTFG FGAGREDVWE PDLDVNWGDE KAWLTHRHPE ALAKAPLGAT EMGLIYVNPE GPDHSGEPLS AAAAIRATFG NMGMNDEETV ALIAGGHTLG KTHGAAAASH VGADPEAAPI EAQGLGWASS YGSGVGADAI TSGLEVVWTQ TPTQWSNYFF ENLFKYEWVQ TRSPAGAIQF EAVDAPDIIP DPFDPSKKRK PTMLVTDLTL RFDPEFEKIS RRFLNDPQAF NEAFARAWFK LTHRDMGPKA RYIGPEVPKE DLIWQDPLPQ PLYQPTQEDI INLKAAIAAS GLSISEMVSV AWASASTFRG GDKRGGANGA RLALAPQRDW DVNAVAARVL PVLEEIQKTT NKASLADIIV LAGVVGIEQA AAAAGVSISV PFAPGRVDAR QDQTDIEMFS LLEPIADGFR NYRARLDVST TESLLIDKAQ QLTLTAPEMT VLVGGMRVLG TNFDGSQNGV FTDRPGVLST DFFANLLDMR YEWKPTDDAN ELFEGRDRLT GEVKYTATRA DLVFGSNSVL RALAEVYACS DAHEKFVKDF VAAWVKVMNL DRFDLQ
|
| |