Gene SeHA_C4441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4441 
SymbolkatG 
ID6491545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4323966 
End bp4326146 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content58% 
IMG OID642744523 
Productcatalase/peroxidase HPI 
Protein accessionYP_002048112 
Protein GI194451701 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.0233722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGA CCGACGATAC CCATAACACG TTATCCACTG GAAAATGTCC TTTCCATCAG 
GGGGGGCATG ACCGAAGCGC AGGCGCAGGG ACTGCCAGCC GCGACTGGTG GCCGAACCAG
CTTCGCGTGG ATCTTTTGAA TCAACATTCC AACCGTTCTA ACCCGCTGGG TGAAGACTTT
GACTACCGCA AAGAGTTTAG CAAGTTAGAC TACTCCGCCC TGAAAGGGGA TCTCAAGGCG
CTGCTGACCG ATTCACAACC GTGGTGGCCC GCCGACTGGG GCAGCTATGT CGGTTTGTTT
ATTCGTATGG CCTGGCATGG CGCTGGCACC TACCGTTCTA TTGATGGTCG TGGCGGCGCG
GGTCGTGGTC AACAGCGTTT TGCGCCGCTT AACTCCTGGC CGGATAACGT CAGCCTGGAT
AAGGCGCGTC GTTTGTTGTG GCCGATTAAG CAGAAATATG GCCAGAAAAT TTCCTGGGCT
GACCTGTTTA TTCTGGCGGG TAACGTGGCG CTGGAAAACT CCGGCTTCCG TACCTTCGGT
TTCGGCGCCG GGCGTGAAGA TGTCTGGGAA CCGGATCTGG ATGTGAACTG GGGCGATGAA
AAAGCCTGGT TGACTCACCG ACACCCTGAA GCGCTGGCAA AAGCGCCGCT GGGCGCGACC
GAGATGGGCC TTATCTACGT TAACCCGGAA GGGCCGGATC ACAGCGGCGA ACCACTTTCT
GCCGCCGCCG CCATTCGCGC TACCTTTGGC AATATGGGGA TGAACGACGA AGAAACCGTG
GCGTTGATCG CTGGCGGGCA TACCCTCGGT AAAACCCACG GCGCGGCAGC GGCATCCCAT
GTAGGGGCCG ATCCGGAAGC CGCGCCGATT GAAGCGCAGG GCTTAGGTTG GGCCAGCAGC
TATGGTAGCG GCGTTGGCGC GGATGCTATC ACCTCCGGGC TGGAAGTGGT CTGGACGCAG
ACGCCGACCC AGTGGAGCAA CTATTTCTTC GAGAACCTGT TCAAATATGA GTGGGTACAA
ACCCGTAGTC CGGCTGGCGC TATCCAGTTT GAAGCGGTAG ACGCGCCGGA TATCATCCCG
GACCCGTTCG ATCCGTCGAA AAAACGTAAG CCAACCATGC TGGTCACCGA CCTGACGCTG
CGTTTTGATC CGGAGTTCGA GAAGATTTCC CGTCGTTTCC TTAACGATCC GCAGGCCTTT
AATGAAGCCT TTGCTCGTGC CTGGTTCAAA CTGACGCACA GAGATATGGG ACCAAAAGCG
CGTTACATCG GACCGGAAGT CCCGAAAGAA GATCTGATCT GGCAGGACCC GTTGCCGCAA
CCGCTCTATC AGCCAACGCA GGAAGACATT ATCAACCTGA AAGCGGCGAT CGCTGCATCC
GGGCTTTCTA TTAGCGAGAT GGTTTCGGTT GCCTGGGCAT CCGCGTCTAC TTTCCGCGGC
GGCGATAAGC GTGGCGGCGC TAACGGCGCG CGTCTGGCAT TAGCGCCTCA GCGCGACTGG
GATGTCAACG CCGTTGCGGC TCGCGTTCTG CCGGTATTAG AAGAGATCCA GAAAACGACG
AATAAAGCCT CGCTGGCCGA TATTATTGTG CTGGCGGGCG TGGTCGGTAT CGAGCAGGCG
GCCGCTGCTG CGGGTGTCAG CATCAGCGTA CCTTTTGCGC CGGGCCGGGT GGATGCGCGT
CAGGATCAGA CCGACATTGA GATGTTCTCG CTGCTTGAAC CGATTGCCGA TGGATTCCGT
AACTATCGTG CGCGTCTGGA TGTGTCGACG ACCGAATCGC TGTTGATTGA TAAAGCGCAG
CAGTTAACGT TGACCGCGCC GGAAATGACG GTACTGGTTG GCGGGATGCG TGTGCTGGGA
ACCAACTTTG ACGGCAGCCA GAACGGTGTC TTTACCGACA GACCGGGCGT GCTCAGCACT
GACTTCTTCG CTAATCTGCT GGATATGCGT TACGAGTGGA AGCCCACCGA CGACGCTAAT
GAGCTGTTCG AAGGCCGGGA TCGTCTGACT GGCGAGGTAA AATACACGGC GACCCGCGCC
GATCTGGTGT TTGGTTCCAA CTCCGTACTG CGCGCGCTGG CGGAAGTTTA CGCGTGTAGC
GATGCGCACG AGAAGTTTGT GAAGGACTTC GTCGCGGCAT GGGTGAAAGT GATGAACCTG
GACCGTTTCG ATCTGCAATA A
 
Protein sequence
MSTTDDTHNT LSTGKCPFHQ GGHDRSAGAG TASRDWWPNQ LRVDLLNQHS NRSNPLGEDF 
DYRKEFSKLD YSALKGDLKA LLTDSQPWWP ADWGSYVGLF IRMAWHGAGT YRSIDGRGGA
GRGQQRFAPL NSWPDNVSLD KARRLLWPIK QKYGQKISWA DLFILAGNVA LENSGFRTFG
FGAGREDVWE PDLDVNWGDE KAWLTHRHPE ALAKAPLGAT EMGLIYVNPE GPDHSGEPLS
AAAAIRATFG NMGMNDEETV ALIAGGHTLG KTHGAAAASH VGADPEAAPI EAQGLGWASS
YGSGVGADAI TSGLEVVWTQ TPTQWSNYFF ENLFKYEWVQ TRSPAGAIQF EAVDAPDIIP
DPFDPSKKRK PTMLVTDLTL RFDPEFEKIS RRFLNDPQAF NEAFARAWFK LTHRDMGPKA
RYIGPEVPKE DLIWQDPLPQ PLYQPTQEDI INLKAAIAAS GLSISEMVSV AWASASTFRG
GDKRGGANGA RLALAPQRDW DVNAVAARVL PVLEEIQKTT NKASLADIIV LAGVVGIEQA
AAAAGVSISV PFAPGRVDAR QDQTDIEMFS LLEPIADGFR NYRARLDVST TESLLIDKAQ
QLTLTAPEMT VLVGGMRVLG TNFDGSQNGV FTDRPGVLST DFFANLLDMR YEWKPTDDAN
ELFEGRDRLT GEVKYTATRA DLVFGSNSVL RALAEVYACS DAHEKFVKDF VAAWVKVMNL
DRFDLQ