Gene Information |
Locus tag | SbBS512_E4429 |
Symbol | katG |
ID | 6272183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4134418 |
End bp | 4136598 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641728226 |
Product | catalase/peroxidase HPI |
Protein accession | YP_001882639 |
Protein GI | 187730298 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
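The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the record above.
start_bp = 4134418
end_bp = 4136598

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a non-zero remainder would indicate a problem.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (2181 bp) and Protein Length (726 aa) fields.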
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGT CAGACGATAT CCATAACACG ACAGCCACTG GCAAATGCCC GTTCCATCAG GGCGGTCACG ACCAGAGCGC GGGGGCGGGC ACAACCACTC GCGACTGGTG GCCAAATCAA CTCCGTGTTG ACCTGTTAAA CCAACATTCT AATCGTTCTA ACCCGCTGGG TGAGGACTTT GACTACCGCA AAGAATTCAG CAAATTAGAT TACTACGGCC TGAAAAAAGA TCTGAAAGCC CTGCTGACAG AATCTCAGCC GTGGTGGCCA GCCGACTGGG GCAGCTACGC CGGTCTGTTT ATTCGTATGG CCTGGCACGG CGCAGGTACT TACCGTTCAA TCGATGGACG CGGCGGCGCG GGTCGTGGTC AGCAACGTTT TGCACCGCTG AACTCCTGGC CGGATAACGT AAGCCTCGAT AAAGCGCGTC GCCTGTTGTG GCCAATCAAA CAGAAATATG GTCAGAAAAT CTCCTGGGCC GACCTGTTTA TCCTCGCAGG TAACGTGGCG CTGGAAAACT CCGGCTTCCG TACCTTCGGT TTTGGTGCCG GTCGTGAAGA CGTCTGGGAA CCGGATCTGG ATGTTAACTG GGGTGATGAA AAAGCCTGGC TGACTCACCG TCATCCGGAA GCGCTGGCGA AAGCACCGCT GGGTGCAACC GAGATGGGTC TGATTTACGT TAACCCGGAA GGCCCGGATC ACAGCGGCGA ACCGCTTTCT GCGGCAGCAG CTATCCGTGC GACCTTCGGC AACATGGGCA TGAACGACGA AGAAACCGTG GCGCTGATTG CGGGTGGTCA TACGCTGGGT AAAACTCACG GTGCCGGTCC GACATCAAAT GTAGGTCCTG ATCCAGAAGC TGCACCGATT GAAGAACAAG GTTTAGGCTG GGCGAGCACT TACGGCAGCG GCGTTGGCGC AGATGCCATT ACCTCTGGTC TGGAAGTTGT CTGGACCCAG ACGCCGACCC AGTGGAGCAA CTATTTCTTC GAGAACCTGT TCAAATATGA GTGGGTACAG ACCCGCAGTC CGGCTGGAGC AATCCAGTTC GACGCGGTAG ACGCACCGGA AATTATCCCG GATCCGTTCG ATCCGTCGAA GAAACGTAAA CCGACAATGC TGGTGACCGA CCTGACGCTG CGTTTTGATC CTGAGTTCGA GAAGATCTCT CGTCGTTTCC TCAACGATCC GCAGGCGTTC AACGAAGCCT TTGCCCGTGC CTGGTTCAAA CTGACGCACA GGGATATGGG GCCGAAATCT CGCTACATCG GGCCGGAAGT GCCGAAAGAA GATCTGATCT GGCAAGATCC GCTGCCGCAG CCGATCTACA ACCCGACCGA GCAGGACATT ATCGATCTGA AATTCGCGAT TGCGGATTCT GGTCTGTCTG TTAGTGAGCT GGTATCGGTG GCCTGGGCAT CTGCTTCTAC CTTCCGTGGT GGCGACAAAC GCGGTGGTGC CAACGGTGCG CGTCTGGCAT TAATGCCGCA GCGCGACTGG GATGTGAACG CCGCAGCCGT TCGTGCTCTG CCTGTTCTGG AGAAAATCCA GAAAGAGTCT GGTAAAGCCT CGCTGGCGGA TATCATCGTG CTGGCTGGTG TGGTTGGTGT TGAGAAAGCC GCAAGCGCCG CAGGTTTGAG CATTCATGTA CCGTTTGCGC CGGGTCGCGT TGATGCGCGT CAGGATCAGA CTGACATTGA GATGTTTGAG CTGCTGGAGC CAATTGCTGA CGGTTTCCGT AACTATCGCG CTCGTCTGGA CGTTTCCACC ACCGAGTCAC TGCTGATCGA CAAAGCACAG 
CAACTGACGC TGACCGCGCC GGAAATGACT GCGCTGGTGG GCGGGATGCG TGTACTGGGT GCCAACTTCG ATAGCAGCAA AAACGGCGTC TTCACTGACC GCGTTGGCGT ATTGAGCAAT GACTTCTTCG TGAACTTGCT GGATATGCGT TACGAGTGGA AAGCGACTGA CGAATCGAAA GAGCTGTTCG AAGGACGTGA CCGCGAAACC GGCGAAGTGA AATACACAGC CAGCCGTGCG GATCTGGTAT TTGGTTCTAA CTCTGTCCTG CGTGCGGTAG CGGAAGTTTA CGCCAGCAGC GATGCCCACG AGAAGTTTGT TAAAGACTTC GTAGCGGCAT GGGTGAAAGT GATGAACCTC GACCGTTTCG ACCTGCTGTA A
|
Protein sequence | MSTSDDIHNT TATGKCPFHQ GGHDQSAGAG TTTRDWWPNQ LRVDLLNQHS NRSNPLGEDF DYRKEFSKLD YYGLKKDLKA LLTESQPWWP ADWGSYAGLF IRMAWHGAGT YRSIDGRGGA GRGQQRFAPL NSWPDNVSLD KARRLLWPIK QKYGQKISWA DLFILAGNVA LENSGFRTFG FGAGREDVWE PDLDVNWGDE KAWLTHRHPE ALAKAPLGAT EMGLIYVNPE GPDHSGEPLS AAAAIRATFG NMGMNDEETV ALIAGGHTLG KTHGAGPTSN VGPDPEAAPI EEQGLGWAST YGSGVGADAI TSGLEVVWTQ TPTQWSNYFF ENLFKYEWVQ TRSPAGAIQF DAVDAPEIIP DPFDPSKKRK PTMLVTDLTL RFDPEFEKIS RRFLNDPQAF NEAFARAWFK LTHRDMGPKS RYIGPEVPKE DLIWQDPLPQ PIYNPTEQDI IDLKFAIADS GLSVSELVSV AWASASTFRG GDKRGGANGA RLALMPQRDW DVNAAAVRAL PVLEKIQKES GKASLADIIV LAGVVGVEKA ASAAGLSIHV PFAPGRVDAR QDQTDIEMFE LLEPIADGFR NYRARLDVST TESLLIDKAQ QLTLTAPEMT ALVGGMRVLG ANFDSSKNGV FTDRVGVLSN DFFVNLLDMR YEWKATDESK ELFEGRDRET GEVKYTASRA DLVFGSNSVL RAVAEVYASS DAHEKFVKDF VAAWVKVMNL DRFDLL
|
| |
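The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 bases of the gene. This is a minimal sketch, not the database's own method: the `translate` helper and the constant names are hypothetical, and the codon table is built from the standard genetic code, whose amino acid assignments translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in T, C, A, G order.
# Translation table 11 uses the same assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_30 = "ATGAGCACGT CAGACGATAT CCATAACACG"
print(translate(first_30))  # MSTSDDIHNT
```

The result matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same helper should, under the same assumptions, yield the full protein.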