Gene EcSMS35_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1459 
SymbolkatE 
ID6146689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1442759 
End bp1445020 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content52% 
IMG OID641616337 
Producthydroperoxidase II 
Protein accessionYP_001743517 
Protein GI170680521 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.401999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.332659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAAC ATAACGAAAA GAACCCACAT CAGCACCAGT CACCACTACA CGATTCCAGC 
GAAGCGAAAC CGGGGATGGA CTCACTGGCA CCTGAAGACG GATCTCATCG TCCAGCGGCT
GAACCAACAC CGCCTGGTGC ACAACCTACC GCCCCAGGGA GCCTGAAAGC CCCTGATACG
CGTAACGAAA AACTTAATTC TCTGGAAGAC GTACGCAAAG GCAGTGAAAA TTATGCGCTG
ACCACTAATC AGGGCGTGCG CATCGCCGAC GATCAAAACT CACTGCGTGG CGGTAGCCGT
GGTCCAACGC TGCTGGAAGA TTTTATTCTG CGCGAGAAAA TCACCCACTT TGACCATGAG
CGCATCCCGG AACGTATTGT TCATGCACGT GGATCAGCCG CTCACGGTTA TTTCCAGCCA
TATAAAAGCT TAAGCGATAT TACCAAAGCG GATTTCCTCT CAGATCCGAA CAAAATCACC
CCAGTATTTG TACGTTTCTC TACCGTTCAG GGTGGTGCTG GCTCTGCCGA TACCGTGCGT
GATATCCGTG GCTTTGCCAC CAAGTTCTAT ACCGAAGAGG GTATTTTTGA CCTTGTTGGC
AATAACACGC CAATCTTCTT TATCCAGGAT GCGCATAAAT TCCCTGATTT TGTGCATGCG
GTAAAACCAG AACCACACTG GGCAATCCCG CAGGGGCAAA GCGCTCACGA CACCTTCTGG
GATTATGTTT CCCTGCAACC GGAAACACTG CACAACGTAA TGTGGGCGAT GTCGGATCGT
GGTATCCCGC GCAGCTACCG CACTATGGAA GGCTTTGGTA TTCATACCTT CCGTCTGATT
AACGCCGAAG GTAAAGCGAC GTTCGTACGT TTCCACTGGA AACCACTGGC AGGTAAAGCC
TCACTCGTTT GGGATGAAGC ACAAAAACTA ACCGGACGTG ACCCGGACTT CCACCGCCGC
GAGTTGTGGG AAGCGATTGA AGCCGGCGAT TTTCCGGAAT ACGAACTGGG CTTCCAGTTG
ATTCCTGAGG AAGATGAATT TAAGTTCGAC TTCGACCTTC TTGATCCAAC TAAACTTATC
CCGGAAGAAC TGGTGCCCGT TCAGCGTGTC GGCAAAATGG TGCTCAATCG CAATCCGGAT
AACTTCTTTG CCGAAAACGA ACAAGCAGCA TTCCATCCAG GTCATATTGT TCCTGGTCTG
GACTTCACCA ACGATCCGCT GTTGCAGGGG CGTTTGTTCT CTTATACCGA TACACAAATC
AGTCGTCTTG GCGGACCAAA TTTCCATGAG ATTCCGATTA ACCGCCCGAC CTGCCCTTAC
CATAATTTCC AGCGTGACGG CATGCATCGT ATGGGGATCG ACACTAACCC GGCGAATTAT
GAACCGAACT CGATCAACGA TAACTGGCCG CGCGAAACAC CGCCGGGGCC GAAACGCGGC
GGTTTTGAAT CATACCAGGA GCGCGTGGAA GGCAATAAAG TTCGCGAGCG CAGCCCATCG
TTTGGCGAAT ATTATTCCCA TCCGCGTCTG TTCTGGCTAA GTCAGACGCC ATTCGAGCAG
CGCCATATTG TCGATGGTTT CAGTTTTGAG TTAAGCAAAG TAGTTCGTCC GTATATTCGT
GAGCGCGTTG TTGACCAGCT GGCACATATT GATCTCACTC TGGCCCAGGC GGTGGCGAAA
AATCTCGGTA TCGAGCTGAC GGACGACCAG CTGAATATCA CCCCGCCTCC GGACGTCAAC
GGTCTGAAAA AGGATCCATC CTTAAGTTTG TACGCCATTC CTGACGGTGA TGTGAAAGGT
CGCGTGGTAG CGATTTTACT TAATGATGAA GTGAGATCGG CAGACCTTCT GGCCATTCTC
AAGGCGCTGA AGGCCAAAGG CGTTCATGCC AAACTGCTCT ACTCCCGAAT GGGTGAAGTG
ACTGCGGATG ACGGTACGGT GCTGCCTATA GCCGCTACAT TTGCCGGAGC GCCTTCGCTG
ACGGTCGATG CGGTTATTGT CCCTTGCGGC AATATCGCGG ATATCGCTGA CAACGGCGAT
GCCAACTACT ACCTGATGGA AGCCTACAAA CACCTTAAAC CGATTGCGCT GGCAGGAGAC
GCGCGCAAGT TTAAAGTAAG AATCAAGGTC GCTGATCAGG GTGAAGAAGG GATTGTGGAA
GCTGACAGCG CCGATGGTAG TTTTATGGAT GAACTGTTAA CGCTGATGGC AGCACACCGC
GTGTGGTCAC GCATTCCTAA GATTGACAAA ATTCCGGCGT AA
 
Protein sequence
MSQHNEKNPH QHQSPLHDSS EAKPGMDSLA PEDGSHRPAA EPTPPGAQPT APGSLKAPDT 
RNEKLNSLED VRKGSENYAL TTNQGVRIAD DQNSLRGGSR GPTLLEDFIL REKITHFDHE
RIPERIVHAR GSAAHGYFQP YKSLSDITKA DFLSDPNKIT PVFVRFSTVQ GGAGSADTVR
DIRGFATKFY TEEGIFDLVG NNTPIFFIQD AHKFPDFVHA VKPEPHWAIP QGQSAHDTFW
DYVSLQPETL HNVMWAMSDR GIPRSYRTME GFGIHTFRLI NAEGKATFVR FHWKPLAGKA
SLVWDEAQKL TGRDPDFHRR ELWEAIEAGD FPEYELGFQL IPEEDEFKFD FDLLDPTKLI
PEELVPVQRV GKMVLNRNPD NFFAENEQAA FHPGHIVPGL DFTNDPLLQG RLFSYTDTQI
SRLGGPNFHE IPINRPTCPY HNFQRDGMHR MGIDTNPANY EPNSINDNWP RETPPGPKRG
GFESYQERVE GNKVRERSPS FGEYYSHPRL FWLSQTPFEQ RHIVDGFSFE LSKVVRPYIR
ERVVDQLAHI DLTLAQAVAK NLGIELTDDQ LNITPPPDVN GLKKDPSLSL YAIPDGDVKG
RVVAILLNDE VRSADLLAIL KALKAKGVHA KLLYSRMGEV TADDGTVLPI AATFAGAPSL
TVDAVIVPCG NIADIADNGD ANYYLMEAYK HLKPIALAGD ARKFKVRIKV ADQGEEGIVE
ADSADGSFMD ELLTLMAAHR VWSRIPKIDK IPA