Gene Information |
Locus tag | ECH74115_0424 |
Symbol | mhpC |
ID | 6970159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 433438 |
End bp | 434304 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384476 |
Product | 2-hydroxy-6-ketonona-2,4-dienedioic acid hydrolase |
Protein accession | YP_002268990 |
Protein GI | 209399853 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
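The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
# Hypothetical helpers; assumes 1-based inclusive coordinates.
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
GENE_START, GENE_END = 433438, 434304
length_bp = gene_length(GENE_START, GENE_END)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the listed Gene Length (867 bp) and Protein Length (288 aa).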
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTTATC AGCCACAAAC CGAAGCCGCC ACCAGCCGTT TTCTGAATGT AGAAGAAGCG GGTAAAACGC TGCGCATCCA TTTTAATGAC TGCGGACAAG GCGACGAAAC CGTTGTCCTG CTGCATGGTT CCGGCCCGGG TGCTACTGGC TGGGCGAACT TCAGCCGCAA TATCGATCCG CTGGTAGAGG CGGGCTATCG GGTGATCCTG CTGGATTGTC CGGGTTGGGG CAAGAGCGAT TCGATCGTTA ATAGTGGTTC GCGATCGGAT CTTAATGCAC GAATCCTGAA AAGCGTGGTG GATCAACTGG ATATCGCCAA AATCCACCTG CTGGGCAACT CGATGGGTGG CCATAGTTCT GTGGCGTTCA CCCTTAACTG GCCGGAGCGC GTCGGCAAAC TGGTGCTGAT GGGCGGCGGT ACGGGCGGCA TGAGTTTGTT TACGCCGATG CCAACCGAAG GTATTAAGCG ACTGAATCAG CTTTATCGTC AGCCGACTAT CGAAAACCTG AAGCTGATGA TGGATATCTT CGTTTTTGAT ACCAGCGATT TGACCGACGC CCTGTTTGAA GCGCGCCTGA ATAATATGCT GTCGCGCCGC GATCACCTGG AAAACTTCGT TAAGAGCCTG GAAGCTAATC CGAAACAGTT CCCGGATTTT GGCCCACGTC TGGCGGAAAT CAAAGCGCAA ACCCTGATTG TCTGGGGGCG CAACGACCGC TTTGTGCCGA TGGATGCGGG TCTGCGTCTG CTGTCCGGCA TTGCCGGTTC TGAACTGCAT ATCTTCCGCG ACTGTGGGCA TTGGGCGCAG TGGGAACATG CCGACGCTTT CAATCAACTG GTGCTGAATT TCCTCGCACG TGCTTAA
Protein sequence | MSYQPQTEAA TSRFLNVEEA GKTLRIHFND CGQGDETVVL LHGSGPGATG WANFSRNIDP LVEAGYRVIL LDCPGWGKSD SIVNSGSRSD LNARILKSVV DQLDIAKIHL LGNSMGGHSS VAFTLNWPER VGKLVLMGGG TGGMSLFTPM PTEGIKRLNQ LYRQPTIENL KLMMDIFVFD TSDLTDALFE ARLNNMLSRR DHLENFVKSL EANPKQFPDF GPRLAEIKAQ TLIVWGRNDR FVPMDAGLRL LSGIAGSELH IFRDCGHWAQ WEHADAFNQL VLNFLARA
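As a consistency check between the two sequences, the sketch below translates the first 30 and last 27 nucleotides of the gene sequence and compares them with the ends of the protein sequence. It is an illustration under stated assumptions, not the database's own pipeline: the codon table is built by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Amino-acid assignments in TCAG order; identical for the standard code
# and translation table 11 (they differ only in allowed start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last three blocks of the gene sequence shown above.
prefix = "ATGAGTTATC AGCCACAAAC CGAAGCCGCC"
suffix = "GTGCTGAATT TCCTCGCACG TGCTTAA"
print(translate(prefix), translate(suffix))
```

The translated fragments match the first ten residues (MSYQPQTEAA) and the last eight residues (VLNFLARA) of the listed protein sequence.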