Gene Information |
Locus tag | ECH74115_0534 |
Symbol | cof |
ID | 6968343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 537062 |
End bp | 537880 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643384580 |
Product | hydrolase Cof |
Protein accession | YP_002269094 |
Protein GI | 209396101 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
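The coordinate and length fields in the table above are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 537062
END_BP = 537880
STOP_CODON_LENGTH = 3  # assumption: the CDS ends with exactly one stop codon


def gene_length_bp(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp):
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_LENGTH) // 3


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed Gene Length (819 bp) and Protein Length (272 aa).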
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.982324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGTC TGGCAGCATT TGATATGGAT GGCACTTTAT TGATGCCTGA CCATCATTTA GGTGAGAAAA CCCTCTCTAC TTTGGCGCGA CTGCGTGAAC GCGACATTAC CCTCACTTTT GCCACGGGGC GTCATGCGCT GGAGATGCAG CATATTCTCG GGGCGATATC GCTGGATGCG TATTTGATTA CCGGCAACGG AACGCGCGTG CATTCTCTGG AAGGTGAACT TTTACATCGT GATGATTTAC CTGCGGATGT CGCGGAGCTG GTGCTGTATC AGCAATGGGA TACCCGAGCC AGCATGCATA TTTTCAATGA CGACGGTTGG TTTACCGGGA AAGAGATCCC TGCGTTGTTG CAGGCATTTG TCTATAGCGG TTTTCGTTAT CAAATAATCG ATGTCAAAAA AATGCCACTC GGCAGCGTCA CCAAGATCTG CTTCTGTGGC GATCACGACG ATCTTACACG TTTGCAGATC CAGCTACACG AAGCATTAGG CGAGCGTGCA CATTTGTGTT TTTCCGCCAC GGATTGCCTT GAAGTGCTGC CGGTGGGCTG CAATAAAGGC GCTGCACTGA CGGTGCTGAC CCAACATTTA GGTTTATCGT TGCGCGATTG CATGGCCTTT GGTGATGCGA TGAACGATCG CGAAATGTTA GGCAGCGTCG GTAGCGGATT TATTATGGGC AATGCGATGC CGCAACTGCG CGCGGAGCTC CCGCATTTAC CGGTGATTGG ACATTGCCGA AATCAGGCTG TCTCTCACTA TTTGACGCAC TGGCTGGACT ATCCACATCT ACCTTATTCC CCCGAATAA
|
Protein sequence | MARLAAFDMD GTLLMPDHHL GEKTLSTLAR LRERDITLTF ATGRHALEMQ HILGAISLDA YLITGNGTRV HSLEGELLHR DDLPADVAEL VLYQQWDTRA SMHIFNDDGW FTGKEIPALL QAFVYSGFRY QIIDVKKMPL GSVTKICFCG DHDDLTRLQI QLHEALGERA HLCFSATDCL EVLPVGCNKG AALTVLTQHL GLSLRDCMAF GDAMNDREML GSVGSGFIMG NAMPQLRAEL PHLPVIGHCR NQAVSHYLTH WLDYPHLPYS PE