Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0971 |
Symbol | supH |
ID | 6968462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 986808 |
End bp | 987623 |
Gene Length | 816 bp |
Protein Length | 271 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643384991 |
Product | sugar phosphatase SupH |
Protein accession | YP_002269491 |
Protein GI | 209398068 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.526488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTAA AAGTTATCGT CACAGATATG GACGGTACTT TTCTTAACGA CGTCAAAACG TACAACCAAC CACGTTTTAT GGCGCAATAT CAGGAACTGA AAAAGCGCGG CATTGAGTTC GTTGTCGCCA GCGGTAATCA GTATTACCAG CTTATTTCAT TCTTTCCTGA GCTAAAGGAT GAGATCTCTT TTGTCGCGGA AAACGGCGCA CTGGTTTACG AACATGGCAA GCAACTGTTC CACGGCGAAC TGACCCGACA TGAATCGCGG ATTGTTATTG GCGAGTTGCT AAAAGATAAG CAACTCAATT TTGTCGCCTG CGGTCTGCAA AGTGCATATG TCAGCGAAAA TGCCCCCGAA GCATTTGTCG CACTGATGGC AAAACACTAC CATCGCCTGA AACCTGTAAA AGATTATCAG GAGATTGACG ACGTACTGTT CAAGTTTTCA CTCAACCTGC CGGATGAACA AATCCCGTTA GTGATCGACA AACTGCACAT AGCGCTCGAT GGCATTATGA AACCCGTCAC CAGTGGTTTT GGCTTTATCG ACCTGATTAT TCCCGGTCTA CATAAAGCAA ACGGTATTTC GCGGTTACTG AAACGCTGGG ATCTGTCACC GCAAAATGTG GTAGCGATTG GCGACAGCGG TAACGATGCG GAGATGCTGA AAATGGCGCG TTATTCCTTT GCGATGGGCA ATGCTGCGGA AAACATTAAA CAAATCGCCC GTTACGCTAC CGATGATAAT AATCATGAAG GCGCGCTGAA TGTGATTCAG GCTGTACTGG ATAACACATC CCCTTTTAAC AGCTGA
|
Protein sequence | MSVKVIVTDM DGTFLNDVKT YNQPRFMAQY QELKKRGIEF VVASGNQYYQ LISFFPELKD EISFVAENGA LVYEHGKQLF HGELTRHESR IVIGELLKDK QLNFVACGLQ SAYVSENAPE AFVALMAKHY HRLKPVKDYQ EIDDVLFKFS LNLPDEQIPL VIDKLHIALD GIMKPVTSGF GFIDLIIPGL HKANGISRLL KRWDLSPQNV VAIGDSGNDA EMLKMARYSF AMGNAAENIK QIARYATDDN NHEGALNVIQ AVLDNTSPFN S
|
| |