Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0880 |
Symbol | supH |
ID | 5595433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 887095 |
End bp | 887910 |
Gene Length | 816 bp |
Protein Length | 271 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640920052 |
Product | sugar phosphatase SupH |
Protein accession | YP_001457619 |
Protein GI | 157160301 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 74 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTAA AAGTTATCGT CACAGATATG GACGGTACTT TTCTTAACAA CGTCAAAACG TACAACCAAC CACGTTTTAT GGCGCAATAT CAGGAACTGA AAAAGCGCGG CATTGAGTTC GTTGTCGCCA GCGGTAATCA GTATTACCAG CTTATTTCAT TCTTTCCTGA GCTAAAGGAT GAGATCTCTT TTGTCGCGGA AAACGGCGCA CTGGTTTACG AACATGGCAA GCAACTGTTC CACGGCGAAC TGACCCGACA TGAATCGCGG ATTGTTATTG GCGAGTTGCT AAAAGATAAG CAACTCAATT TTGTCGCCTG CGGTCTGCAA AGTGCATATG TCAGCGAAAA CGCCCCCGAA GCATTCGTCG CACTGATGGC AAAACACTAC CATCGCCTGA AACCTGTAAA AGATTATCAG GAGATTGACG ACGTACTGTT CAAGTTTTCG CTCAACCTGC CGGATGAACA AATCCCGTTA GTGATCGACA AACTGCACGT AGCGCTCGAT GGCATTATGA AACCCGTCAC CAGTGGTTTT GGCTTTATCG ACCTGATTAT TCCCGGTCTA CATAAAGCAA ACGGTATTTC GCGGTTACTG AAACGCTGGG ATCTGTCACC GCAAAATGTG GTAGCGATTG GCGACAGCGG TAACGATGCG GAGATGCTGA AAATGGCGCG TTATTCCTTT GCGATGGGCA ATGCTGCGGA AAACATTAAA CAAATCGCCC GTTACGCTAC CGATGATAAT AATCATGAAG GCGCGCTGAA TGTGATTCAG GCTGTGCTGG ATAACACATC CCCTTTTAAC AGCTGA
|
Protein sequence | MSVKVIVTDM DGTFLNNVKT YNQPRFMAQY QELKKRGIEF VVASGNQYYQ LISFFPELKD EISFVAENGA LVYEHGKQLF HGELTRHESR IVIGELLKDK QLNFVACGLQ SAYVSENAPE AFVALMAKHY HRLKPVKDYQ EIDDVLFKFS LNLPDEQIPL VIDKLHVALD GIMKPVTSGF GFIDLIIPGL HKANGISRLL KRWDLSPQNV VAIGDSGNDA EMLKMARYSF AMGNAAENIK QIARYATDDN NHEGALNVIQ AVLDNTSPFN S
|
| |