Gene Information |
Locus tag | ECH74115_0997 |
Symbol | ybjI |
ID | 6971722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1013966 |
End bp | 1014781 |
Gene Length | 816 bp |
Protein Length | 271 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643385013 |
Product | phosphatase YbjI |
Protein accession | YP_002269513 |
Protein GI | 209399927 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
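
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS includes its stop codon (the listed gene sequence ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1013966, 1014781)
print(length)                           # 816
print(expected_protein_length(length))  # 271
```

Both values agree with the Gene Length (816 bp) and Protein Length (271 aa) fields in this record.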
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.100252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTA AATTAATTGC GGTAGACATG GATGGTACTT TCTTAAGCGA TCAAAAAACC TATAACCGTG AGCGGTTTAT GGCTCAGTAT CAGCAAATGA AAGCACAAGG CATTCGCTTT GTGGTCGCCA GCGGGAATCA ATATTATCAG TTGATCTCTT TCTTCCCTGA AATTGCTAAT GAAATTGCCT TTGTGGCTGA AAACGGCGGC TGGGTAGTGA GTGAAGGCAA AGATGTTTTT AATGGCGAGC TGTCGAAGGA TGCGTTTACT ACTGTCGTGG AACATTTGCT GACGCGCCCG GAAGTGGAAA TTATTGCCTG CGGAAAAAAT AGCGCCTATA CGCTCAAAAA ATATGACGAT GCCATGAAAA TGGTGGCGGA AATGTATTAT CACCGTCTGG AATACGTCGA TAACTTTGAC AACTTAGAAG ATATCTTCTT TAAGTTTGGC CTGAATCTTT CCGATGAACT GATTCCACAA GTACAAAAAA CATTACATGA GGCCATCGGC GATATTATGG TGCCGGTCCA CACCGGCAAC GGCAGCATCG ATCTGATTAT CCCCGGCGTA CATAAAGCCA ATGGCCTTCG CCAACTGCAG AAATTATGGG GAATAGACGA CAGCGAAGTG GTGGTCTTTG GCGATGGCGG TAACGATATT GAGATGCTGC GTCAGGCAGG CTTTAGTTTT GCAATGGAAA ATGCCGGCAA CGCGGTCGTC GCAGCGGCAA AATACCGGGC AGGCTCCAAT AACCGTGAAG GCGTACTGGA TGTGATCGAT AAAGTTCTTA AACACGAAGC GCCATTTGAC CAATAA
|
Protein sequence | MSIKLIAVDM DGTFLSDQKT YNRERFMAQY QQMKAQGIRF VVASGNQYYQ LISFFPEIAN EIAFVAENGG WVVSEGKDVF NGELSKDAFT TVVEHLLTRP EVEIIACGKN SAYTLKKYDD AMKMVAEMYY HRLEYVDNFD NLEDIFFKFG LNLSDELIPQ VQKTLHEAIG DIMVPVHTGN GSIDLIIPGV HKANGLRQLQ KLWGIDDSEV VVFGDGGNDI EMLRQAGFSF AMENAGNAVV AAAKYRAGSN NREGVLDVID KVLKHEAPFD Q
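
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the GC helper is only a way to compare a computed value with the listed GC content.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAGCATTA AATTAATTGC GGTAGACATG"
print(translate(prefix))  # MSIKLIAVDM
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the Protein sequence and GC content fields of this record.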