Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0871 |
Symbol | ybjI |
ID | 6147160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 877948 |
End bp | 878763 |
Gene Length | 816 bp |
Protein Length | 271 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641615759 |
Product | phosphatase YbjI |
Protein accession | YP_001742951 |
Protein GI | 170682272 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.464954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTA AATTAATTGC GGTAGACATG GATGGTACTT TCTTAAGCGA TCAAAAAACC TATAACCGTG AGCGGTTTAT GGCTCAGTAT CAGCAAATGA AAGCACAAGG CATTCGCTTT GTGGTCGCCA GCGGGAATCA ATATTATCAG TTGATCTCTT TCTTTCCGGA AATTGCTAAT GAAATAGCCT TTGTGGCTGA AAACGGCGGC TGGGTAGTGA GCGAAGGCAA AGATGTTTTT AATGGCGAGC TGTCGAAGGA TGCGTTTGCT ACTGTCGTGG AACATTTGCT GACGCGCCCG GAAGTGGAAA TTATTGCCTG CGGAAAAAAT AGCGCCTATA CCCTCAAAAA ATATGACGAT GCCATGAAAA CGGTGGCGGA AATGTATTAT CACCGTCTGG AATACGTCGA TAACTTTGAC AACTTAGAAG ATATCTTCTT TAAGTTTGGT CTGAATCTTT CCGATGAACT GATTCCACAA GTACAAAAAG CATTACATGA GGCCATCGGC GATATTATGG TGCCGGTCCA CACCGGCAAC GGCAGCATCG ATCTGATTAT CCCCGGCGTA CATAAAGCCA ATGGCCTTCG CCAACTGCAG AAATTATGGG GAATAGACGA CAGCGAAGTG GTGGTCTTTG GCGATGGCGG TAACGATATT GAGATGCTGC GTCAGGCAGG CTTTAGTTTT GCAATGGAAA ATGCCGGCAG CGCGGTCGTC GCAGCAGCAA AATACCGGGC AGGCTCCAAT AACCGTGAAG GCGTACTGGA TGTGATCGAT AAAGTTCTTA AACACGAAGC GCCATTTAAC CAATAA
|
Protein sequence | MSIKLIAVDM DGTFLSDQKT YNRERFMAQY QQMKAQGIRF VVASGNQYYQ LISFFPEIAN EIAFVAENGG WVVSEGKDVF NGELSKDAFA TVVEHLLTRP EVEIIACGKN SAYTLKKYDD AMKTVAEMYY HRLEYVDNFD NLEDIFFKFG LNLSDELIPQ VQKALHEAIG DIMVPVHTGN GSIDLIIPGV HKANGLRQLQ KLWGIDDSEV VVFGDGGNDI EMLRQAGFSF AMENAGSAVV AAAKYRAGSN NREGVLDVID KVLKHEAPFN Q
|
| |