Gene Information |
Locus tag | ECH74115_1959 |
Symbol | |
ID | 6969976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1850863 |
End bp | 1851651 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385885 |
Product | AP endonuclease, family 2 |
Protein accession | YP_002270374 |
Protein GI | 209400921 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0839995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAG GCACACAGAA TCAGGCGTTC TTCCCGGAAA ATATTCTGGA GAAATTTCGT TATATCAAAG AGATGGGCTT CGATGGTTTT GAGATTGACG GCAAACTGCT GGTCAACAAC ATCGAAGAAG TCAAATCGGC GATCAAAGAA ACCGGTTTAC CGGTGACCAC CGCCTGCGGC GGCTATGACG GCTGGATTGG TGACTTTATC GAAGAGCGTC GTCTTAATGG CTTAAAGCAG ATCGAACGCA TTCTCGAAGC GCTGGCAGAA GTGGGGGGTA AAGGTATCGT CGTTCCGGCA GCGTGGGGCA TGTTTACCTT CCGCTTACCG CCGATGACCT CGCCGCGTAG CCTGGACGGC GACCGCAAAA TGGTGAGTGA TTCCCTGCGC GTACTGGAAC AGGTCGCCGC GCGTACCGGA ACCGTGGTGT ATCTCGAACC GTTAAACCGC TATCAGGATC ATATGATCAA CACCCTCGCC GATGCTCGTC GTTACAGCGT CGAAAACAAT CTTAAACATG TGCAGATTAT CGGCGATTTC TATCACATGA ACATCGAAGA AGATAACCTG GCGCAGGCGC TGCATGACAA CCGCGACTTG CTCGGTCATG TGCATATTGC CGATAACCAT CGTTACCAGC CGGGCAGCGG CACCCTCGAT TTCCACGCGC TGTTTGAACA GCTGCGCGCC GATAACTATC AGGGTTATGT GGTGTATGAA GGGCGTATCC GGGCAGAAGA TCCTGCCCAG GCGTACCGTG ATTCGTTGAC CTGGTTGCGT ACCTGCTAA
|
Protein sequence | MKIGTQNQAF FPENILEKFR YIKEMGFDGF EIDGKLLVNN IEEVKSAIKE TGLPVTTACG GYDGWIGDFI EERRLNGLKQ IERILEALAE VGGKGIVVPA AWGMFTFRLP PMTSPRSLDG DRKMVSDSLR VLEQVAARTG TVVYLEPLNR YQDHMINTLA DARRYSVENN LKHVQIIGDF YHMNIEEDNL AQALHDNRDL LGHVHIADNH RYQPGSGTLD FHALFEQLRA DNYQGYVVYE GRIRAEDPAQ AYRDSLTWLR TC
|
| |
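The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length (the listed gene sequence ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1850863
END_BP = 1851651
REPORTED_GENE_LENGTH = 789      # bp
REPORTED_PROTEIN_LENGTH = 262   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count, assuming the CDS includes one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, the coordinates give a 789 bp gene, and 789 / 3 = 263 codons less one stop codon gives the 262 residues reported.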