Gene Information |
Locus tag | ECH74115_3722 |
Symbol | purM |
ID | 6967492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3441457 |
End bp | 3442494 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387515 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_002271968 |
Protein GI | 209395985 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
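The coordinate and length fields in this record are related by simple arithmetic, sketched below. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3441457, 3442494)
print(length, protein_length_aa(length))  # prints "1038 345"
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.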
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.108743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACCGATA AAACCTCTCT TAGCTACAAA GATGCCGGTG TTGATATTGA CGCGGGTAAT GCTCTGGTTG GAAGAATCAA AGGCGTAGTG AAGAAAACGC GTCGTCCGGA AGTGATGGGC GGTCTGGGCG GCTTCGGTGC GCTGTGTGCA TTGCCGCAAA AATATCGTGA ACCCGTGCTG GTTTCCGGCA CTGACGGCGT AGGTACCAAG CTGCGTCTGG CAATGGACTT AAAACGTCAC GACACCATTG GTATTGATCT GGTCGCCATG TGCGTTAATG ACCTGGTGGT GCAAGGTGCA GAACCGCTGT TTTTCCTCGA CTATTACGCA ACCGGAAAAC TGGATGTTGA TACCGCTTCA GCGGTGATCA GCGGCATTGC GGAAGGTTGT CTGCAATCGG GCTGTTCACT GGTGGGTGGC GAAACGGCAG AAATGCCGGG GATGTATCAC GGTGAGGATT ACGATGTCGC GGGTTTCTGC GTGGGCGTGG TAGAAAAATC AGAAATCATC GACGGCTCTA AAGTCAGCGA CGGCGATGTG CTGATTGCAC TCGGTTCCAG CGGTCCGCAC TCGAACGGCT ATTCGCTGGT GCGCAAAATT CTTGAAGTCA GCGGTTGTGA TCCGCAAACC ACCGAACTTG ATGGTAAGCC ATTAGCCGAT CATCTGCTGG CACCGACCCG CATTTACGTG AAGTCAGTGC TGGAGTTGAT TGAAAAGGTC GATGTTAATG CCATTGCGCA CCTGACCGGC GGCGGCTTTT GGGAAAACAT TCCGCGCGTA TTGCCAGATA ATACCCAGGC AGTGATTGAT GAATCCTCCT GGCAGTGGCC GGAAGTGTTC AACTGGCTGC AAACGGCTGG TAACGTTGAG CGCCATGAAA TGTATCGCAC CTTCAACTGC GGCGTCGGGA TGATTATTGC CCTGCCTGCT CCGGAAGTGG ACAAAGCCCT CGCCCTGCTC AATGCCAACG GTGAAAACGC GTGGAAAATC GGTATCATCA AAGCCTCTGA TTCCGAACAA CGCGTGGTTA TCGAATAA
Protein sequence | MTDKTSLSYK DAGVDIDAGN ALVGRIKGVV KKTRRPEVMG GLGGFGALCA LPQKYREPVL VSGTDGVGTK LRLAMDLKRH DTIGIDLVAM CVNDLVVQGA EPLFFLDYYA TGKLDVDTAS AVISGIAEGC LQSGCSLVGG ETAEMPGMYH GEDYDVAGFC VGVVEKSEII DGSKVSDGDV LIALGSSGPH SNGYSLVRKI LEVSGCDPQT TELDGKPLAD HLLAPTRIYV KSVLELIEKV DVNAIAHLTG GGFWENIPRV LPDNTQAVID ESSWQWPEVF NWLQTAGNVE RHEMYRTFNC GVGMIIALPA PEVDKALALL NANGENAWKI GIIKASDSEQ RVVIE
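The relationship between the gene sequence, the GC content field, and the protein sequence can be checked with a short script. The sketch below assumes the sequence is pasted as whitespace-separated blocks, as in this record, and translates it with NCBI translation table 11 (bacterial, archaeal and plant plastid code). Table 11 shares the standard amino acid assignments and allows alternative start codons such as GTG, which are read as methionine at the first position. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses these.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an alternative start codon still encodes Met
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks a codon with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGACCGATA AAACCTCTCT TAGCTACAAA"))  # prints "MTDKTSLSYK"
```

Passing the full gene sequence to `gc_percent` and `translate_cds` allows comparison with the GC content and protein sequence fields of the record.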