Gene Information |
Locus tag | EcE24377A_2782 |
Symbol | purM |
ID | 5589552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2771595 |
End bp | 2772632 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640926434 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001463821 |
Protein GI | 157158807 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
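The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2771595
END_BP = 2772632
GENE_LENGTH_BP = 1038
PROTEIN_LENGTH_AA = 345


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1038
print(expected_protein_length(GENE_LENGTH_BP))  # 345
```

Both results agree with the Gene Length and Protein Length fields in the record.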
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.171997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGATA AAACCTCTCT TAGCTACAAA GATGCCGGTG TTGATATTGA CGCGGGTAAT GCTCTGGTTG GAAGAATCAA AGGCGTAGTG AAGAAAACGC GTCGTCCGGA AGTGATGGGC GGTCTGGGCG GCTTCGGTGC GCTGTGTGCA TTGCCGCAAA AATATCGTGA ACCTGTGCTG GTTTCCGGTA CTGACGGCGT AGGTACCAAG CTGCGTCTGG CGATGGACTT AAAACGTCAC GACACCATTG GTATTGATCT GGTCGCCATG TGCGTTAATG ACCTGGTGGT GCAAGGTGCA GAGCCGCTGT TTTTCCTCGA CTATTACGCA ACCGGAAAAC TGGATGTTGA TACCGCTTCA GCGGTGATCA GCGGTATCGC GGAAGGATGT CTGCAATCGG GCTGTTCACT GGTGGGTGGC GAAACGGCAG AAATGCCGGG GATGTATCAC GGTGAGGATT ACGATGTCGC GGGTTTCTGC GTTGGCGTGG TAGAAAAATC AGAAATCATC GACGGCTCTA AAGTCAGCGA CGGCGATGTG CTGATTGCAC TCGGTTCCAG CGGTCCGCAC TCGAACGGCT ATTCGCTGGT GCGCAAAATT CTTGAAGTCA GCGGTTGTGA TCCGCAAACC ACCGAACTTG ATGGTAAGCC ATTAGCCGAT CATCTGCTGG CACCGACCCG CATTTACGTG AAGTCAGTGC TGGAGTTGAT TGAAAAGGTC GATGTGCATG CCATTGCGCA CCTGACCGGC GGCGGCTTCT GGGAAAACAT TCCACGCGTA TTGCCAGATA ATACTCAGGC AGTGATTGAT GAATCTTCCT GGCAGTGGCC GGAAGTGTTC AACTGGCTGC AAACGGCAGG TAACGTTGAG CGCCATGAAA TGTATCGCAC CTTCAACTGC GGCGTCGGGA TGATTATCGC CCTGCCTGCT CCGGAAGTGG ACAAAGCCCT CGCCCTGCTA AACTCCAACG GTGAAAACGC GTGGAAAATC GGTATCATCA AAGCCTCTGA TTCCGAACAA CGCGTGGTTA TCGAATAA
|
Protein sequence | MTDKTSLSYK DAGVDIDAGN ALVGRIKGVV KKTRRPEVMG GLGGFGALCA LPQKYREPVL VSGTDGVGTK LRLAMDLKRH DTIGIDLVAM CVNDLVVQGA EPLFFLDYYA TGKLDVDTAS AVISGIAEGC LQSGCSLVGG ETAEMPGMYH GEDYDVAGFC VGVVEKSEII DGSKVSDGDV LIALGSSGPH SNGYSLVRKI LEVSGCDPQT TELDGKPLAD HLLAPTRIYV KSVLELIEKV DVHAIAHLTG GGFWENIPRV LPDNTQAVID ESSWQWPEVF NWLQTAGNVE RHEMYRTFNC GVGMIIALPA PEVDKALALL NSNGENAWKI GIIKASDSEQ RVVIE
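The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 uses the standard codon assignments and accepts GTG (among others) as an initiator that is read as methionine, which is consistent with the record: the gene begins with GTG while the protein begins with M. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical and are not taken from the source database, and how the database rounds its GC content field is not stated in the record.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiator codons assumed for translation table 11 (bacterial).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; an initiator first codon is read as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGACCGATA AAACCTCTCT TAGCTACAAA"))  # MTDKTSLSYK
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the Protein sequence and GC content fields to be compared in the same way.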