Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2647 |
Symbol | purM |
ID | 6144428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2705492 |
End bp | 2706529 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617518 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001744683 |
Protein GI | 170680721 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGATA AAACCTCTCT TAGCTACAAA GATGCCGGTG TTGATATTGA CGCGGGTAAT GCTCTGGTTG GAAGAATCAA AGGCGTAGTG AAGAAAACGC GTCGTCCGGA AGTGATGGGC GGTCTGGGCG GCTTCGGTGC GCTGTGTGCA TTGCCGCAAA AATATCGTGA ACCTGTGCTG GTTTCCGGCA CTGACGGCGT AGGTACCAAG CTGCGTCTGG CGATGGACTT AAAACGTCAC GACACCATTG GTATTGATCT GGTCGCCATG TGCGTTAATG ACCTGGTGGT GCAAGGTGCG GAACCGCTGT TTTTCCTCGA CTATTACGCA ACCGGAAAAC TGGATGTTGA TACCGCTTCA GCGGTGATCA GCGGCATCGC GGAAGGTTGT CTGCAATCGG GCTGTTCACT GGTGGGTGGC GAAACGGCAG AAATGCCGGG GATGTATCAC GGTGAAGATT ACGATGTCGC TGGTTTCTGC GTTGGCGTGG TAGAAAAATC AGAAATCATT GACGGATCTA AAGTCAGCGA CGGCGATGTG CTGATTGCAC TCGGCTCCAG CGGTCCGCAC TCGAACGGCT ATTCGCTGGT GCGCAAAATT CTTGAAGTCA GCGGTTGTGA TCCGCAAACC ACCGAACTTG ATGGTAAGCC ATTAGCCGAT CATCTGCTGG CACCGACCCG CATTTACGTG AAGTCAGTGC TGGAGTTGAT TGAAAAGGTC GATGTGCATG CCATTGCGCA CCTGACCGGC GGCGGCTTCT GGGAAAACAT TCCGCGCGTA TTGCCAGATA ATACCCAGGC AGTGATTGAT GAATCTTCCT GGCAGTGGCC GGAAGTGTTC AACTGGCTGC AAACGGCAGG TAACGTTGAG CGCCATGAAA TGTATCGCAC CTTCAACTGC GGCGTCGGTA TGATTATTGC CCTTCCTGCT CCGGAAGTGG ACAAAGCCCT CGCCCTGCTC AATGCCAACG GTGAAAACGC GTGGAAAATC GGTATCATCA AAGCCTCTGA TTCCGAACAA CGCGTGGTTA TCGAATAA
|
Protein sequence | MTDKTSLSYK DAGVDIDAGN ALVGRIKGVV KKTRRPEVMG GLGGFGALCA LPQKYREPVL VSGTDGVGTK LRLAMDLKRH DTIGIDLVAM CVNDLVVQGA EPLFFLDYYA TGKLDVDTAS AVISGIAEGC LQSGCSLVGG ETAEMPGMYH GEDYDVAGFC VGVVEKSEII DGSKVSDGDV LIALGSSGPH SNGYSLVRKI LEVSGCDPQT TELDGKPLAD HLLAPTRIYV KSVLELIEKV DVHAIAHLTG GGFWENIPRV LPDNTQAVID ESSWQWPEVF NWLQTAGNVE RHEMYRTFNC GVGMIIALPA PEVDKALALL NANGENAWKI GIIKASDSEQ RVVIE
|
| |