Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2873 |
Symbol | purM |
ID | 6270291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2670921 |
End bp | 2671958 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641726817 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001881290 |
Protein GI | 187730324 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.636308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGATA AAACCTCTCT TAGCTACAAA GATGCCGGTG TTGATATTGA CGCGGGTAAT GCTCTGGTTG GAAGAATCAA AGGCGTAGTG AAGAAAACGC GTCGTCCGGA AGTGATGGGC GGTCTGGGCG GCTTCGGTGC GCTGTGTGCA TTGCCGCAAA AATATCGTGA ACCCGTGCTG GTTTCCGGCA CTGACGGCGT AGGTACCAAG CTGCGTCTGG CAATGGACTT AAAACGTCAC GACACCATTG GTATTGATCT GGTCGCCATG TGCGTTAATG ACCTGGTGGT GCAAGGTGCA GAACCGCTGT TTTTCCTCGA CTATTACGCA ACCGGAAAAC TGGATGTTGA TACCGCTTCA GCGGTGATCA GCGGCATCGC GGAAGGTTGT CTGCAATCGG GCTGTTCACT GGTGGGTGGC GAAACGGCAG AAATGCCGGG GATGTATCAC GGTGAAGATT ACGATGTCGC GGGTTTCTGC GTGGGCGTGG TAGAAAAATC AGAAATCATC GACGGCTCTA AAGTCAGCGA CGGCGATGTT CTGATTGCAC TCGGTTCCAG CGGTCCGCAC TCGAACGGCT ATTCGCTGGT GCGCAAAATT CTTGAAGTCA GCGGTTGTGA TCCGCAAACC ACCGAACTTG ATGGTAAGCC ATTAGCCGAT CATCTGCTGG CACCGACCCG CATTTACGTG AAGTCAGTGC TGGAGTTGAT TGAAAAGGTC GATGTGCATG CCATTGCGCA CCTGACCGGC GGCGGCTTCT GGGAAAACAT TCCACGCGTA TTGCCAGATA ATACTCAGGC AGTGATTGAT GAATCTTCCT GGCAGTGGCC GGAAGTGTTC AACTGGCTGC AAACGGCAGG TAACGTTGAG CGCCATGAAA TGTATCGCAC CTTCAACTGC GGCGTCGGGA TGATTATTGC CCTACCTGCT CCGGAAGTGG ACAAAGCCCT CGCCCTGCTC AATGCCAACG GTGAAAACGC GTGGAAAATC GGTATCATCA AAGCCTCTGA TTCCGAACAA CGCGTGGTTA TCGAATAA
|
Protein sequence | MTDKTSLSYK DAGVDIDAGN ALVGRIKGVV KKTRRPEVMG GLGGFGALCA LPQKYREPVL VSGTDGVGTK LRLAMDLKRH DTIGIDLVAM CVNDLVVQGA EPLFFLDYYA TGKLDVDTAS AVISGIAEGC LQSGCSLVGG ETAEMPGMYH GEDYDVAGFC VGVVEKSEII DGSKVSDGDV LIALGSSGPH SNGYSLVRKI LEVSGCDPQT TELDGKPLAD HLLAPTRIYV KSVLELIEKV DVHAIAHLTG GGFWENIPRV LPDNTQAVID ESSWQWPEVF NWLQTAGNVE RHEMYRTFNC GVGMIIALPA PEVDKALALL NANGENAWKI GIIKASDSEQ RVVIE
|
| |