Gene Information |
Locus tag | BURPS668_3267 |
Symbol | purM |
ID | 4883825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3202614 |
End bp | 3203669 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640129195 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001060278 |
Protein GI | 126442046 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
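The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3202614, 3203669)
print(length)                  # 1056
print(protein_length(length))  # 351
```

Both values agree with the Gene Length and Protein Length fields of this record.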
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.245483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCTC CGAAATCCGC TCCCGACGCC CAGGGCCTGT CCTACCGTGA CGCGGGCGTC GACATCGACG CGGGCGACGC GCTCGTCGAC AAGATCAAGC CCTTTGCGAA GAAAACGCTG CGCGACGGCG TGCTCGGCGG CATCGGCGGG TTCGGCGCGC TGTTCGAGGT GCCGAAGAAG TACCGGGAGC CCGTGCTCGT ATCGGGCACG GACGGCGTCG GCACGAAGCT CAAGCTCGCG TTTCATCTGA ACAAACACGA TACGGTCGGC CAGGATCTCG TCGCGATGAG CGTAAACGAC ATTCTCGTGC AGGGCGCCGA GCCGCTGTTC TTCCTCGACT ACTTCGCGTG CGGCCGTCTC GACGTCGAGA CGGCCGCGAC CGTCGTCAAG GGCATCGCGA CGGGCTGCGA GCTGGCGGGC TGCGCGCTGA TCGGCGGCGA GACGGCCGAG ATGCCGAGCA TGTACCCGGA CGGCGAATAC GATCTGGCGG GCTTCGCGGT CGGCGCGGTC GAGAAGAGCA AGATCATTGA CGGCAGCACG ATCGCCGAGG GCGACGTCGT GCTGGGCCTC GCGTCGAGCG GCATCCATTC GAACGGTTTC TCGCTCGTGC GCAAGATCAT CGAGCGCGCG AATCCGGACC TGTCGGCCGA TTTCCACGGC CGCTCGCTCG CCGACGCGCT GATGGCGCCG ACGCGCATCT ACGTGAAGCC GCTCCTCGCG CTGATGGAGA AGATCGCGGT GAAGGGAATG GCGCACATCA CGGGCGGCGG CCTCGTCGAG AACATTCCGC GCGTGCTGCG CGACGGCCTC ACGGCCGAAC TCGACCAGCA CGCATGGCCG CTGCCGCCGC TGTTCCAGTG GCTGCAGCAG CACGGCGGCG TCGCCGATGC GGAGATGCAC CGCGTGTTCA ACTGCGGGAT CGGGATGGCC GTGATCGTGT CGGCCGCCGA CGCGGACGAC GCGCTCCGCC AACTGGCCGA CGCGGGCGAG CAGGTATGGA AGATCGGCAC CGTGCGCGCG AGCCGCGAAG GCGAGGCGCA GACGGTCGTG GTCTGA
|
Protein sequence | MNPPKSAPDA QGLSYRDAGV DIDAGDALVD KIKPFAKKTL RDGVLGGIGG FGALFEVPKK YREPVLVSGT DGVGTKLKLA FHLNKHDTVG QDLVAMSVND ILVQGAEPLF FLDYFACGRL DVETAATVVK GIATGCELAG CALIGGETAE MPSMYPDGEY DLAGFAVGAV EKSKIIDGST IAEGDVVLGL ASSGIHSNGF SLVRKIIERA NPDLSADFHG RSLADALMAP TRIYVKPLLA LMEKIAVKGM AHITGGGLVE NIPRVLRDGL TAELDQHAWP LPPLFQWLQQ HGGVADAEMH RVFNCGIGMA VIVSAADADD ALRQLADAGE QVWKIGTVRA SREGEAQTVV V
|
| |
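The gene and protein sequences above can be checked against each other with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). It also assumes the gene sequence is already given in the coding direction, which is consistent with it beginning with ATG even though the gene lies on the minus strand. All names are illustrative.

```python
from itertools import product

# Codon order follows the usual TCAG layout: first base varies slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence listed above.
print(translate("ATGAATCCTC CGAAATCCGC TCCCGACGCC"))  # MNPPKSAPDA
print(translate("GTCGTG GTCTGA"))                     # VVV*
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.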