Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1317 |
Symbol | purM |
ID | 3849468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1477611 |
End bp | 1478666 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637840989 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_441864 |
Protein GI | 83721293 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.124371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCTC CGAAATCCGC TCCCGACGCC CAGGGTCTGT CCTACCGTGA CGCGGGCGTC GACATCGACG CGGGCGACGC GCTCGTCGAC AAGATCAAGC CCTTTGCGAA GAAAACGCTG CGCGACGGCG TGCTCGGCGG CATCGGCGGG TTCGGCGCGC TGTTCGAGGT GCCGAAGAAG TACCGGGAGC CCGTGCTCGT GTCGGGCACG GACGGCGTCG GCACGAAGCT CAAGCTCGCG TTTCATCTGA ACAAACACGA TACGGTCGGC CAGGATCTCG TCGCGATGAG CGTGAACGAC ATCCTCGTGC AGGGCGCCGA GCCGCTGTTC TTCCTCGACT ATTTCGCGTG CGGCAAGCTC GACGTCGAGA CGGCCGCGAC CGTCGTCAAG GGCATCGCGA CGGGCTGCGA GCTGGCGGGC TGCGCGCTGA TCGGCGGCGA GACGGCCGAG ATGCCGGGCA TGTACCCGGA CGGCGAGTAC GATCTCGCGG GCTTCGCGGT CGGCGCGGTC GAGAAGAGCA AGATCATCGA CGGCAGCGCG ATCGCCGAGG GCGACGTCGT GCTGGGCCTC GCGTCGAGCG GCATCCATTC GAACGGTTTC TCGCTCGTGC GCAAGATCAT CGAGCGCGCG AATCCGGACC TGTCTGCCGA TTTCCACGGC CGCTCGCTCG CCGACGCGCT GATGGCGCCG ACGCGCATCT ACGTGAAGCC GCTCCTCGCG CTGATGGAGA AGATCGCGGT GAAGGGGATG GCGCACATCA CGGGCGGCGG CCTCGTCGAG AACATTCCGC GCGTGCTGCG CGACGGCCTC ACGGCCGAAC TCGACCAGCG CGCGTGGCCG CTGCCGCCGC TCTTCCAGTG GCTGCAGCAG CACGGCGGCG TCGCCGATGC GGAGATGCAC CGCGTGTTCA ACTGCGGGAT CGGGATGGCC GTGATCGTGT CGGCCGCCGA TGCCGACGAA GCGCTCCGCC AGTTGACCGA AGCGGGCGAG CAGGTGTGGA AGATCGGCAC CGTGCGCGCA AGCCGCGAAG GCGAGGCGCA GACGGTCGTG GTCTGA
|
Protein sequence | MNPPKSAPDA QGLSYRDAGV DIDAGDALVD KIKPFAKKTL RDGVLGGIGG FGALFEVPKK YREPVLVSGT DGVGTKLKLA FHLNKHDTVG QDLVAMSVND ILVQGAEPLF FLDYFACGKL DVETAATVVK GIATGCELAG CALIGGETAE MPGMYPDGEY DLAGFAVGAV EKSKIIDGSA IAEGDVVLGL ASSGIHSNGF SLVRKIIERA NPDLSADFHG RSLADALMAP TRIYVKPLLA LMEKIAVKGM AHITGGGLVE NIPRVLRDGL TAELDQRAWP LPPLFQWLQQ HGGVADAEMH RVFNCGIGMA VIVSAADADE ALRQLTEAGE QVWKIGTVRA SREGEAQTVV V
|
| |