Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3363 |
Symbol | purH |
ID | 4885173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 3298054 |
End bp | 3299619 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640129290 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001060373 |
Protein GI | 126440154 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAA GCGCTGTCCG CGCTCGGCGT CAAGCTGCTG TCGACGGGCG GCACCGCGAA ACTGCTCGCC GACGCGGGCC TGCCCGTCAC CGAAGTGGCC GACTACACCG GCTTCCCGGA AATGCTCGAT GGGCGCGTGA AGACGCTGCA CCCGAAGGTG CACGGCGGCA TCCTCGCCCG CCGCGACCTG CCCGAGCACA TGCAGGCGCT CGAAGCGCAC GGGATTCCGA CGATCGACCT GCTCGTCGTG AACCTGTATC CGTTCGTCCA GACGATTGCG AAGGACGACT GCACGCTCGC CGACGCGATC GAGAACATCG ACATCGGCGG CCCGACGATG CTGCGCTCGG CGGCGAAGAA CCACCGCGAC GTGACGGTCG TCGTCGACCC GGCCGATTAC GCGGTCGTGC TCGACGAGAT GAAAGCGAAC GGCAACACGC TCGGCTACAA GACGAATTTC CGCCTCGCGA CCAAGGTGTT CGCGCACACC GCGCAGTACG ACGGCGCGAT CACGAACTAC CTGACGAGCC TCGGCGACGA TCTGCAGCAC GGCTCGCGCA GCGCATACCC GGCAACGCTG AACCTCGCGT TCGACAAGGT GCAGGACCTG CGCTACGGCG AGAATCCGCA CCAGAGCGCC GCGTTCTACC GCGACGTCGC GACGCCGGCC GGCGCGCTCG CGAACTACCG CCAGTTGCAG GGCAAGGAAC TGTCGTACAA CAACATCGCC GATTCGGACG CCGCGTGGGA ATGCGTGAAG ACGTTCGACG CGCCGGCGTG CGTGATCATC AAGCACGCGA ATCCGTGCGG CGTCGCGGTG GGCGCGGACG CGGGCGAAGC GTACGCGAAG GCGTTCCAGA CCGATCCGAC CTCCGCGTTC GGCGGCATCA TCGCGTTCAA CCGCGAAGTC GACGAGGCCG CGGCCCAGGC GGTCGCGAAG CAATTCGTCG AAGTGCTGAT CGCGCCGTCG TTCTCGGACG CGGCCAAGCA GGTGTTCGCG GCCAAGCAGA ACGTGCGCCT GCTCGAAATC GCGCTGGGCG AAGGCCATAA CGCGTTCGAT CTGAAGCGCG TGGGCGGCGG CCTGCTCGTG CAATCGCTCG ATTCGAAGAA CGTGCAGCCG CGCGAGCTGC GCGTCGTCAC GAAACGCCAC CCGACGCCGA AGGAAATGGA CGACCTCCTG TTCGCATGGC GCGTCGCGAA ATACGTGAAG TCGAACGCGA TCGTGTTCTG CGGCAACGGG ATGACGCTCG GCGTCGGCGC AGGCCAGATG AGCCGCGTCG ATTCGGCGCG CATCGCGAGC ATCAAGGCAC AGAACGCGGG CCTCACGCTC GCGGGCTCGG CCGTCGCGTC GGACGCGTTC TTCCCGTTCC GCGACGGTCT CGACGTCGTC GTCGCGGCGG GCGCGACCTG CGTGATCCAG CCAGGCGGCT CGGTGCGCGA CGACGAGGTG ATCGCCGCCG CCGACGAGCA CAACATCGCG ATGGTCGTGA CGGGCGTGCG CCACTTCCGT CACTGA
|
Protein sequence | MIKQALISVS DKTGIVDFAK ALSALGVKLL STGGTAKLLA DAGLPVTEVA DYTGFPEMLD GRVKTLHPKV HGGILARRDL PEHMQALEAH GIPTIDLLVV NLYPFVQTIA KDDCTLADAI ENIDIGGPTM LRSAAKNHRD VTVVVDPADY AVVLDEMKAN GNTLGYKTNF RLATKVFAHT AQYDGAITNY LTSLGDDLQH GSRSAYPATL NLAFDKVQDL RYGENPHQSA AFYRDVATPA GALANYRQLQ GKELSYNNIA DSDAAWECVK TFDAPACVII KHANPCGVAV GADAGEAYAK AFQTDPTSAF GGIIAFNREV DEAAAQAVAK QFVEVLIAPS FSDAAKQVFA AKQNVRLLEI ALGEGHNAFD LKRVGGGLLV QSLDSKNVQP RELRVVTKRH PTPKEMDDLL FAWRVAKYVK SNAIVFCGNG MTLGVGAGQM SRVDSARIAS IKAQNAGLTL AGSAVASDAF FPFRDGLDVV VAAGATCVIQ PGGSVRDDEV IAAADEHNIA MVVTGVRHFR H
|
| |