Gene BMA2356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA2356 
SymbolpurH 
ID3090465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006348 
Strand
Start bp2457886 
End bp2459451 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content67% 
IMG OID637562980 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_103914 
Protein GI53726197 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAA 
GCGCTGTCCG CGCTCGGCGT CAAGCTGCTG TCGACGGGCG GCACCGCGAA ACTGCTCGCC
GACGCGGGCC TGCCCGTCAC CGAAGTGGCC GACTACACCG GCTTCCCGGA AATGCTCGAT
GGGCGCGTGA AGACGCTGCA CCCGAAGGTG CACGGCGGCA TCCTCGCCCG CCGCGACCTG
CCCGAGCACA TGCAGGCGCT CGAAGCGCAC GGGATTCCGA CGATCGACCT GCTCGTCGTG
AACCTGTATC CGTTCGTCCA GACGATTGCG AAGGACGACT GCACGCTCGC CGACGCGATC
GAGAACATCG ACATCGGCGG CCCGACGATG CTGCGCTCGG CGGCGAAGAA CCACCGCGAC
GTGACGGTCG TCGTCGACCC GGCCGATTAC GCGGTCGTGC TCGACGAGAT GAAAGCGAAC
GGCAACACGC TCGGCTACAA GACGAATTTC CGCCTCGCGA CCAAGGTGTT CGCGCACACC
GCGCAGTACG ACGGCGCGAT CACGAACTAC CTGACGAGCC TCGGCGACGA TCTGCAGCAC
GGCTCGCGCA GCGCATACCC GGCAACGCTG AACCTCGCGT TCGACAAGGT GCAGGACCTG
CGCTACGGCG AGAATCCGCA CCAGAGCGCC GCGTTCTACC GCGACGTCGC GACGCCGGCC
GGCGCGCTCG CGAACTACCG CCAGTTGCAG GGCAAGGAAC TGTCGTACAA CAACATCGCC
GATTCGGACG CCGCGTGGGA ATGCGTGAAG ACGTTCGACG CGCCGGCGTG CGTGATCATC
AAGCACGCGA ATCCGTGCGG CGTCGCGGTG GGCGCGGACG CGGGCGAAGC GTACGCGAAG
GCGTTCCAGA CCGATCCGAC CTCCGCGTTC GGCGGCATCA TCGCGTTCAA CCGCGAAGTC
GACGAGGCCG CGGCCCAGGC GGTCGCGAAG CAATTCGTCG AAGTGCTGAT CGCGCCGTCG
TTCTCGGACG CGGCCAAGCA GGTGTTCGCG GCCAAGCAGA ACGTGCGCCT GCTCGAAATC
GCGCTGGGCG AAGGCCATAA CGCGTTCGAT CTGAAGCGCG TGGGCGGCGG CCTGCTCGTG
CAATCGCTCG ATTCGAAGAA CGTGCAGCCG CGCGAGCTGC GCGTCGTCAC GAAACGCCAC
CCGACGCCGA AGGAAATGGA CGACCTCCTG TTCGCATGGC GCGTCGCGAA ATACGTGAAG
TCGAACGCGA TCGTGTTCTG CGGCAACGGG ATGACGCTCG GCGTCGGCGC AGGCCAGATG
AGCCGCGTCG ATTCGGCGCG CATCGCGAGC ATCAAGGCAC AGAACGCGGG CCTCACGCTC
GCGGGCTCGG CCGTCGCGTC GGACGCGTTC TTCCCGTTCC GCGACGGCCT CGACGTCGTC
GTCGCGGCAG GCGCGACCTG CGTGATCCAG CCGGGCGGCT CGGTGCGCGA CGACGAGGTG
ATCGCCGCCG CCGACGAGCA CAACATCGCG ATGGTCGTGA CGGGCGTGCG CCACTTCCGT
CACTGA
 
Protein sequence
MIKQALISVS DKTGIVDFAK ALSALGVKLL STGGTAKLLA DAGLPVTEVA DYTGFPEMLD 
GRVKTLHPKV HGGILARRDL PEHMQALEAH GIPTIDLLVV NLYPFVQTIA KDDCTLADAI
ENIDIGGPTM LRSAAKNHRD VTVVVDPADY AVVLDEMKAN GNTLGYKTNF RLATKVFAHT
AQYDGAITNY LTSLGDDLQH GSRSAYPATL NLAFDKVQDL RYGENPHQSA AFYRDVATPA
GALANYRQLQ GKELSYNNIA DSDAAWECVK TFDAPACVII KHANPCGVAV GADAGEAYAK
AFQTDPTSAF GGIIAFNREV DEAAAQAVAK QFVEVLIAPS FSDAAKQVFA AKQNVRLLEI
ALGEGHNAFD LKRVGGGLLV QSLDSKNVQP RELRVVTKRH PTPKEMDDLL FAWRVAKYVK
SNAIVFCGNG MTLGVGAGQM SRVDSARIAS IKAQNAGLTL AGSAVASDAF FPFRDGLDVV
VAAGATCVIQ PGGSVRDDEV IAAADEHNIA MVVTGVRHFR H