Gene BURPS1710b_3403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3403 
SymbolpurH 
ID3690848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3725158 
End bp3726723 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content67% 
IMG OID637729859 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_334775 
Protein GI76811960 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.142006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAA 
GCGCTGTCCG CGCTCGGCGT CAAGCTGCTG TCGACGGGCG GCACCGCGAA ACTGCTCGCC
GACGCGGGCC TGCCCGTCAC CGAAGTGGCC GACTACACCG GCTTCCCGGA AATGCTCGAT
GGGCGCGTGA AGACGCTGCA CCCGAAGGTG CACGGCGGCA TCCTCGCCCG CCGCGACCTG
CCCGAGCACA TGCAGGCGCT CGAAGCGCAC GGGATTCCGA CGATCGACCT GCTCGTCGTG
AACCTGTATC CGTTCGTCCA GACGATTGCG AAGGACGACT GCACGCTCGC CGACGCGATC
GAGAACATCG ACATCGGCGG CCCGACGATG CTGCGCTCGG CGGCGAAGAA CCACCGCGAC
GTGACGGTCG TCGTCGACCC GGCCGATTAC GCGGTCGTGC TCGACGAGAT GAAAGCGAAC
GGCAACACGC TCGGCTACAA GACGAATTTC CGCCTCGCGA CCAAGGTGTT CGCGCACACC
GCGCAGTACG ACGGCGCGAT CACGAACTAC CTGACGAGCC TCGGCGACGA TCTGCAGCAC
GGCTCGCGCA GCGCATACCC GGCAACGCTG AACCTCGCGT TCGACAAGGT GCAGGACCTG
CGCTACGGCG AGAATCCGCA CCAGAGCGCC GCGTTCTACC GCGACGTCGC GACGCCGGCC
GGCGCGCTCG CGAACTACCG CCAGTTGCAG GGCAAGGAAC TGTCGTACAA CAACATCGCC
GATTCGGACG CCGCGTGGGA ATGCGTGAAG ACGTTCGACG CGCCGGCGTG CGTGATCATC
AAGCACGCGA ATCCGTGCGG CGTCGCGGTG GGCGCGGACG CGGGCGAAGC GTACGCGAAG
GCGTTCCAGA CCGATCCGAC CTCCGCGTTC GGCGGCATTA TCGCGTTCAA CCGCGAAGTC
GACGAGGCCG CGGCCCAGGC GGTCGCGAAG CAATTCGTCG AAGTGCTGAT CGCGCCGTCG
TTCTCGGACG CGGCCAAGCA GGTGTTCGTG GCCAAGCAGA ACGTGCGCCT GCTCGAAATC
GCGCTGGGCG AAGGCCATAA CGCGTTCGAT CTGAAGCGCG TGGGCGGCGG CCTGCTCGTG
CAATCGCTCG ATTCGAAGAA CGTGCAGCCG CGCGAGCTGC GCGTCGTCAC GAAACGCCAC
CCGACGCCGA AGGAAATGGA CGACCTCCTG TTCGCATGGC GCGTCGCGAA ATACGTGAAG
TCGAACGCGA TCGTGTTCTG CGGCAACGGG ATGACGCTCG GCGTCGGCGC AGGCCAGATG
AGCCGCGTCG ATTCGGCGCG CATCGCGAGC ATCAAGGCAC AGAACGCGGG CCTCACGCTC
GCGGGCTCGG CCGTCGCGTC GGACGCGTTC TTCCCGTTCC GCGACGGCCT CGACGTCGTC
GTCGCGGCGG GCGCGACCTG CGTGATCCAG CCGGGCGGCT CGGTGCGCGA CGACGAGGTG
ATCGCCGCCG CCGACGAGCA CAACATCGCG ATGGTCGTGA CGGGCGTGCG CCACTTCCGT
CACTGA
 
Protein sequence
MIKQALISVS DKTGIVDFAK ALSALGVKLL STGGTAKLLA DAGLPVTEVA DYTGFPEMLD 
GRVKTLHPKV HGGILARRDL PEHMQALEAH GIPTIDLLVV NLYPFVQTIA KDDCTLADAI
ENIDIGGPTM LRSAAKNHRD VTVVVDPADY AVVLDEMKAN GNTLGYKTNF RLATKVFAHT
AQYDGAITNY LTSLGDDLQH GSRSAYPATL NLAFDKVQDL RYGENPHQSA AFYRDVATPA
GALANYRQLQ GKELSYNNIA DSDAAWECVK TFDAPACVII KHANPCGVAV GADAGEAYAK
AFQTDPTSAF GGIIAFNREV DEAAAQAVAK QFVEVLIAPS FSDAAKQVFV AKQNVRLLEI
ALGEGHNAFD LKRVGGGLLV QSLDSKNVQP RELRVVTKRH PTPKEMDDLL FAWRVAKYVK
SNAIVFCGNG MTLGVGAGQM SRVDSARIAS IKAQNAGLTL AGSAVASDAF FPFRDGLDVV
VAAGATCVIQ PGGSVRDDEV IAAADEHNIA MVVTGVRHFR H