Gene BURPS668_3363 details

Gene Information

Locus tag: BURPS668_3363
Symbol: purH
ID: 4885173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3298054
End bp: 3299619
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 67%
IMG OID: 640129290
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001060373
Protein GI: 126440154
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
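
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3298054
end_bp = 3299619
reported_gene_length = 1566    # bp
reported_protein_length = 521  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length matches:   ", gene_length == reported_gene_length)
print("length is codon-aligned:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```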


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAA 
GCGCTGTCCG CGCTCGGCGT CAAGCTGCTG TCGACGGGCG GCACCGCGAA ACTGCTCGCC
GACGCGGGCC TGCCCGTCAC CGAAGTGGCC GACTACACCG GCTTCCCGGA AATGCTCGAT
GGGCGCGTGA AGACGCTGCA CCCGAAGGTG CACGGCGGCA TCCTCGCCCG CCGCGACCTG
CCCGAGCACA TGCAGGCGCT CGAAGCGCAC GGGATTCCGA CGATCGACCT GCTCGTCGTG
AACCTGTATC CGTTCGTCCA GACGATTGCG AAGGACGACT GCACGCTCGC CGACGCGATC
GAGAACATCG ACATCGGCGG CCCGACGATG CTGCGCTCGG CGGCGAAGAA CCACCGCGAC
GTGACGGTCG TCGTCGACCC GGCCGATTAC GCGGTCGTGC TCGACGAGAT GAAAGCGAAC
GGCAACACGC TCGGCTACAA GACGAATTTC CGCCTCGCGA CCAAGGTGTT CGCGCACACC
GCGCAGTACG ACGGCGCGAT CACGAACTAC CTGACGAGCC TCGGCGACGA TCTGCAGCAC
GGCTCGCGCA GCGCATACCC GGCAACGCTG AACCTCGCGT TCGACAAGGT GCAGGACCTG
CGCTACGGCG AGAATCCGCA CCAGAGCGCC GCGTTCTACC GCGACGTCGC GACGCCGGCC
GGCGCGCTCG CGAACTACCG CCAGTTGCAG GGCAAGGAAC TGTCGTACAA CAACATCGCC
GATTCGGACG CCGCGTGGGA ATGCGTGAAG ACGTTCGACG CGCCGGCGTG CGTGATCATC
AAGCACGCGA ATCCGTGCGG CGTCGCGGTG GGCGCGGACG CGGGCGAAGC GTACGCGAAG
GCGTTCCAGA CCGATCCGAC CTCCGCGTTC GGCGGCATCA TCGCGTTCAA CCGCGAAGTC
GACGAGGCCG CGGCCCAGGC GGTCGCGAAG CAATTCGTCG AAGTGCTGAT CGCGCCGTCG
TTCTCGGACG CGGCCAAGCA GGTGTTCGCG GCCAAGCAGA ACGTGCGCCT GCTCGAAATC
GCGCTGGGCG AAGGCCATAA CGCGTTCGAT CTGAAGCGCG TGGGCGGCGG CCTGCTCGTG
CAATCGCTCG ATTCGAAGAA CGTGCAGCCG CGCGAGCTGC GCGTCGTCAC GAAACGCCAC
CCGACGCCGA AGGAAATGGA CGACCTCCTG TTCGCATGGC GCGTCGCGAA ATACGTGAAG
TCGAACGCGA TCGTGTTCTG CGGCAACGGG ATGACGCTCG GCGTCGGCGC AGGCCAGATG
AGCCGCGTCG ATTCGGCGCG CATCGCGAGC ATCAAGGCAC AGAACGCGGG CCTCACGCTC
GCGGGCTCGG CCGTCGCGTC GGACGCGTTC TTCCCGTTCC GCGACGGTCT CGACGTCGTC
GTCGCGGCGG GCGCGACCTG CGTGATCCAG CCAGGCGGCT CGGTGCGCGA CGACGAGGTG
ATCGCCGCCG CCGACGAGCA CAACATCGCG ATGGTCGTGA CGGGCGTGCG CCACTTCCGT
CACTGA
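
The gene sequence above is laid out in blocks of ten bases separated by spaces and line breaks, so the whitespace has to be stripped before any computation. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as demo input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAA"

seq = clean_sequence(first_row)
print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the GC content reported in the gene information.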
 
Protein sequence
MIKQALISVS DKTGIVDFAK ALSALGVKLL STGGTAKLLA DAGLPVTEVA DYTGFPEMLD 
GRVKTLHPKV HGGILARRDL PEHMQALEAH GIPTIDLLVV NLYPFVQTIA KDDCTLADAI
ENIDIGGPTM LRSAAKNHRD VTVVVDPADY AVVLDEMKAN GNTLGYKTNF RLATKVFAHT
AQYDGAITNY LTSLGDDLQH GSRSAYPATL NLAFDKVQDL RYGENPHQSA AFYRDVATPA
GALANYRQLQ GKELSYNNIA DSDAAWECVK TFDAPACVII KHANPCGVAV GADAGEAYAK
AFQTDPTSAF GGIIAFNREV DEAAAQAVAK QFVEVLIAPS FSDAAKQVFA AKQNVRLLEI
ALGEGHNAFD LKRVGGGLLV QSLDSKNVQP RELRVVTKRH PTPKEMDDLL FAWRVAKYVK
SNAIVFCGNG MTLGVGAGQM SRVDSARIAS IKAQNAGLTL AGSAVASDAF FPFRDGLDVV
VAAGATCVIQ PGGSVRDDEV IAAADEHNIA MVVTGVRHFR H
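
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows one way such a translation could be reproduced in plain Python. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The function name `translate` is hypothetical, and alternative start codons are not handled (the gene here begins with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Codon assignments of NCBI translation table 11 match the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAA"
print(translate(first_row))
```

The printed residues can be compared with the first two blocks of the protein sequence listed above.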