Gene BamMC406_0610 details


Gene Information

Locus tag: BamMC406_0610
Symbol: purH
ID: 6177943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia ambifaria MC40-6
Kingdom: Bacteria
Replicon accession: NC_010551
Strand:
Start bp: 686319
End bp: 687884
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 67%
IMG OID: 641680358
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001807323
Protein GI: 172059671
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
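
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(686319, 687884)
print(length_bp)                  # 1566
print(protein_length(length_bp))  # 521
```

Both results agree with the Gene Length and Protein Length fields listed in the record.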


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.951228
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.279266
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAG 
TCGCTGTCCG ACCTCGGCGT CAAGCTGCTG TCGACCGGCG GCACCGCGAA ACTCCTCGCC
GACGCGGGCC TGCCCGTTAC CGAAGTGGCT GATTACACGG GCTTTCCGGA AATGCTCGAT
GGGCGCGTGA AGACGCTCCA CCCGAAGGTG CACGGCGGCA TCCTCGCCCG CCGCGACCTG
CCCGAGCACA TGCAGGCGCT GGAGCAGCAC GACATCCCGA CGATCGACCT GCTGGTCGTG
AACCTGTATC CGTTCGTCGC GACGATCGCG AAGGACGACT GCACGCTCGC CGACGCGATC
GAGAACATCG ACATCGGCGG CCCGACGATG CTGCGCTCGG CCGCGAAGAA CCACCGTGAC
GTGACGGTCG TGGTCGATCC GGCCGACTAC GCGGTCGTGC TCGACGAAAT GAAGGCGAAC
GGCAACGCGA TCGGCTACGC GACCAACTTC CGCCTCGCGA CGAAGGTGTT CGCGCACACC
GCGCAGTACG ACGGCGCGAT CACGAACTAC CTGACGAGCC TGACCGACGA GCTGCAGCAC
GCGTCGCGCA GCGCGTACCC GGCGACGCTG AACATGGCGT TCGACAAGGT GCAGGACCTG
CGCTACGGCG AGAACCCGCA CCAGAGCGCC GCGTTCTACC GCGACCTCGC GGCGCCGGCC
GGGGCACTGG CGAACTACCG CCAGCTGCAG GGCAAGGAGC TGTCGTACAA CAACATCGCG
GATTCGGACG CGGCGTGGGA ATGCGTGAAG ACGTTCGACG CGCCGGCCTG CGTGATCATC
AAGCATGCGA ACCCGTGCGG CGTCGCGGTC GGCAACGACT CGGCCGACGC ATACGCGAAG
GCATTCCAGA CCGACCCGAC GTCGGCGTTC GGCGGCATCA TCGCGTTCAA CCGCGAAGTC
GACGAGGCGG CCGCGCAGGC GGTGGCGAAG CAGTTCGTCG AAGTGCTGAT CGCGCCGTCG
TTCTCCGACG CCGCGAAGCA GGTGTTCGCC GCGAAGCAGA ACGTGCGCCT GCTCGAGATC
GCGCTGGGTG ACGGCCATAA CGCCTTCGAC CTGAAGCGCG TGGGCGGCGG CCTGCTCGTG
CAGTCGCTCG ATTCGAAGAA CGTGCAGCCG AGCGAGCTGC GCGTCGTCAC GAAGCGCCAG
CCGAGCGCGA AGGAAATGGA TGACCTGCTG TTCGCATGGC GCGTCGCGAA GTACGTGAAG
TCGAACGCGA TCGTGTTCTG CGGCAACGGC ATGACGCTCG GCGTCGGCGC AGGCCAGATG
AGCCGCGTCG ATTCCGCGCG CATCGCGAGC ATCAAGGCGC AGAACGCGGG CCTGACGCTG
GCTGGCTCGG CCGTCGCGTC GGATGCGTTC TTCCCGTTCC GCGACGGTCT CGACGTCGTC
GTGGCGGCAG GCGCGACCTG CGTGATCCAG CCGGGCGGTT CGATGCGCGA CGACGAAGTG
ATCGCGGCAG CGGACGAGCA CGGCATCGCG ATGATCCTGA CGGGCGTGCG TCACTTCCGT
CACTGA
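
The GC content field of this record can be recomputed from a gene sequence formatted as above, in space-separated blocks of ten bases. The following is a minimal sketch. The helper name `gc_fraction` is hypothetical, and it assumes that only the characters A, C, G and T count as bases, so spaces and line breaks are ignored.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces, line breaks and any other characters are ignored (an assumption
    made so that block-formatted sequence text can be pasted in directly).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# The first ten-base block of the gene sequence contains four G/C bases.
print(gc_fraction("ATGATCAAGC"))  # 0.4
```

Passing the full gene sequence text to the same function gives the gene-wide value that can be compared with the GC content reported in the record.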
 
Protein sequence
MIKQALISVS DKTGIVDFAK SLSDLGVKLL STGGTAKLLA DAGLPVTEVA DYTGFPEMLD 
GRVKTLHPKV HGGILARRDL PEHMQALEQH DIPTIDLLVV NLYPFVATIA KDDCTLADAI
ENIDIGGPTM LRSAAKNHRD VTVVVDPADY AVVLDEMKAN GNAIGYATNF RLATKVFAHT
AQYDGAITNY LTSLTDELQH ASRSAYPATL NMAFDKVQDL RYGENPHQSA AFYRDLAAPA
GALANYRQLQ GKELSYNNIA DSDAAWECVK TFDAPACVII KHANPCGVAV GNDSADAYAK
AFQTDPTSAF GGIIAFNREV DEAAAQAVAK QFVEVLIAPS FSDAAKQVFA AKQNVRLLEI
ALGDGHNAFD LKRVGGGLLV QSLDSKNVQP SELRVVTKRQ PSAKEMDDLL FAWRVAKYVK
SNAIVFCGNG MTLGVGAGQM SRVDSARIAS IKAQNAGLTL AGSAVASDAF FPFRDGLDVV
VAAGATCVIQ PGGSMRDDEV IAAADEHGIA MILTGVRHFR H
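
The protein sequence can be checked against the gene sequence by translating the codons in frame. The sketch below is illustrative rather than the database's own procedure. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    # Keep only A/C/G/T, which drops the spaces and line breaks of the block layout.
    seq = "".join(b for b in dna.upper() if b in "ACGT")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGATCAAGC AAGCGCTCAT TTCCGTTTCC"))  # MIKQALISVS
# The last two codons give the final residue followed by the stop codon.
print(translate("CACTGA"))  # H
```

Applied to the full gene sequence, the same function should yield a protein that can be compared residue by residue with the protein sequence listed above.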