Gene Bcep18194_A3777 details

Gene Information

Locus tag: Bcep18194_A3777
Symbol: purH
ID: 3748962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 671739
End bp: 673304
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 66%
IMG OID: 637762057
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_368022
Protein GI: 78065253
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
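
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of the CDS is an untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 671739
end_bp = 673304

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon, which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1566 bp) and Protein Length (521 aa).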


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.259802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAG 
TCGCTGTCCG ACCTCGGCGT CAAGCTGCTG TCGACGGGCG GCACCGCGAA ACTGCTCGCC
GACGCGGGCC TGCCCGTGAC CGAAGTGGCT GATTACACGG GCTTCCCGGA AATGCTCGAT
GGGCGCGTGA AGACGCTCCA CCCGAAGGTG CACGGCGGCA TCCTGGCCCG CCGCGACCTG
CCCGAGCACA TGCAGGCGCT GGAACAGCAC GGTATCCCGA CGATCGACCT GCTGGTCGTG
AACCTGTACC CGTTCGTCGC GACGATCGCG AAGGACGACT GCACGCTCGC CGACGCGATC
GAGAACATCG ACATCGGCGG CCCGACGATG CTGCGCTCGG CTGCGAAGAA CCACCGCGAC
GTGACGGTTG TTGTCGATCC GGCCGATTAC GCGGTCGTGC TCGATGAAAT GAAGGCGAAC
GGCAATGCGG TCGGCTACGC GACCAACTTC CGCCTCGCGA CGAAGGTGTT CGCGCACACC
GCGCAATACG ACGGCGCGAT CACGAACTAC CTGACGAGCC TGACCGACGA GCTGAAGCAC
GCATCGCGCA GCGCGTACCC GGCGACGCTG AACCTGGCGT TCGACAAGGT GCAGGACCTG
CGCTACGGCG AGAACCCGCA CCAGAGCGCC GCGTTCTACC GTGACCTCGC GACGCCGGCC
GGCGCGCTGG CGAACTACCG CCAGCTGCAG GGCAAGGAAC TGTCGTACAA CAACATCGCC
GATTCGGACG CAGCGTGGGA ATGCGTGAAG ACGTTCGACG CACCGGCCTG CGTGATCATC
AAGCATGCGA ACCCGTGCGG CGTCGCGGTC GGCAACGATT CGGCCGACGC GTACGCGAAG
GCGTTCCAGA CGGACCCGAC GTCGGCATTC GGCGGCATCA TCGCGTTCAA CCGCGAAGTG
GACGAAGCGG CGGCCCAGGC CGTTGCGAAG CAGTTCGTCG AAGTGCTGAT CGCACCGTCG
TTCTCCGACG CCGCGAAGCA GGTGTTCGCC GCGAAGCAGA ACGTGCGCCT GCTCGAAATC
GCGCTCGGCG ACGGCCATAA CGCGTTCGAC CTGAAGCGCG TGGGCGGCGG CCTGCTCGTG
CAGTCGCTCG ACTCGAAGAA CGTGCAGCCG AGCGAACTGC GCGTCGTCAC GAAGCGCCAG
CCGACCGCGA AGGAAATGGA CGACCTGCTG TTCGCATGGC GTGTCGCGAA GTACGTGAAG
TCGAACGCGA TCGTGTTCTG CGGCAACGGC ATGACGCTCG GCGTCGGCGC AGGCCAGATG
AGCCGTGTCG ATTCCGCACG CATCGCGAGC ATCAAGGCGC AGAACGCAGG CCTGACGCTG
GCCGGTTCGG CCGTGGCATC GGACGCGTTC TTCCCGTTCC GCGACGGCCT CGACGTCGTC
GTGGCGGCGG GCGCGACCTG CGTGATCCAG CCGGGCGGCT CGATGCGCGA TGACGAAGTG
ATCGCTGCAG CCGACGAGCA CAACATCGCG ATGATCCTGA CGGGCGTGCG CCACTTCCGT
CACTGA
 
Protein sequence
MIKQALISVS DKTGIVDFAK SLSDLGVKLL STGGTAKLLA DAGLPVTEVA DYTGFPEMLD 
GRVKTLHPKV HGGILARRDL PEHMQALEQH GIPTIDLLVV NLYPFVATIA KDDCTLADAI
ENIDIGGPTM LRSAAKNHRD VTVVVDPADY AVVLDEMKAN GNAVGYATNF RLATKVFAHT
AQYDGAITNY LTSLTDELKH ASRSAYPATL NLAFDKVQDL RYGENPHQSA AFYRDLATPA
GALANYRQLQ GKELSYNNIA DSDAAWECVK TFDAPACVII KHANPCGVAV GNDSADAYAK
AFQTDPTSAF GGIIAFNREV DEAAAQAVAK QFVEVLIAPS FSDAAKQVFA AKQNVRLLEI
ALGDGHNAFD LKRVGGGLLV QSLDSKNVQP SELRVVTKRQ PTAKEMDDLL FAWRVAKYVK
SNAIVFCGNG MTLGVGAGQM SRVDSARIAS IKAQNAGLTL AGSAVASDAF FPFRDGLDVV
VAAGATCVIQ PGGSMRDDEV IAAADEHNIA MILTGVRHFR H
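
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the database's own method: it builds the codon table by hand from the standard amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs in which start codons are permitted), and the helper names `clean` and `translate` are hypothetical. Only the first line of each sequence is checked here; the full sequences can be passed in the same way.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein.
first_gene_line = (
    "ATGATCAAGC AAGCGCTCAT TTCCGTTTCC GACAAGACCG GCATCGTCGA CTTCGCGAAG"
)
first_protein_block = "MIKQALISVS DKTGIVDFAK"

print(translate(first_gene_line) == clean(first_protein_block))
```

The same `translate` helper applied to the final six bases of the gene sequence, `CACTGA`, yields `H`, matching the last residue of the protein sequence.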