Gene Gbem_3458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbem_3458 
SymbolpurH 
ID6780320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter bemidjiensis Bem 
KingdomBacteria 
Replicon accessionNC_011146 
Strand
Start bp3969435 
End bp3970997 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content64% 
IMG OID642769452 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002140247 
Protein GI197119820 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.888593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGA TTGGGCGCGC GCTGATCAGC GTGTCGGAGA AGACTGGTGT GGTGGAATTT 
TCCCGGGCGC TGGCAGGCTA CGGCGTGGAG ATCCTCTCCA CCGGCGGTAC CGCCAAGCTT
TTGCGTGAGG CGGGAATCGC CGTCAAGGAC GTCTCCGAGT TCACCGGTTT CCCCGAGATG
CTGGACGGCC GGGTCAAGAC CCTGCACCCG AAGGTTCACG GCGGCATCCT CGGCATGCGC
GAGAACCCGG CGCACGTAGC CAAGATGCAG GAGCACGGCA TCGAGCCCAT CGACATGGTG
GTGGTAAACC TCTACCCGTT CGAGGCGACC GTCGCGAAAG AGGACTGCAC CATGGAGGAT
GCCATCGAGA ACATCGATAT CGGCGGCCCG ACCATGCTCC GCTCCGCAGC CAAGAACAAC
CGCGACGTCA CCGTCGTCGT TGACCACGCC GATTACGCAG TGGTCCTGGA CGAGATGAAA
AACTCCGGCG GCAGCGTGTC GCGCGAGACC AATTTCCGTC TGGCCGTGAA GGTGTATCAG
CACACCGCAG CCTACGACGG CGCCATCTCC AACTGGCTCG GCGCCCGCAC CGGCGAAGGT
GTGGCGCCCT TCCCGGACAC CCTCACCATG CAGTACAAGC TGGCCCAGGG GATGCGCTAC
GGCGAGAACC CGCACCAGTC CGGCGCCTTC TACGTCGAAA AGGGGTCCAG GGAATCCTCC
ATCTCCACGG CGCGCCAGAT CCAGGGAAAG GAACTCTCCT ACAACAACAT CGGCGACACC
GATGCGGCGC TGGAGTGCGT GAAGCAGTTC ACCGAGCCGG CCTGCGTCAT CGTAAAGCAC
GCGAACCCCT GCGGTGTCGC GCTCGGCGCG AACATCATGG AAGCCTATGA CAAGGCGTAC
AAGACCGACC CCGAGTCCGC CTTCGGCGGC ATCATCGCCT TCAACCGCGA GCTGGACGAG
TCCACCGCCC GCGCCATCGT CGAGCGCCAG TTCGTCGAAG TGATCATCGC CCCCAAGGTG
ACCGAGGCCG CCAGCGAAAT CGTCGCCGCC AAGAAGAACG TCCGCCTCAT GGAGTGCGGC
TTCTGGCCCG AGAATCCGGC GCCCCGTTTC GATTACAAGA GGGTGAACGG CGGCATGCTG
GTCCAGGACG CCGACCTCGA GCTCTTCACC GAGTTGAAGG TAGTGACCAA GAGGGCGCCG
ACCGACAAAG AGATGGAAGA CCTTCTCTTC ACCTGGCGCG TGGCCAAGTT CGTCAAATCC
AACGCCATCG TCTACGGCCG CGACAACTCC ACCGTCGGCG TCGGCGCAGG GCAGATGAGC
CGGGTCAACT CCGCCCGCAT CGCCGCCATC AAGGCCGAGC ATGCCGGCAT TCCGGTCCAG
GGTGCGGTCA TGGCGTCCGA CGCCTTCTTC CCGTTCCGGG ACGGCCTCGA CAACGCCGCC
TCCGTCGGTG TCACCGCCGT AATCCAGCCC GGCGGCAGCA TGCGTGACGC CGAGGTCATC
GCCGCAGCCG ACGAGCACGG CATCGCCATG GTCTTCACCA GCATGAGGCA CTTCAGGCAC
TGA
 
Protein sequence
MAKIGRALIS VSEKTGVVEF SRALAGYGVE ILSTGGTAKL LREAGIAVKD VSEFTGFPEM 
LDGRVKTLHP KVHGGILGMR ENPAHVAKMQ EHGIEPIDMV VVNLYPFEAT VAKEDCTMED
AIENIDIGGP TMLRSAAKNN RDVTVVVDHA DYAVVLDEMK NSGGSVSRET NFRLAVKVYQ
HTAAYDGAIS NWLGARTGEG VAPFPDTLTM QYKLAQGMRY GENPHQSGAF YVEKGSRESS
ISTARQIQGK ELSYNNIGDT DAALECVKQF TEPACVIVKH ANPCGVALGA NIMEAYDKAY
KTDPESAFGG IIAFNRELDE STARAIVERQ FVEVIIAPKV TEAASEIVAA KKNVRLMECG
FWPENPAPRF DYKRVNGGML VQDADLELFT ELKVVTKRAP TDKEMEDLLF TWRVAKFVKS
NAIVYGRDNS TVGVGAGQMS RVNSARIAAI KAEHAGIPVQ GAVMASDAFF PFRDGLDNAA
SVGVTAVIQP GGSMRDAEVI AAADEHGIAM VFTSMRHFRH