Gene GM21_3524 details


Gene Information

Locus tag: GM21_3524
Symbol: purH
ID: 8138896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4065511
End bp: 4067073
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 65%
IMG OID: 644871143
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_003023303
Protein GI: 253702114
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
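
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the stated length) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4065511
end_bp = 4067073

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the terminal stop codon adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1563 520
```

Both results agree with the Gene Length (1563 bp) and Protein Length (520 aa) fields.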


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value: 0.218896
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGA TTGGGCGCGC GCTGATCAGC GTGTCGGAGA AGACTGGTGT GGTGGAATTT 
TCGCGGGCGC TGGCGGGCTA CGGCGTGGAG ATCCTCTCCA CCGGCGGTAC CGCGAAACTC
TTGCGTGAGG CTGGAATCGC CGTCAAGGAC GTCTCCGAGT TCACCGGTTT CCCCGAGATG
CTGGACGGCC GGGTGAAGAC CCTGCACCCG AAGGTTCACG GCGGCATCCT CGGCATGCGC
GAGAACCCGG CGCACGTAGC CAAGATGCAG GAGCACGGCA TCGAGCCCAT CGACATGGTG
GTGGTGAACC TCTACCCGTT TGAGGCGACC GTCGCGAAAG AGGACTGCAC CATGGAGGAC
GCCATCGAGA ACATCGACAT CGGCGGCCCG ACCATGCTCC GCTCCGCGGC CAAGAACAAC
CGCGACGTCA CCGTCGTCGT CGACCACGCC GATTACGCGG TGGTCCTGGA CGAGATGAAG
AACTCCGGCG GCAGCGTGTC GTGCGAGACC AATTTCCGCC TGGCCGTGAA GGTGTACCAG
CACACCGCAG CCTACGACGG CGCCATCTCC AACTGGCTCG GCGCCCGCAC CGGCGACGGT
GTGGCGGCTT TCCCGGACAC CCTCACCCTG CAGTACAAGC TGGCCCAGGG GATGCGCTAC
GGCGAGAACC CGCACCAGTC CGGCGCCTTC TACGTCGAGA AGGGGTCCAA GGAGGCCTCC
ATCTCCACCG CGCGCCAGAT CCAGGGGAAG GAACTCTCCT ACAACAACAT CGGCGACACC
GATGCGGCGC TCGAATGCGT GAAGCAGTTC ACGGAGCCTG CCTGCGTCAT CGTGAAGCAT
GCGAACCCCT GCGGTGTCGC GCTCGGCGCG AACATCATGG AAGCCTATGA CAAGGCGTAC
AAGACCGATC CCGAGTCCTC CTTCGGCGGC ATCATCGCCT TCAACCGCGA ACTGGACGAG
TCCACCGCCC GCGCCATCGT CGAGCGCCAG TTCGTCGAAG TGATCATCGC CCCCAAGGTG
ACCGAGGCCG CGAGCGAAGT GGTCGCGGCG AAGAAGAACG TCCGCCTCAT GGAGTGCGGC
TTCTGGCCCG AGAATCCGGC GCCCCGTTTC GATTACAAGA GGGTGAACGG CGGCATGCTG
GTCCAGGACG CCGACCTCGA ACTCTTCACC GAATTGAAGG TGGTGACCAA GAGGGCGCCG
ACCGACAAGG AGATGGAAGA CCTTCTCTTC ACCTGGCGCG TGGCCAAGTT CGTCAAATCC
AACGCCATCG TCTACGGCCG CGACAACTCC ACCGTCGGCG TCGGCGCAGG CCAGATGAGC
CGCGTCAACT CCGCCCGCAT CGCCGCCATC AAGGCCGAGC ATGCCGGCAT TCCGGTCCAG
GGTGCGGTCA TGGCGTCCGA CGCCTTCTTC CCGTTCAGGG ACGGTCTCGA CAACGCCGCC
GCCGTAGGCG TCACCGCCGT GATCCAGCCC GGCGGCAGCA TGCGTGACGC CGAGGTCATC
GCCGCGGCCG ACGAGCACGG CATCGCCATG GTCTTCACTG CGATGAGGCA CTTCAGACAC
TGA
 
Protein sequence
MAKIGRALIS VSEKTGVVEF SRALAGYGVE ILSTGGTAKL LREAGIAVKD VSEFTGFPEM 
LDGRVKTLHP KVHGGILGMR ENPAHVAKMQ EHGIEPIDMV VVNLYPFEAT VAKEDCTMED
AIENIDIGGP TMLRSAAKNN RDVTVVVDHA DYAVVLDEMK NSGGSVSCET NFRLAVKVYQ
HTAAYDGAIS NWLGARTGDG VAAFPDTLTL QYKLAQGMRY GENPHQSGAF YVEKGSKEAS
ISTARQIQGK ELSYNNIGDT DAALECVKQF TEPACVIVKH ANPCGVALGA NIMEAYDKAY
KTDPESSFGG IIAFNRELDE STARAIVERQ FVEVIIAPKV TEAASEVVAA KKNVRLMECG
FWPENPAPRF DYKRVNGGML VQDADLELFT ELKVVTKRAP TDKEMEDLLF TWRVAKFVKS
NAIVYGRDNS TVGVGAGQMS RVNSARIAAI KAEHAGIPVQ GAVMASDAFF PFRDGLDNAA
AVGVTAVIQP GGSMRDAEVI AAADEHGIAM VFTAMRHFRH
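
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a block could be processed, the following Python joins the blocks, computes a GC fraction, and translates the DNA. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. Translation table 11 assigns the same amino acids to codons as the standard code; it differs in its permitted start codons, which this sketch does not handle (this gene starts with ATG, so the difference does not arise here).

```python
from itertools import product

# Codon table in TCAG order. Table 11 shares these codon assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = ("ATGGCAAAGA TTGGGCGCGC GCTGATCAGC "
              "GTGTCGGAGA AGACTGGTGT GGTGGAATTT")
dna = clean(first_line)
print(translate(dna))  # MAKIGRALISVSEKTGVVEF
```

The printed residues match the first two blocks of the protein sequence (MAKIGRALIS VSEKTGVVEF). Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.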