Gene Information |
Locus tag | GM21_3524 |
Symbol | purH |
ID | 8138896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4065511 |
End bp | 4067073 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644871143 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_003023303 |
Protein GI | 253702114 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
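The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for GM21_3524 (purH).
length = gene_length(4065511, 4067073)
print(length, protein_length(length))
```

Under these assumptions the sketch reproduces the 1563 bp gene length and the 520 aa protein length listed in the record.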
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 0.218896 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGA TTGGGCGCGC GCTGATCAGC GTGTCGGAGA AGACTGGTGT GGTGGAATTT TCGCGGGCGC TGGCGGGCTA CGGCGTGGAG ATCCTCTCCA CCGGCGGTAC CGCGAAACTC TTGCGTGAGG CTGGAATCGC CGTCAAGGAC GTCTCCGAGT TCACCGGTTT CCCCGAGATG CTGGACGGCC GGGTGAAGAC CCTGCACCCG AAGGTTCACG GCGGCATCCT CGGCATGCGC GAGAACCCGG CGCACGTAGC CAAGATGCAG GAGCACGGCA TCGAGCCCAT CGACATGGTG GTGGTGAACC TCTACCCGTT TGAGGCGACC GTCGCGAAAG AGGACTGCAC CATGGAGGAC GCCATCGAGA ACATCGACAT CGGCGGCCCG ACCATGCTCC GCTCCGCGGC CAAGAACAAC CGCGACGTCA CCGTCGTCGT CGACCACGCC GATTACGCGG TGGTCCTGGA CGAGATGAAG AACTCCGGCG GCAGCGTGTC GTGCGAGACC AATTTCCGCC TGGCCGTGAA GGTGTACCAG CACACCGCAG CCTACGACGG CGCCATCTCC AACTGGCTCG GCGCCCGCAC CGGCGACGGT GTGGCGGCTT TCCCGGACAC CCTCACCCTG CAGTACAAGC TGGCCCAGGG GATGCGCTAC GGCGAGAACC CGCACCAGTC CGGCGCCTTC TACGTCGAGA AGGGGTCCAA GGAGGCCTCC ATCTCCACCG CGCGCCAGAT CCAGGGGAAG GAACTCTCCT ACAACAACAT CGGCGACACC GATGCGGCGC TCGAATGCGT GAAGCAGTTC ACGGAGCCTG CCTGCGTCAT CGTGAAGCAT GCGAACCCCT GCGGTGTCGC GCTCGGCGCG AACATCATGG AAGCCTATGA CAAGGCGTAC AAGACCGATC CCGAGTCCTC CTTCGGCGGC ATCATCGCCT TCAACCGCGA ACTGGACGAG TCCACCGCCC GCGCCATCGT CGAGCGCCAG TTCGTCGAAG TGATCATCGC CCCCAAGGTG ACCGAGGCCG CGAGCGAAGT GGTCGCGGCG AAGAAGAACG TCCGCCTCAT GGAGTGCGGC TTCTGGCCCG AGAATCCGGC GCCCCGTTTC GATTACAAGA GGGTGAACGG CGGCATGCTG GTCCAGGACG CCGACCTCGA ACTCTTCACC GAATTGAAGG TGGTGACCAA GAGGGCGCCG ACCGACAAGG AGATGGAAGA CCTTCTCTTC ACCTGGCGCG TGGCCAAGTT CGTCAAATCC AACGCCATCG TCTACGGCCG CGACAACTCC ACCGTCGGCG TCGGCGCAGG CCAGATGAGC CGCGTCAACT CCGCCCGCAT CGCCGCCATC AAGGCCGAGC ATGCCGGCAT TCCGGTCCAG GGTGCGGTCA TGGCGTCCGA CGCCTTCTTC CCGTTCAGGG ACGGTCTCGA CAACGCCGCC GCCGTAGGCG TCACCGCCGT GATCCAGCCC GGCGGCAGCA TGCGTGACGC CGAGGTCATC GCCGCGGCCG ACGAGCACGG CATCGCCATG GTCTTCACTG CGATGAGGCA CTTCAGACAC TGA
|
Protein sequence | MAKIGRALIS VSEKTGVVEF SRALAGYGVE ILSTGGTAKL LREAGIAVKD VSEFTGFPEM LDGRVKTLHP KVHGGILGMR ENPAHVAKMQ EHGIEPIDMV VVNLYPFEAT VAKEDCTMED AIENIDIGGP TMLRSAAKNN RDVTVVVDHA DYAVVLDEMK NSGGSVSCET NFRLAVKVYQ HTAAYDGAIS NWLGARTGDG VAAFPDTLTL QYKLAQGMRY GENPHQSGAF YVEKGSKEAS ISTARQIQGK ELSYNNIGDT DAALECVKQF TEPACVIVKH ANPCGVALGA NIMEAYDKAY KTDPESSFGG IIAFNRELDE STARAIVERQ FVEVIIAPKV TEAASEVVAA KKNVRLMECG FWPENPAPRF DYKRVNGGML VQDADLELFT ELKVVTKRAP TDKEMEDLLF TWRVAKFVKS NAIVYGRDNS TVGVGAGQMS RVNSARIAAI KAEHAGIPVQ GAVMASDAFF PFRDGLDNAA AVGVTAVIQP GGSMRDAEVI AAADEHGIAM VFTAMRHFRH
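The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned and its GC percentage computed is shown here. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, so its GC value is not the whole-gene figure reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence from the record (illustration only).
fragment = clean_sequence("ATGGCAAAGA TTGGGCGCGC")
print(len(fragment), gc_percent(fragment))
```

Applying the same two functions to the full gene sequence string would give a value to compare against the GC content field in the Gene Information table.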