Gene Information |
Locus tag | GWCH70_0263 |
Symbol | purH |
ID | 7976129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 297341 |
End bp | 298879 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644797258 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002948458 |
Protein GI | 239825834 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
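The length fields in the table above can be cross-checked against the coordinates. The sketch below is only a consistency check, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the numbers fit that convention) and that the final codon is an untranslated stop. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 297341
end_bp = 298879

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa (remainder {remainder})")
```

Under these assumptions the computed values agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields.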
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAA AACGGGCATT ATTAAGTGTA TCGAACAAAG AAGGAATCGT ATCATTGGCG AAACAGTTAG TGGAACTCGG TGTGGAAATT ATTTCAACAG GCGGAACGAA AAAAACGTTA GAAGAAGCGG GAGTTCCTGT CATAGGCATT TCTGATGTTA CCGGATTTCC GGAAATTTTA GATGGGCGCG TCAAAACATT GCATCCGATG ATCCATGGGG GACTTCTTGC GATTCGCGAC AATGAGCGCC ATCAAAGCGA GCTTCGCGAG CATCACATTA CCCCAATTGA CCTTGTTGTC GTCAATTTGT ACCCGTTTCA ACAAACGATC GCCAAAAGCG ATGTGACGTT TGCCGAAGCG ATCGAAAACA TTGATATTGG CGGCCCGACG ATGCTGCGGG CGGCGGCGAA AAACCATCAA TATGTGACGG TTGTCGTTGA CCCGGCCGAC TATGATACGG TAGTGCAAGA ACTAAAAAAA CACGGAGATG TCTCGGTAGA AACGAAATTA AAGCTTGCGG CAAAAGTATT TCGCCATACA GCTGCGTACG ATGCGATGAT TGCGGAGTAC TTAACGAATA AAACTGGAGA AGAGTATCCG GAATCATTGA CGATCACTTT CGAGAAAAAA CAAGCGTTGC GTTATGGGGA AAACCCTCAC CAAACTGCGG CGTTTTACAA AAAACCGTTA GGCGCATCTT TCTCGATTGC CCAAGCAATG CAGCTGCATG GAAAAGAATT GTCATATAAC AACATTAACG ATGCGAATGC GGCATTGCAA ATCGTCAAAG AATTTACCGA ACCAGCCGCG GTGGCGGTGA AGCATATGAA TCCGTGCGGT GTCGGTGTTG GCGCAACGAT TTACGAAGCG TTCACCAAAG CGTACGAAGC GGATCCAACC TCGATTTTTG GCGGCATTAT CGCCCTAAAC CGCGAGGTCG ATAAAGAAAC TGCCGAAAAA ATGCATGAAA TCTTTTTAGA AATCGTGATT GCCCCATCGT TTAGCAAAGA AGCGCTCGAT ATTTTAACAC AAAAGAAAAA CATTCGCCTT CTTACCGTTG ATTTCACGGC GCCAAACACG AAAGAGAAGC TGCTTGTTTC CGTACAAGGC GGATTGCTTG TTCAAGAAAC GGATACGCAT ACGCTCGATG ACGCGGAGAT TAAAGTTGTA ACGAAACGCG AGCCGACGGA ACAAGAATGG GAAGCGTTGC GGTTTGCATG GAAAGTGGTG AAACATGTGA AATCGAACGC GATCGTGCTT GCAAAAGACG GAATGACCAT CGGCATTGGT GCCGGTCAAA TGAATCGCGT CGGCGCGGCG AAAATTGCGA TTGAACAAGC AGGGGAAAAA GCAAAAGGAG CCGTGCTTGC CTCTGACGCG TTTTTCCCAA TGGATGATAC GGTGGAGGCG GCAGCGAAAG CAGGAATCAC AGCAATCATC CAGCCAGGCG GCTCGATTCG CGACGCCGAT TCGATCAAAA AAGCAGATGA ATACGGAATT GCCATGGTCT TTACAGGAAT CCGCCATTTT AAACATTAA
|
Protein sequence | MAVKRALLSV SNKEGIVSLA KQLVELGVEI ISTGGTKKTL EEAGVPVIGI SDVTGFPEIL DGRVKTLHPM IHGGLLAIRD NERHQSELRE HHITPIDLVV VNLYPFQQTI AKSDVTFAEA IENIDIGGPT MLRAAAKNHQ YVTVVVDPAD YDTVVQELKK HGDVSVETKL KLAAKVFRHT AAYDAMIAEY LTNKTGEEYP ESLTITFEKK QALRYGENPH QTAAFYKKPL GASFSIAQAM QLHGKELSYN NINDANAALQ IVKEFTEPAA VAVKHMNPCG VGVGATIYEA FTKAYEADPT SIFGGIIALN REVDKETAEK MHEIFLEIVI APSFSKEALD ILTQKKNIRL LTVDFTAPNT KEKLLVSVQG GLLVQETDTH TLDDAEIKVV TKREPTEQEW EALRFAWKVV KHVKSNAIVL AKDGMTIGIG AGQMNRVGAA KIAIEQAGEK AKGAVLASDA FFPMDDTVEA AAKAGITAII QPGGSIRDAD SIKKADEYGI AMVFTGIRHF KH
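The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how the gene sequence relates to the protein sequence, the snippet below strips the spacing, computes a GC fraction, and translates codons using the amino acid assignments of translation table 11 (which match the standard genetic code; table 11 differs in its permitted start codons). Only the first three blocks and the last nine bases of the gene sequence are used, to keep the example short. The helper names `clean`, `gc_fraction` and `translate` are hypothetical, not part of any database tooling.

```python
# Build the codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and final nine bases of the gene sequence shown above.
gene_start = "ATGGCAGTAA AACGGGCATT ATTAAGTGTA"
gene_end = "AAACATTAA"

print(translate(gene_start))
print(translate(gene_end))
print(round(gc_fraction(gene_start), 3))
```

The translated prefix and suffix can be compared with the first block and the last two residues of the protein sequence. Running `gc_fraction` on the full gene sequence would give the value to compare with the GC content field; the prefix alone is not representative.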
|
| |