Gene Information |
Locus tag | Teth514_0525 |
Symbol | purH |
ID | 5876089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 540731 |
End bp | 542257 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641540861 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001662169 |
Protein GI | 167039184 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
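The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(540731, 542257)
print(length)                     # 1527
print(protein_length_aa(length))  # 508
```

Under these assumptions both values agree with the Gene Length (1527 bp) and Protein Length (508 aa) fields listed above.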
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAA AAGCGTTAAT AAGTGTTTCA AAAAAAGAAG GCATAGTTGA ATTTGCCAAA AAGCTTAATG AATTAGGATA TGAAATAATA TCAACAGGTG GGACTTATAA TCTTCTTAAA GAAAATAGAG TAAATGTAGT AAAGGTATCA GACATAACAG GTTTCCCTGA AATTATGGAC GGGAGAGTAA AGACGCTTCA CCCTAAAATT CATGGAGGGC TTCTTGCAAT AAGAGACAAT GAAGAGCACA TTAAAGCTCT TAAAGAACAT GGCATTGAAC CAATAGATAT AGTTGTCATA AATTTATATC CTTTTAAAGA AACAATCCTC AAAGAAAATG TGACTTTAGA AGAAGCGATA GAAAATATAG ACATTGGCGG TCCTTCAATG ATAAGGGCTG CAGCTAAAAA CTACAAATAT GTAACTATTC TTGTAGATCC AAAGGATTAT GATACAGTCA TTGAAGAAAT AAAACAATAT GGCAATACAA AAGAAGAGAC GAGATTTTAT CTGGCAGCAA AAGCTTTTGG CCATACAGCC CTTTACGATT CTTTGATATA CAATTATTTG ATACAAAAAA ATAACATAGA GTTTCCCGAA GTTATGGCCT TTGCCTATGA AAAAGCTCAA GACATGAGAT ATGGTGAAAA TCCTCATCAA AAGGCGGCTT TTTACAAAAA TCCCATAAAA GCCTATGGCA TTGCAGAATG TGAGCAGCTG CACGGTAAAG AGCTCTCTTT TAACAATATA AACGATGCAA ATGCCGCAAT AGAACTTTTA AGAGAGTTTA AAGAGCCTGC AGCAGTTGCC GTAAAGCACA CAAATCCTTG TGGAGTAGCT ATTGCAGATA ATATATACAA TGCTTACTTA AAAGCTTATG AGAGTGACCC TGTTTCTATT TTTGGGGGAA TTGTAGCATT AAATAGAACT GTAGATGTTA AAACAGCTGA AGAACTTATA AAAATATTCT TAGAAATAGT AATTGCTCCA GACTTTGAAG AAGAGGCCTT TGAGATTTTA AAGAAAAAGA AAAACTTAAG GATATTGAGG TTAAAAGAAG GATATGAAAA AGAATATGAT TTAAAGAAAG TAGAAGGCGG ACTTTTAGTA CAAGAAAAAG ATGAAATAGA TTTAGATGAG AATAATTTGA AAGTAGTTAC TAAAAAAGCA CCTACGCAAA AAGAGATGGA AGATTTAAGG TTTGCCTGGA AGGTAGTAAA GCACGTAAAA TCTAATGCGA TTGTTTTAGC AAAAGATGGA GCTACAGTAG GTATTGGCGT TGGACAAGTC AACAGAATAT GGCCAACAGA GCAAGCAATC AAACAAGCAG GTAGCAAAGC AAAAGGAAGT GTCCTTGCAT CAGATGCCTT TTTCCCATTT CCAGATGTTG TGGAAGCAGC TGTAAAAGGC GGTATAACTG CCATAATCCA ACCAGGCGGC TCACAAAACG ATGCTTTATC AATTGAAGCT GCAGATAAAG GCGGCGTATC AATGATATTC ACAGGCATAA GGCATTTTAA ACATTGA
Protein sequence | MAKKALISVS KKEGIVEFAK KLNELGYEII STGGTYNLLK ENRVNVVKVS DITGFPEIMD GRVKTLHPKI HGGLLAIRDN EEHIKALKEH GIEPIDIVVI NLYPFKETIL KENVTLEEAI ENIDIGGPSM IRAAAKNYKY VTILVDPKDY DTVIEEIKQY GNTKEETRFY LAAKAFGHTA LYDSLIYNYL IQKNNIEFPE VMAFAYEKAQ DMRYGENPHQ KAAFYKNPIK AYGIAECEQL HGKELSFNNI NDANAAIELL REFKEPAAVA VKHTNPCGVA IADNIYNAYL KAYESDPVSI FGGIVALNRT VDVKTAEELI KIFLEIVIAP DFEEEAFEIL KKKKNLRILR LKEGYEKEYD LKKVEGGLLV QEKDEIDLDE NNLKVVTKKA PTQKEMEDLR FAWKVVKHVK SNAIVLAKDG ATVGIGVGQV NRIWPTEQAI KQAGSKAKGS VLASDAFFPF PDVVEAAVKG GITAIIQPGG SQNDALSIEA ADKGGVSMIF TGIRHFKH
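The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch using only the standard library rather than the tool that generated this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), and it assumes the input contains only A, C, G and T. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with the
# amino acid assignments shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCCAAAA AAGCGTTAAT AAGTGTTTCA"))  # MAKKALISVS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check, and `gc_percent` on the same input can be compared with the GC content field.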