Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0384 |
Symbol | purH |
ID | 4485795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 397253 |
End bp | 398848 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639729151 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_872144 |
Protein GI | 117927593 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACGT TCGATATTCC CGATACCGCT GAGGCGGGCC GGCGCCCGAT CCGCCGGGCG CTGGTCTCGG TGTATGACAA GACCTCCCTC GTTGAATTGG GCCGCGGTCT CGCGGCGGCC GGAGTCGAAA TCGTCTCAAC GGGAAGCACG GCGAGTGTCC TTGAAGAGGC AGGTGTTCCC GTCGTCCGGG TCGAAACGGT CACCGGATTT CCGGAATGCC TCGACGGCCG CGTCAAGACG CTGCATCCCG CGATTCATGC CGGGCTCCTT GCGGATGTCA CCCGGCCGGA ACACGCTCGG CAACTCAGTG ACCTCGGTGT GCGGCCCTTT GACCTGCTCG TGGTCAACCT CTACCCGTTC TCCGAGACCG TGGCGCGGGG AGCGTCAGCT GCGGAGGTCA TCGACCAGAT CGATATCGGC GGTCCGGCGA TGGTCCGCGC CGCGGCGAAG AATCACCACT GTGTCGCCGT CATCACCTCA CCCGCGCGCT ATCAGGACGT CTTGGATGCC GTGCATCACG GCGGTTTCAC GGACGCCGAG CGCCGGGCGC TCGCCGTTGA GGCGTTCGTG CACACGGCGT CCTATGACAT CGCCGTCGCG ACCTGGATGG GTGGTTCGTA CACGCCGACA GATGAGGGAA GCGGATTTCC CGAATGGCTC GGGGCAAGTT ACCGGCGCGC TGAGATATTG CGGTACGGCG AGAATCCGCA TCAACGGGCG GCCCTCTACC GGGCGGACCA CGCCGCTCCC GGGCTGGCGC AGGCGCGGGT TCTGCACGGC AAGGCCATGT CGTACAACAA CTACGTGGAC ACCGACGCCG CGCATCGGGC GGCGTACGAC TTCACCGAAC CGTGCGTCGC CATTGTGAAA CACGCAAACC CGTGCGGCAT AGCCGTCGGC CGGGACATCG CGGAAGCACA CCGTAAAGCC CACGCGTGTG ACCCCGTCTC GGCATATGGC GGCGTCATTG CGGCGAATCG GGAAGTCAGC GTCGCCATGG CGGAACAGAT TGCGGACATT TTCACGGAAG TGGTCTGTGC GCCGAGTTAC GCCGAGGGAG CCCTCGACAT TCTCACTGCG AAGAAGAATC TGCGCATTCT GCTCTGTCCC GGACTGCACC CTGCGCACCG GGTGCGGGAA TTCCGCCGGA TCAGCGGGGG GCTTCTGGTG CAGACCGTGG ACAACCTCGA CGCCGAGGGT GACGACCCGG CGAATTGGCA ACTGCGCGCC GGGGAGCCGG CGGACGACCA GACGCTCGCG GATTTGGAGT TCGCCTGGCG GGCGGTGCGG TCGGTAAAGT CGAACGCCAT TCTGCTTGCC GCCGACCGCG CATCGGTCGG CGTCGGAATG GGCCAGGTCA ATCGGGTGGA CGCCGCCCGC CTGGCGGTGC AACGGGCCGG CGACCGGGCG AAAGGCGCGG TCGCCGCATC GGACGCGTTC TTCCCGTTCG CCGACGGCCT GCAGGTTCTG ATCGACGCCG GAGTCCGAGC CGTGGTGGAG CCCGGCGGGT CCGTGCGTGA CGATGAGGTC GTCGAGGCTG CACGCCGAGC CGGTATCACC CTGTACTTCA CCGGAAGCCG TCATTTCTCC CACTGA
|
Protein sequence | MSTFDIPDTA EAGRRPIRRA LVSVYDKTSL VELGRGLAAA GVEIVSTGST ASVLEEAGVP VVRVETVTGF PECLDGRVKT LHPAIHAGLL ADVTRPEHAR QLSDLGVRPF DLLVVNLYPF SETVARGASA AEVIDQIDIG GPAMVRAAAK NHHCVAVITS PARYQDVLDA VHHGGFTDAE RRALAVEAFV HTASYDIAVA TWMGGSYTPT DEGSGFPEWL GASYRRAEIL RYGENPHQRA ALYRADHAAP GLAQARVLHG KAMSYNNYVD TDAAHRAAYD FTEPCVAIVK HANPCGIAVG RDIAEAHRKA HACDPVSAYG GVIAANREVS VAMAEQIADI FTEVVCAPSY AEGALDILTA KKNLRILLCP GLHPAHRVRE FRRISGGLLV QTVDNLDAEG DDPANWQLRA GEPADDQTLA DLEFAWRAVR SVKSNAILLA ADRASVGVGM GQVNRVDAAR LAVQRAGDRA KGAVAASDAF FPFADGLQVL IDAGVRAVVE PGGSVRDDEV VEAARRAGIT LYFTGSRHFS H
|
| |