Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0090 |
Symbol | purH |
ID | 6742873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 84037 |
End bp | 85557 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642749874 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002120760 |
Protein GI | 195952470 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCAC TTATATCTGT ATACGATAAA ACCGGTATTT TAGAACTGGC TAAGGAGCTT TTGAACCAAG GGTATGAGAT TCTATCCAGT GGTGGGACAT ACACTTATCT AAAAAATGCT GGTGTTGATG CCATAGAGGT ATCTGAGGTA ACAGGTTTTA GAGAGATTTT AGGTGGTAGA GTTAAGACGC TTCATCCCGC TATACACGGT GGTATTTTGT TTAGAGAAGA CGTAGAAAAA GATTTAGAAG AAATAAAAGA AAACTCTATA GAACCTATTG ATATCGTAGT GGTGAACTTG TATCCTTTTG AAAAGAAGAT GAAAGAGCTA AAAGATATAG ATGCTCTTGT GGAGTTTATA GACATAGGGG GTCCTACTCT TGTAAGAGCC GCTGCTAAAA ATCACAAACG AGTTAGTGTA CTTACAGATA TCGAAGATTA CGGATGGTTT ATAGAAAAGC TCAAAATGAA TGCTGTATCT CAGCAAGACA GAAAATACTT AGCTTTGAAA GCTTTTTGGT TAACATCTTA CTATGATGCT GTTATAGCTA GTTATTTTTC CAAAGTATTT GGCTTTTCAG AAAAAGATTT TAAGCATCAT ACTGTACCTA TGTTTTTGAG AGATGAATTG AGATACGGTG AAAATCCACA TCAGCAAGCT TACCTATATG AAAATCCGTT GGAAGAAAAT GGTATTGTAA GAGCTGATGT GCTTCAAGGT AAAAAGATGT CTTACAACAA CTATCTTGAT GCCGATTCTG TTGTAAAGCT CATGTCAGAG TTTTCTAATC CTTGCTGTGC TATCGTAAAA CACAACAATC CAAGCGGTAT AACCACAGAC AACAATATTC TGGAAGCTTA CAAAAAAGCT TTTCAATGTG ATCCTGAGGC GGCGTTTGGT GGTATCGTAG CTTTTAACAA GGTTGTAGAT AAAGACGTTG CTAAGGCTAT TACAGAGCAT TTTTATGAGA TAGTTATAGC TCCTGAGTTT ACCGAAGAAG CTGTTGAAGA GTTTTCCAAA AAGAAGAATT TAAGGTTAGT AAGGTATAAA AACTACAATC AGAATATTGA TCTAAGGAGT ATATCTGGCG GGTTTTTAGT GCAGGACATA GATGATAAGC TTTATGAGTC TATAGAGATA GTTTCATTGA GAAGACCTAC TGAGCAAGAG TTAGAGGATG CTATATTTGC TTGGAAAGTG GCAAAATGGA CAAAATCAAA TGCTATAGTG ATAGCTAAAA ACAATCAAAC CATAGGTATA GGCGCTGGTC AGGTGTCGAG GGTAGATTCT CTTAGAAGCG CTATAAGAAA AGCAAAAAAC TTTTCCCATG ATTTAAAAGG CGCCGTAGTG GCCTCAGACG CTTTTTTCCC GTTTAGAGAT AGCATAGATA TAGCTGCTGA AGAAGGAATA TCTGGTACAA TACAACCTGG TGGTTCTATA AGAGACAAAG AAGTTATAGA GGCTGTAAAT GAGCATAACA TGTTTATGAT ATTTACCCAT ATGAGGCATT TCAGACATTG A
|
Protein sequence | MRALISVYDK TGILELAKEL LNQGYEILSS GGTYTYLKNA GVDAIEVSEV TGFREILGGR VKTLHPAIHG GILFREDVEK DLEEIKENSI EPIDIVVVNL YPFEKKMKEL KDIDALVEFI DIGGPTLVRA AAKNHKRVSV LTDIEDYGWF IEKLKMNAVS QQDRKYLALK AFWLTSYYDA VIASYFSKVF GFSEKDFKHH TVPMFLRDEL RYGENPHQQA YLYENPLEEN GIVRADVLQG KKMSYNNYLD ADSVVKLMSE FSNPCCAIVK HNNPSGITTD NNILEAYKKA FQCDPEAAFG GIVAFNKVVD KDVAKAITEH FYEIVIAPEF TEEAVEEFSK KKNLRLVRYK NYNQNIDLRS ISGGFLVQDI DDKLYESIEI VSLRRPTEQE LEDAIFAWKV AKWTKSNAIV IAKNNQTIGI GAGQVSRVDS LRSAIRKAKN FSHDLKGAVV ASDAFFPFRD SIDIAAEEGI SGTIQPGGSI RDKEVIEAVN EHNMFMIFTH MRHFRH
|
| |