Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AFE_2260 |
Symbol | purH |
ID | 7135650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 23270 |
Kingdom | Bacteria |
Replicon accession | NC_011761 |
Strand | + |
Start bp | 2012238 |
End bp | 2013812 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643530628 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002426658 |
Protein GI | 218666571 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.258744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGATTTA TGGGTGAGAT TACAAGGGCG CTCATCAGCG TTTCCGACAA GCGGGGCGTT GTAGAATTTG CCCGGCGGTT GCAGGATTTT GGGGTAGAGA TACTCTCTAC CGGCGGTACC GCCAAAGCTC TGATGGCCGA TGGTGTCGCG GTGCAGGAAG TGGGCGACTA CACGGGTTTC CCGGAACTAC TGGAGGGCCG CCTCAAAACC CTGCACCCCA AAATCCACGG CGGACTGCTG GCGAAGCGCG ACGACAGCAG CCACACCCGG CAGATGGCCG AGTACGGCAT CCCCGCGATC GATCTCCTCT GCGTCAATCT CTACCCCTTC GCCGAGACCA TCGCCAGTGC CGATTGCACA CTGGAAGAAG CCATGGAAAA CATCGATATC GGCGGCCCGA CCATGCTCCG TGCGGCGGCG AAGAACTGGG AGGGCGTCAC TGTCCTCGTC GACCCCGATG ACTATGCCGC TGTGTTGCAG GAAATGGAAC AGAGTTACGG CGGCGTCGGC GCCAGTACCC GCTTCCGCCT GGCCACCAAG GTCTTCGCCC ACACGGCGCG CTATGACGGT GCTATCGCCA ACTATCTCTC CAGCCTGGGT CCCGATGGCA ACCGGACAAC CTTCCCGCAG ACTCTGTCCC TGCAATTTAA GAAAGCGCAG GATCTGCGCT ACGGCGAAAA TCCTCATCAG GCCGCCGCCT TCTACCGCGA TGGCAGCGGC GGCGGACTGG CGGACGCCCA TCAGTTGCAA GGCAAGGAAC TGTCTTACAA CAATATCGGG GACGGTGATG CCGCCGTCGC GCTGGTGATG GAATTTGCCG AACCCGCCTG TTGCGTGGTG AAGCATGGCA ATCCCTGCGG CGTGGCCGTG GGGCCGGATC TGCTCGGTGC CTATCAGCGC GCATGGGCCG GCGATCCGAT ATCCGCCTTC GGCGGCATCG TCGCCTGTAA CCGGCCGCTG GATGCACAGA CTGCCGAACT CATTAGCGAT CAGTTCATCG AGATGGTACT GGCGCCCGCT ATTTTGCCCG ATGCCCGGCC CATTCTGGCC AAAAGGAAAA ACCTGCGGGT GCTCGCCTTT GACGATGGCC GCGCCTGGCG GCGGACAGGC TGGGATTACA AGCGTGTGCG GGGGGGGTTG TTGGTACAGA ACTTTGACCA GGCCATGGAA GCGGAAACGG ACTGGAAAGT GGTCTCGGAA CGCGCACCGA CGGTACAGGA AGCCCGTGAT CTCGCCTTTG TCTGGCGGGT CGGTAAATAC GTGCGCTCCA ACGCCATTGT CTATGGCCGA GAAGGCCAGA CCGTCGGCAT CGGTGCAGGA CAGATGAGCC GGGTGGACGC GGCCAGATGC GGCGTAGCCA AGGCCCTGGA ACTGGGCTTC GATCTGCACG GGGCAGCGCT GGCTTCTGAC GCGTTCTTCC CCTTCCGCGA TGGGATCGAT GCGGCGGCGG CTGCGGGCGT AAAGGCGATC ATTCAACCCG GCGGCTCCAT CCGCGATGAA GAAGTCATCG CCAGCGCCAA TGAACACGGC ATCGCCATGG TCTTCACCGG CGTGCGCCAT TTCCGACATG GTTGA
|
Protein sequence | MGFMGEITRA LISVSDKRGV VEFARRLQDF GVEILSTGGT AKALMADGVA VQEVGDYTGF PELLEGRLKT LHPKIHGGLL AKRDDSSHTR QMAEYGIPAI DLLCVNLYPF AETIASADCT LEEAMENIDI GGPTMLRAAA KNWEGVTVLV DPDDYAAVLQ EMEQSYGGVG ASTRFRLATK VFAHTARYDG AIANYLSSLG PDGNRTTFPQ TLSLQFKKAQ DLRYGENPHQ AAAFYRDGSG GGLADAHQLQ GKELSYNNIG DGDAAVALVM EFAEPACCVV KHGNPCGVAV GPDLLGAYQR AWAGDPISAF GGIVACNRPL DAQTAELISD QFIEMVLAPA ILPDARPILA KRKNLRVLAF DDGRAWRRTG WDYKRVRGGL LVQNFDQAME AETDWKVVSE RAPTVQEARD LAFVWRVGKY VRSNAIVYGR EGQTVGIGAG QMSRVDAARC GVAKALELGF DLHGAALASD AFFPFRDGID AAAAAGVKAI IQPGGSIRDE EVIASANEHG IAMVFTGVRH FRHG
|
| |