Gene Information |
Locus tag | RPD_0719 |
Symbol | purH |
ID | 4021192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 808392 |
End bp | 809984 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637960908 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_567858 |
Protein GI | 91975199 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
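
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon. Both assumptions are consistent with the numbers shown here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 808392
END_BP = 809984


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1593 530
```

Under these assumptions the result matches the Gene Length (1593 bp) and Protein Length (530 aa) fields.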
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATC ATCCGCGCCG CGTGACCCGC GCTCTGTTGT CCGTTTCCGA TAAGTCCGGG CTGATCGACT TTGCCCGCGC GCTGTCCGGC CACGGCGTCG AACTGGTCTC GACCGGCGGC ACCGCCAAGG CGATCGCGGC GGCGGGGCTT GCGGTCAAGG ACGTCTCCGA GCTGACCGGC TTTCCCGAGA TGATGGACGG TCGGGTCAAG ACGCTGCATC CGAAGGTGCA TGGCGGCCTG CTGGCGATTC GCGACAATGC CGAGCACAAG CAGGCGATGG CCGCGCACGG CATCGCGCAG ATCGATCTGC TGGTGGTCAA TCTCTATCCA TTCGAGGCGA CCGTCGACAA AGGCGCGTCC TACGAGGATT GCATCGAGAA CATCGATATC GGCGGTCCGG CGATGATCCG CGCGGCGGCG AAAAATCACG ACGACGTCGC GGTGGTGGTC GAGGCGTCCG ATTATTCTGC GGTGCTCGAC GAACTCGCCG CCAATGCCGG CGCGACCTCG CTCGATCTGC GCAAGCGCCT CGCCGCCAAG GCCTATGCCC GCACCGCGGC CTATGACGCG GCGATCTCGA ACTGGTTCGC GCTGCAGCTC GAGACCGACG CGCCGGATTT CCGCGCGATC GGCGGCCGGC TGATCCAGAG CCTGCGCTAC GGCGAGAACC CGCACCAGAG CGCCGCGTTC TACGCCACGC CGGAGAAGCG TCCGGGTGTC GCCACCGCGC GCCAGGTGCA GGGCAAGGAG CTGTCCTACA ACAACATCAA CGACACCGAC GCCGCCTATG AATGCGTCGG CGAATTCGAC GCCGGGCGCA CCGCCGCCTG CGTCATCGTC AAGCACGCCA ACCCCTGCGG CGTCGCCGAA GGGGCGAGCC TGTTTGAGGC CTATCGCAAG GCTCTGGCCT GCGATTCGAC CTCCGCCTTC GGCGGCATCG TCGCGCTCAA CCGCACGCTC GATGCCGAAG CGGCGCGCGC GATTACCGAG ATCTTCACCG AAGTGATCAT CGCGCCGGAC GCCAGCGAAG AGGCGATTGC GATCGTCGCT GCGAAGAAGA ATTTGCGGCT GCTGCTGGCG GGCGCGCTGC CCGATCCGCG CGCCATCGGC CTCACCTACA AGACCGTCGC CGGCGGCCTC TTGGTGCAGT CGCGCGACAA CGCCGTGGTC GACGACATGG CGCTGAAGGT CGTTACCAAG CGGCAGCCGA CCGAGGCGGA GCTGCGCGAT CTGAAATTCG CCTTCCGCGT CGCCAAGCAC GTCAAGTCGA ACACCATCAT CTATGCCAAG GACCTCGCCA CCGTCGGCAT CGGCGCCGGC CAGATGAGCC GCGTCGATTC CGCCCGCATC GCCGCCCGCA AGGCGCAGGA TGCCGCTACC GAGTTGAAGC TCGCCGCGCC GATGACCAAG GGCTCGGTGG TGGCCTCCGA CGCGTTCTTC CCGTTCGCCG ACGGGATGCT CGCCTGTATC GAGGCCGGCG CCACCGCGGT GATCCAGCCC GGCGGCTCGG TCCGCGACGA CGAAGTGATC AAGGCCGCGG ACGATGCCGG CATCGCGATG GTGTTCACCG GCACCCGGCA TTTCCGGCAC TAG
|
Protein sequence | MTHHPRRVTR ALLSVSDKSG LIDFARALSG HGVELVSTGG TAKAIAAAGL AVKDVSELTG FPEMMDGRVK TLHPKVHGGL LAIRDNAEHK QAMAAHGIAQ IDLLVVNLYP FEATVDKGAS YEDCIENIDI GGPAMIRAAA KNHDDVAVVV EASDYSAVLD ELAANAGATS LDLRKRLAAK AYARTAAYDA AISNWFALQL ETDAPDFRAI GGRLIQSLRY GENPHQSAAF YATPEKRPGV ATARQVQGKE LSYNNINDTD AAYECVGEFD AGRTAACVIV KHANPCGVAE GASLFEAYRK ALACDSTSAF GGIVALNRTL DAEAARAITE IFTEVIIAPD ASEEAIAIVA AKKNLRLLLA GALPDPRAIG LTYKTVAGGL LVQSRDNAVV DDMALKVVTK RQPTEAELRD LKFAFRVAKH VKSNTIIYAK DLATVGIGAG QMSRVDSARI AARKAQDAAT ELKLAAPMTK GSVVASDAFF PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGTRHFRH
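
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips the spacing, computes a GC fraction, and translates codons until the first stop. It uses the standard codon assignments, which translation table 11 shares with table 1. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGACCCATC ATCCGCGCCG CGTGACCCGC"
print(translate(prefix))  # MTHHPRRVTR
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.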