Gene Information |
Locus tag | Rpal_0029 |
Symbol | purH |
ID | 6407670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 31529 |
End bp | 33121 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642709936 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001989067 |
Protein GI | 192288462 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
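The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but does not encode an amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(31529, 33121)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed values agree with the Gene Length (1593 bp) and Protein Length (530 aa) fields.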
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGA ATCCCCGTCG CGTCACCCGT GCCCTGTTGT CCGTGTCCGA TAAGACCGGT CTGATCGATT TCGCCCGTGC GCTCGCCGGC CATGGCGTCG AACTGGTTTC GACCGGTGGC ACCGCCAAGG CGATCGCTGC GGCCGGCCTT CCGGTCAAGG ACGTCTCCGA GCTGACCGGT TTCCCTGAGA TGATGGATGG TCGGGTCAAG ACGCTGCATC CGAAGGTGCA TGGCGGCCTG CTGGCGATTC GCGACAACGA CGAACACACC CAGGCGATGG CCGCGCACGG CATCCCGCAG ATCGACCTCC TGGTGGTGAA CCTCTATCCG TTCGAAGCTA CCGTCGACAA AGGCGCCTCT TACGAGGACT GCATCGAGAA CATCGACATC GGCGGCCCGG CGATGATCCG CGCCGCCGCC AAGAACCACG ACGACGTCGC GGTTGTGGTC GAGGCCAGCG ATTATCAGGC GGTGCTGGAT GAGCTGACTG CCAACAACGG CGCCACCACG CTGCCACTGC GCAAGCGCCT CGCTGCCAAG GCCTATGCGC GCACGGCCGC TTATGATGCG GCGATCTCCA ACTGGTTCGC GCTGCAGCTC AAGACCGATG CGCCGGACTT CCGCGCGATC GGTGGACGGC TGATCCAGAG CCTCCGCTAC GGCGAAAATC CGCACCAGAC CGCGGCGTTC TACGCCACGC CGGAGAAGCG TCCGGGCGTT GCCACCGCGC GCCAGGTGCA GGGCAAGGAG CTGTCCTACA ACAACATCAA CGACACCGAC GCCGCCTATG AGTGCGTCGG CGAGTTCGAC GCGGCGCGTA CCGCGGCCTG CGTCATCGTC AAGCACGCCA ATCCTTGCGG CGTCGCCGAG GGATCGAGCC TGCTCGACGC CTACAAGAAG GCGCTGGCTT GCGACTCCGT CTCGGCGTTC GGAGGCATCG TCGCGCTCAA CCGCACGCTC GACGCCGAAG CGGCGCGCGC CATCACCGAG ATCTTCACCG AAGTGATCAT CGCGCCGGAC GCCACCGACG AAGCGATCGC GATCGTCGCG GCGAAGAAGA ACCTGCGGCT GCTGCTGGCG GGCGCGCTGC CCGATCCGCG CGCCAACGGT CTGACCTACA AGACCGTCGC CGGCGGCCTG CTGGTGCAGA GCCGCGACAA TGCGGTGGTC GATGACATGG CGCTGAAGGT CGTCACCAAG CGGCAGCCGA CCGAAGCCGA GCTGCGTGAC CTGAAGTTCG CGTTCCGCGT CGGCAAGCAC GTCAAGTCCA ACACCATCAT CTATGCCAAG GACCTCGCCA CTGTCGGTAT CGGTGCCGGT CAGATGAGCC GCGTCGACTC CGCCCGCATC GCCGCCCGCA AGGCGCAGGA TGCGGCCGAG GCGATGAAGC TCGCCGCGCC GATGACCAAG GGCTCGGTGG TGGCCTCCGA CGCGTTCTTC CCGTTCGCCG ACGGCATGCT GGCCTGTATC GAAGCCGGCG CCACCGCGGT GATCCAGCCC GGCGGCTCGG TTCGCGACGA CGAAGTGATC AAGGCTGCGG ACGACGCCGG CATCGCCATG GTGTTCACCG GCACCCGGCA CTTCCGACAC TAA
|
Protein sequence | MTQNPRRVTR ALLSVSDKTG LIDFARALAG HGVELVSTGG TAKAIAAAGL PVKDVSELTG FPEMMDGRVK TLHPKVHGGL LAIRDNDEHT QAMAAHGIPQ IDLLVVNLYP FEATVDKGAS YEDCIENIDI GGPAMIRAAA KNHDDVAVVV EASDYQAVLD ELTANNGATT LPLRKRLAAK AYARTAAYDA AISNWFALQL KTDAPDFRAI GGRLIQSLRY GENPHQTAAF YATPEKRPGV ATARQVQGKE LSYNNINDTD AAYECVGEFD AARTAACVIV KHANPCGVAE GSSLLDAYKK ALACDSVSAF GGIVALNRTL DAEAARAITE IFTEVIIAPD ATDEAIAIVA AKKNLRLLLA GALPDPRANG LTYKTVAGGL LVQSRDNAVV DDMALKVVTK RQPTEAELRD LKFAFRVGKH VKSNTIIYAK DLATVGIGAG QMSRVDSARI AARKAQDAAE AMKLAAPMTK GSVVASDAFF PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGTRHFRH
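The sequences above are printed in space-separated blocks of 10 residues. A minimal sketch for removing that grouping and computing a GC fraction is shown below; the helper names are hypothetical, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to group residues into blocks and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two 10-base blocks of the gene sequence in the record above.
fragment = clean_sequence("ATGACCCAGA ATCCCCGTCG")
print(len(fragment), gc_fraction(fragment))
```

Applying the same two helpers to the complete gene sequence should give a value comparable to the GC content field in the record, which is reported rounded to a whole percent.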