Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0084 |
Symbol | purH |
ID | 3908726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 88475 |
End bp | 90067 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881965 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_483707 |
Protein GI | 86747211 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.309239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.165447 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TTCCGCGCCG CGTGACCCGC GCTCTGTTGT CCGTTTCCGA CAAGACCGGG CTGGTCGATT TCGCCCGCGC GCTGGCCGGC CACGGCGTCG AACTGGTCTC GACCGGCGGC ACCGCCAAGG CGATCGCGGC GGCCGGGCTG CCGGTCAAGG ACGTCTCCGA GATCACCGGC TTTCCCGAGA TGATGGACGG CCGGGTCAAG ACGCTGCATC CCAAGGTGCA TGGCGGCCTG CTGGCGGTCC GCGACAATGA CGAGCACAAG CAGGCGATGG CGGCGCACGG CATCGCCCAG ATCGACCTCC TCGTGGTCAA TCTGTATCCG TTCGAGGCCA CCGTCGACAA AGGCGGCTCC TACGAGGACT GCATCGAGAA CATCGACATC GGCGGCCCGG CGATGATCCG CGCCGCAGCG AAGAATCACG ACGACGTCGC GGTGATCGTC GAATCGTCCG ACTATCAGGC GGTGCTCGAC GAACTCGCGG CCAATGCCGG CGCCACCTCG CACGGCTTGC GCAAGCGCCT CGCCGCCAAG GCCTATGCCC GCACCGCGGC CTACGACGCC GCGATCTCCA ATTGGTTCGC GCAGCAATTG AAGACCGATG CGCCGGATTT CCGCGCGATC GGCGGCCGGC TGATCCAGAG CCTGCGCTAC GGCGAGAACC CGCATCAGAC CGCGGCGTTC TACGCCACCC CGGAGAAGCG TCCGGGCGTC GCCACCGCGC GGCAGGTGCA GGGCAAGGAA CTGTCCTACA ACAACATCAA CGATACCGAC GCGGCCTATG AATGCGTCGG CGAGTTCGAC GCCAAGCGCA CTGCAGCCTG CGTCATCGTC AAGCACGCCA ATCCCTGCGG CGTCGCCGAA GGATCGAGCC TGCTCGATGC CTATCGCAAG GCGCTGGCGT GCGATTCGAC CTCGGCGTTC GGCGGCATCG TCGCGCTCAA CCGCACGCTC GACGCCGAAG CCGCACGCGC GATCGTCGAG ATCTTCACCG AAATGATCAT CGCGCCCGAG GCGAGCGAGG AAGCGATCGC GATCGTGGCG GCGAAGAAAA ACTTGCGGCT GCTGCTGGCC GGCAGCCTGC CCAACCCGCG CGCCGCCGGC CTGACCTACA AGAGCGTGTC CGGAGGGCTG CTGGTGCAGT CGCGCGACAA TGCGGTGGTC GACGACATGG CGCTCAAGGT CGTCACCAAG CGGCAGCCGA GCGAGGCCGA ACTGCGCGAC CTGAAATTCG CCTTCCGGGT CGCCAAGCAC GTCAAGTCCA ACACCATCAT CTACGCCAAG GATCTGGCCA CCGTCGGCAT CGGCGCCGGC CAGATGAGCC GGGTCGATTC CGCCCGCATT GCCGCGCGAA AAGCGCAGGA TGCCGCCGCC GAGCTGAAAC TCGCGGCGCC GATGACCAAG GGCTCGGTGG TGGCATCGGA CGCGTTCTTC CCGTTCGCCG ACGGCATGCT CGCCTGCATC GAAGCCGGCG CCACCGCGGT GATCCAGCCC GGCGGCTCGG TGCGCGACGA CGAAGTCATC AAGGCCGCGG ACGACGCCGG CATCGCCATG GTGTTCACCG GGACCAGGCA TTTCCGGCAT TGA
|
Protein sequence | MTDLPRRVTR ALLSVSDKTG LVDFARALAG HGVELVSTGG TAKAIAAAGL PVKDVSEITG FPEMMDGRVK TLHPKVHGGL LAVRDNDEHK QAMAAHGIAQ IDLLVVNLYP FEATVDKGGS YEDCIENIDI GGPAMIRAAA KNHDDVAVIV ESSDYQAVLD ELAANAGATS HGLRKRLAAK AYARTAAYDA AISNWFAQQL KTDAPDFRAI GGRLIQSLRY GENPHQTAAF YATPEKRPGV ATARQVQGKE LSYNNINDTD AAYECVGEFD AKRTAACVIV KHANPCGVAE GSSLLDAYRK ALACDSTSAF GGIVALNRTL DAEAARAIVE IFTEMIIAPE ASEEAIAIVA AKKNLRLLLA GSLPNPRAAG LTYKSVSGGL LVQSRDNAVV DDMALKVVTK RQPSEAELRD LKFAFRVAKH VKSNTIIYAK DLATVGIGAG QMSRVDSARI AARKAQDAAA ELKLAAPMTK GSVVASDAFF PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGTRHFRH
|
| |