Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0023 |
Symbol | purH |
ID | 3971448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 27674 |
End bp | 29266 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637923137 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_529921 |
Protein GI | 90421551 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC ACCCGCGCCG CGTGACCCGC GCTCTTCTGT CCGTTTCCGA TAAAGCGGGC CTGATCGACT TCGCCCGCGC GCTGGTCGAC CACGGCGTCG AACTGGTCTC CACCGGCGGC ACCGCCAAAG CGATCGCCGC TGCCGGGCTC GCGGTCAAGG ATGTCTCCGA GCTCACCGGC TTTCCGGAAA TGATGGACGG CCGGGTCAAG ACCCTGCATC CGAAGGTGCA CGGCGGCCTG TTGGCGGTCC GCGGCAATGC CGAGCACGTC AAGGCGATGG CCGACCACGA CATCGCGCCG ATCGACCTGT TGGTGGTCAA CCTCTATCCG TTCGAGGCCA CCGTCGACAA AGGCGCCGGT TACGAAGACT GCATCGAGAA CATCGACATC GGCGGGCCGG CGATGATCCG CGCCGCTGCG AAAAACCACG ACGACGTCGC GGTGGTGGTG GAAGCCGCCG ATTATCAGGC GGTGCTCGAC GAACTCGCGG CCAACAAGGG CGCAACGACA CTGACCTTGC GCAAGAAGCT CGCCGCCAAG GCCTATGCGC GCACCGCGGC TTACGACGCG GCGATCTCCA ACTGGTTCGC CGATCAGCTG AAGACCGCGG CGCCGGATTT CCGCGCCATC GGTGGCCGGC TGATCCAGAG CCTGCGCTAC GGCGAAAACC CGCACCAGAG TGCTGCGTTC TACCGCACCC CGGATCACTG CCCGGGCGTC GCCACCGCGC GGCAGATCCA GGGCAAGGAA CTATCCTACA ACAACATCAA CGATACCGAC GCCGCCTATG AGTGCATCGG CGAGTTCGAC GCCACGCGCA CTGCGGCCTG CGTCATCGTC AAGCACGCCA ACCCCTGTGG TGTGGCGGAG GGCTCGAGCC TGCTGGCCGC CTATCGGTCG GCGCTGGCCT GCGATTCGAC CTCCGCGTTC GGCGGCATCG TGGCGCTGAA CCGCACCCTG GATGCCGAGG CCGCGCGCGC CATCACCGAG ATCTTCACCG AAGTGATCAT CGCGCCCGAC GCCACGGACG AGGCGATCGC GATCGTTGCC GCGAAGAAGA ATCTGCGGCT GCTGCTGGCC GGCCAATTGC CGGATCCGCG CGCGCCTGGG CTCACCTACA AGACGGTGGC CGGCGGTCTG TTGGTGCAGT CGCGCGATAA CGCCGTGGTC GAGGATATGG CGCTGAAGGC GGTTACCAAG CGGCAGCCGA CCGAGGCCGA GCTGCGCGAT CTGAAATTCG CCTTCCGGGT CGCCAAGCAC GTGAAGTCCA ACACGATTGT GTATGCGAAA GACCTCGCCA CCGTCGGCAT CGGCGCCGGC CAGATGAGCC GCGTCGACTC CGCGCGGATC GCCGCGCGCA AGGCCGAGGA TGCGGCGGCC GAGCTGAAGC TCGCCGCGCC GATGACCAAG GGCTCGGTGG TGGCCTCCGA TGCGTTCTTC CCGTTCGCCG ACGGCATGCT GGCCTGCATC GAGGCCGGCG CCACCGCGGT GATCCAGCCC GGCGGCTCGG TGCGCGACGA CGAGGTGATC AAGGCCGCCG ACGACGCCGG CATCGCCATG GTGTTCACCG GCGTGCGGCA TTTTAGGCAT TGA
|
Protein sequence | MTDHPRRVTR ALLSVSDKAG LIDFARALVD HGVELVSTGG TAKAIAAAGL AVKDVSELTG FPEMMDGRVK TLHPKVHGGL LAVRGNAEHV KAMADHDIAP IDLLVVNLYP FEATVDKGAG YEDCIENIDI GGPAMIRAAA KNHDDVAVVV EAADYQAVLD ELAANKGATT LTLRKKLAAK AYARTAAYDA AISNWFADQL KTAAPDFRAI GGRLIQSLRY GENPHQSAAF YRTPDHCPGV ATARQIQGKE LSYNNINDTD AAYECIGEFD ATRTAACVIV KHANPCGVAE GSSLLAAYRS ALACDSTSAF GGIVALNRTL DAEAARAITE IFTEVIIAPD ATDEAIAIVA AKKNLRLLLA GQLPDPRAPG LTYKTVAGGL LVQSRDNAVV EDMALKAVTK RQPTEAELRD LKFAFRVAKH VKSNTIVYAK DLATVGIGAG QMSRVDSARI AARKAEDAAA ELKLAAPMTK GSVVASDAFF PFADGMLACI EAGATAVIQP GGSVRDDEVI KAADDAGIAM VFTGVRHFRH
|
| |