Gene Information |
Locus tag | Rsph17025_2928 |
Symbol | purH |
ID | 5084528 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2985824 |
End bp | 2987413 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640484499 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001169119 |
Protein GI | 146278960 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
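
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly, but the listed gene length is consistent with it) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2985824, 2987413)
print(length, protein_length(length))  # 1590 529
```

Both results agree with the Gene Length (1590 bp) and Protein Length (529 aa) fields.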
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0919623 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0651041 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATC TTGTTCCCGT TGGCCGCGCC CTTCTGTCGG TTTCCGACAA GTCGGGGCTC CTCGACCTCG CACGCGCCCT GGCCGAGCTG GAGGTGGAGC TGATCTCGAC CGGCGGCACG GCCGCCACGC TGCGGGCCGC GGGGCTCAAG GTGCGCGACG TGGCCGAGGT CACGGGCTTC CCCGAGATGA TGGACGGCCG GGTCAAGACG CTGCATCCGA TGGTGCATGG CGGGCTTCTG GCGCTGCGCG ACGATGACGA GCATCTGGTG GCGATGGCCG CGCACGGGAT CGAGCCGATC GACCTCCTGG TGGTGAACCT CTATCCGTTC GAAGCGGCGG TGGCGCGCGG CGCCTCCTAC GATGACTGCA TCGAGAACAT CGACATCGGC GGTCCGGCCA TGATCCGGGC GGCGGCCAAG AACCACCGCT TCGTGAACGT CGTGACCGAC ACGGCCGACT ACAAGGCGCT GCTCGACGAG CTGCGCGCGC ACGACGGGGC CACGACGCTC GCCTTCCGGC AGAAGCTGGC GCTGACGGCC TATTCGCGCA CCGCCGCCTA TGATGCGGCC GTGTCGGCCT GGATGGCCGG GGCGCTGAAG TCCGAGGCGC CGCGCCGCCG CACCTTTGCC GGCACACTGG CCCAGACCAT GCGCTACGGC GAGAATCCGC ACCAGAAGGC GGCCTTCTAC ACCGACGGCT CGCACCGGCC GGGCGTCGCC ACCGCGAAAC AGTGGCAGGG CAAGGAGCTC TCCTACAACA ACATCAACGA CACCGATGCG GCCTTCGAGC TGGTGGCCGA GTTCGATCCC TCCGAGGGTC CGGCCTGCGT GATCGTCAAG CACGCCAACC CCTGCGGCGT GGCGCGGGGC GCGACGCTGG CCGAGGCCTA CGGGCGCGCC TTCGACTGCG ACCGCGTCTC GGCCTTTGGC GGCATCATCG CGCTGAACCA GCCGCTCGAC GCGGCGACGG CCGAAAAGAT CACCGAGATC TTCACCGAGG TGGTGATCGC CCCCGGCGCC GACGAGGAGG CCCGCGCGAT CTTCGCCGCC AAGAAGAACC TGCGGCTGCT GACGACCGAG GCCCTGCCCG ATCCGCTCGC GCCGGGGCTG GCGTTCAAGC AGGTGGCGGG CGGCTTCCTC GTGCAGGACC GCGACGCGGG CCATGTCGAT GCGCTCGACC TGAAGGTGGT GACGAAGCGC GCGCCTTCGG ACGCGGAACT GGCCGACCTG CTCTTCGCCT GGACCGTGGC CAAGCATGTC AAATCCAACG CCATCGTCTA TGTGAAGGAC GGCGCCACCG TGGGCGTGGG TGCGGGCCAG ATGAGCCGGG TCGATTCCAC CCGCATCGCC GCGCGCAAGT CGCAGGACAT GGCGCAGGCG CTCGGCCTCG CGCAGCCGCT GACTCAAGGC TCGGTCGTGG CCTCGGACGC CTTCTTCCCC TTCGCCGACG GCCTTCTCGC CGCCGCCGAG GCGGGGGCGA CGGCCATCAT CCAGCCCGGC GGCTCGATGC GCGACGACGA GGTGATCGCG GCGGCCGACG AGGCGGGCCT TGCGATGGTC TTCACCGGCC AGCGGCACTT CCGGCACTGA
|
Protein sequence | MTNLVPVGRA LLSVSDKSGL LDLARALAEL EVELISTGGT AATLRAAGLK VRDVAEVTGF PEMMDGRVKT LHPMVHGGLL ALRDDDEHLV AMAAHGIEPI DLLVVNLYPF EAAVARGASY DDCIENIDIG GPAMIRAAAK NHRFVNVVTD TADYKALLDE LRAHDGATTL AFRQKLALTA YSRTAAYDAA VSAWMAGALK SEAPRRRTFA GTLAQTMRYG ENPHQKAAFY TDGSHRPGVA TAKQWQGKEL SYNNINDTDA AFELVAEFDP SEGPACVIVK HANPCGVARG ATLAEAYGRA FDCDRVSAFG GIIALNQPLD AATAEKITEI FTEVVIAPGA DEEARAIFAA KKNLRLLTTE ALPDPLAPGL AFKQVAGGFL VQDRDAGHVD ALDLKVVTKR APSDAELADL LFAWTVAKHV KSNAIVYVKD GATVGVGAGQ MSRVDSTRIA ARKSQDMAQA LGLAQPLTQG SVVASDAFFP FADGLLAAAE AGATAIIQPG GSMRDDEVIA AADEAGLAMV FTGQRHFRH
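
The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is illustrative rather than the database's own method: it assumes the sequence is pasted in with the 10-base spacing shown above, and it uses the codon assignments of translation table 11, which match the standard code for elongation codons. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACGAATC TTGTTCCCGT TGGCCGCGCC"))  # MTNLVPVGRA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be compared in the same way.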