Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2763 |
Symbol | purH |
ID | 4897104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2904960 |
End bp | 2906549 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640113365 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001044637 |
Protein GI | 126463523 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC TCGTTCCCGT TGGCCGCGCC CTTCTGTCGG TTTCCGACAA GTCGGGTCTC CTCGACCTCG CCCGTGCCCT GGCAGATCTG GAGGTGGAGC TGATCTCGAC CGGCGGCACG GCGGCCGCGC TGCGGGCGGC GGGGCTGAAG GTGCGGGATG TGGCCGAGGT CACGGGCTTT CCCGAGATGA TGGACGGCCG GGTCAAGACC CTGCATCCGA TGGTCCATGG CGGGCTTCTG GCGCTGCGCG ACGATGACGA GCATCTGGTG GCGATGGCCG CCCACGGGAT CGAGCCGATC GACCTCCTGG TGGTGAACCT CTATCCGTTC GAAGCGGCGG TCGCGCGGGG TGCCTCCTAT GACGACTGCA TCGAGAACAT CGACATCGGC GGGCCGGCCA TGATCCGGGC CGCGGCCAAG AACCACCGCT TCGTGAACGT CGTGACCGAC ACGGCCGACT ACAAGGCGCT GCTCGACGAA CTGCGCGCCC ATGACGGCGC CACGAGGCTC TCCTTCCGCC AGAAGCTCGC GCTGACCGCC TATGCGCGCA CCGCGGCCTA CGACACGGCC GTCTCGACCT GGATGGCGGG CGCGCTGAAG GCCGAGGCGC CGCGCCGCCG CTCCTTCGCG GGCACGCTCG CCCAGACGAT GCGCTACGGC GAGAATCCGC ACCAGAAGGC CGCCTTCTAC ACCGACGGCT CGGCCCGTCC GGGCGTCGCC ACCGCGAAAC AGTGGCAGGG CAAGGAGCTT TCCTACAACA ACATCAACGA CACCGACGCG GCCTTCGAGC TGGTGGCCGA GTTCGACCCG GCCGAGGGCC CGGCCTGCGT CATCGTCAAG CACGCCAACC CCTGCGGCGT GGCCCGGGGC GCGACACTGG CCGAAGCCTA TGCCCGCGCC TTCGACTGCG ACCGCGTCTC GGCCTTCGGC GGCATCATCG CGCTGAACCA GCCGCTCGAT GCGGCCACGG CCGAAAAGAT CACCGAGATC TTCACCGAGG TGGTGATCGC CCCCGGCGCC GACGAGGAAG CCCGCGCGAT CTTCGCCGCC AAGAAGAACC TCCGGCTGCT GACGACCGAG GCGCTGCCCG ATCCGCTGGC GCCGGGGCTC GCCTTCAAGC AGGTGGCGGG GGGCTTCCTC GTGCAGGACC GCGACGCGGG CCATGTCGAT GCGCTCGACC TGAAGGTGGT GACGAAGCGC GCGCCGTCGG ACGCCGAACT CGCCGACCTC CTCTTTGCCT GGACCGTGGC GAAGCATGTG AAATCGAACG CCATCGTCTA TGTGAAGGAC GGGGCCACAG TGGGCGTGGG GGCGGGGCAG ATGAGCCGCG TCGACTCGAC CCGGATCGCC GCGCGCAAGT CGCAGGACAT GGCGCAGGCG CTGGGCCTGG CCCAGCCGCT GACGCAAGGG TCCGTCGTGG CCTCCGACGC CTTCTTCCCC TTCGCCGACG GCCTGCTCGC CGCGGCCGAG GCGGGCGCCA CGGCGATCAT CCAGCCCGGC GGCTCGATGC GCGACGACGA GGTGATCGCG GCGGCCGACG AGGCGGGGCT CGCCATGGTC TTCACCGGCC AGCGTCACTT CCGGCACTGA
|
Protein sequence | MTNLVPVGRA LLSVSDKSGL LDLARALADL EVELISTGGT AAALRAAGLK VRDVAEVTGF PEMMDGRVKT LHPMVHGGLL ALRDDDEHLV AMAAHGIEPI DLLVVNLYPF EAAVARGASY DDCIENIDIG GPAMIRAAAK NHRFVNVVTD TADYKALLDE LRAHDGATRL SFRQKLALTA YARTAAYDTA VSTWMAGALK AEAPRRRSFA GTLAQTMRYG ENPHQKAAFY TDGSARPGVA TAKQWQGKEL SYNNINDTDA AFELVAEFDP AEGPACVIVK HANPCGVARG ATLAEAYARA FDCDRVSAFG GIIALNQPLD AATAEKITEI FTEVVIAPGA DEEARAIFAA KKNLRLLTTE ALPDPLAPGL AFKQVAGGFL VQDRDAGHVD ALDLKVVTKR APSDAELADL LFAWTVAKHV KSNAIVYVKD GATVGVGAGQ MSRVDSTRIA ARKSQDMAQA LGLAQPLTQG SVVASDAFFP FADGLLAAAE AGATAIIQPG GSMRDDEVIA AADEAGLAMV FTGQRHFRH
|
| |