Gene Information |
Locus tag | Pisl_0094 |
Symbol | purP |
ID | 4616976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 87632 |
End bp | 88699 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639783175 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_929620 |
Protein GI | 119871613 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
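The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 87632
end_bp = 88699

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1068
print(protein_length_aa)  # 355
```

Both results match the Gene Length (1068 bp) and Protein Length (355 aa) fields.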
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.279967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACGTC AGACTTATAT CTTTGTCGCC ATACAAAAAT TCAAAAGTCT TCAACATCTG GAGCACATGT CTGTATTAAA GAGATATGAC TTAGAAAAAC TTGCTGTAGC AACAGTGGCG TCTCATACTG CATTACAAAT TCTGAGGGGG GCAAAGAAAT ATGGTTTTAG GACTATTGCG ATAGCGGGTA GAGCCGACGT CGCCGAATTT TACCAACAGT TTAACTTCAT AGACGAAGTG TGGACTGTAG ACTTTAGAAA TTTTGTAAAA GCGGCTGAAA AACTGGTAGA AGCCAATGCA GTTTTTATAC CACATGGCTC TTACGTAGAA TACGTCGGCT GGAGACAGGC GCTGGAGGCG CCGGTTCCCA CCCTCGGCTG TAGAGAGTTA ATCAAGTGGG AGGCAGATCA GTACAAGAAG ATGGAGCTCC TCCAGAGAGG CGGCATCCCC ACGTCGAGGG TCTACAAAAC GCCGGAGGAG GTGGATAGGC CCGTCATAGT TAAGCTCTTC GGCGCCAAGG GGGGCAGGGG GTACTTCCTC GCTAGAGATA GAGAGGAGCT GAGGAGGAGG CTGGCCGGCT TGAGCGACTA CATTATCCAG GAGTACGTAT TCGGCGTGCC GGCCTACTAC CATTACTTCT CCTCGCCGGT CTACGGCAGG GTGGAGGTCT TTGGCGCAGA CATTAGGTAC GAGTCTAACG TAGACGGGAG GACCTTCGGC TGGGTAGAGC CCACTTTCGT CGTCGTGGGC AACCTCCCCC TGGTTCTTAG GGAGTCTTTA CTCCCAACGA TATGGAAGTA CGGCGTCCAG TTTGCCAAAG CCGTCGAGGA GGTGGTCGGC TGCAGACTTG CGGGGCCCTA CTGCCTGGAG TCTATAATAA GAGACGACAT GTCAATCTCA GTCTTCGAGT TCTCTGGGCG GATTGTGGCT GGGACGAACA TATACATGGG CTACGGCTCG CCCTACTCAG TCCTCTACTT CGACAGGCCT ATGGACATGG GGGAGAGGAT AGCCCACGAG ATAAGGGAGG CCGCCCGGAG GGGCCGCCTA GAGGATCTCT TCACGTAG
|
Protein sequence | MLRQTYIFVA IQKFKSLQHL EHMSVLKRYD LEKLAVATVA SHTALQILRG AKKYGFRTIA IAGRADVAEF YQQFNFIDEV WTVDFRNFVK AAEKLVEANA VFIPHGSYVE YVGWRQALEA PVPTLGCREL IKWEADQYKK MELLQRGGIP TSRVYKTPEE VDRPVIVKLF GAKGGRGYFL ARDREELRRR LAGLSDYIIQ EYVFGVPAYY HYFSSPVYGR VEVFGADIRY ESNVDGRTFG WVEPTFVVVG NLPLVLRESL LPTIWKYGVQ FAKAVEEVVG CRLAGPYCLE SIIRDDMSIS VFEFSGRIVA GTNIYMGYGS PYSVLYFDRP MDMGERIAHE IREAARRGRL EDLFT
|
| |
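The sequences above are displayed in space-separated blocks of 10 letters, so they need cleaning before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, scored for GC content, and translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs only in the start codons it permits). Only the first three and last two display blocks are used here, to keep the example short.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the whitespace that separates the 10-letter display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in reading frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three and last two display blocks of the gene sequence.
first_blocks = clean_sequence("ATGTTACGTC AGACTTATAT CTTTGTCGCC")
last_blocks = clean_sequence("GAGGATCTCT TCACGTAG")

print(translate(first_blocks))  # MLRQTYIFVA
print(translate(last_blocks))   # EDLFT*
```

The two translated fragments agree with the start (`MLRQTYIFVA`) and end (`EDLFT`) of the listed protein sequence, with the trailing `*` corresponding to the TAG stop codon. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.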