Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2265 |
Symbol | purP |
ID | 5056179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2026871 |
End bp | 2027878 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469817 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_001154461 |
Protein GI | 145592459 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0561077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0630751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAGA TTTTGAAAAG ATACGACCTG GACAAGCTTG CGGTTGCCAC AATCGCATCA CATACAGCTT TGCAAATCCT CAGAGGGGCA AAAAAATACG GATTTAGAAC AATCGCCATA GCAAAGAACG AGGACATCGC CCAGTTCTAC AAGCAATTTT TCTTCATAGA TGAGGTGTGG ACTGGGGATT TCTCCAACTT TAGAAAAACT GCCGAAAGGC TCGTAGCGGA AAACGCACTG TTGATACCCC ACGGCTCCTA CGTCGAATAT GTCGGCTGGA GACAAGCTCT GGAAGCCCCA GTCCCTACGC TCGGCTGTAG AGAGTTGTTG AGATGGGAGG CCGACCAGTA CAAAAAGATG GCGTTGTTGG AAGAGGCTGG GATACCCATC CCCCGGGTGT ACAGATCACC AACGGAGGTG GACGGGCCTG TTATCGTGAA GTTCTTCGGC GCAAAGGGTG GCAGGGGTTA CTTCGTGGCT AAGGGCAGGG AGGAACTGGA GGCTAGGCTA AAGGCGCTGG GCGAGGAGTA CATCATACAA GAGTACCTCT TCGGCGTGCC GGCCTACTAC CACTACTTCG CCTCGCCTGT CTACTCCCGC ATCGAAGTTT TCGGCGCCGA CATCCGCTAC GAATCCAACG TCGACGGCAG GACCTTCGGC TGGGCCGAGC CGACCTTCGT CGTGGTGGGC AACCTTTCGC TGGTGCTCCG GGAGTCCCTC TTGCCCATCA TTCACAAATA CGGAGTCCAG TTCGCCAAGG CCGTGGAGAA GCGGGTTGGA TGCAGGTTGG CCGGCCCCTA CTGCTTGGAG TCGATAATAA AAGACGACAT GTCCATCGTG GTGTTTGAGT TCTCTGGGAG GATCGTGGCG GGGACAAACA TCTACATGGG CTACGGCTCG CCCTACTCCG TCCTCTACTT CGACAAACCA ATGGACATGG GCGAGAGGAT AGCCCACGAA ATAAGAGAAG CCGCAAAAGC TGGCAAACTA GATCAGCTAT TTACTTAG
|
Protein sequence | MSQILKRYDL DKLAVATIAS HTALQILRGA KKYGFRTIAI AKNEDIAQFY KQFFFIDEVW TGDFSNFRKT AERLVAENAL LIPHGSYVEY VGWRQALEAP VPTLGCRELL RWEADQYKKM ALLEEAGIPI PRVYRSPTEV DGPVIVKFFG AKGGRGYFVA KGREELEARL KALGEEYIIQ EYLFGVPAYY HYFASPVYSR IEVFGADIRY ESNVDGRTFG WAEPTFVVVG NLSLVLRESL LPIIHKYGVQ FAKAVEKRVG CRLAGPYCLE SIIKDDMSIV VFEFSGRIVA GTNIYMGYGS PYSVLYFDKP MDMGERIAHE IREAAKAGKL DQLFT
|
| |