Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC5_0205 |
Symbol | purP |
ID | 4928183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C5 |
Kingdom | Archaea |
Replicon accession | NC_009135 |
Strand | + |
Start bp | 172735 |
End bp | 173820 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640165704 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_001096736 |
Protein GI | 134045250 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.114719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCAA AAGAAGAAAT AATGGGGATT TTTGAAAAGT ACAACAAGGA CGAAGTGACT ATCGTTACGG TGGGAAGTCA CACGTCATTA CACATCTTAA AAGGTGCGAA ATTAGAGGGC TTTTCAACTG CAGTTATAAC AACAAAAGAT AGGGCTATTC CGTACAAAAG ATTTGGGGTT GCGGATAAAT TTATCTATGT TGACCAATTT TCAGATATTT CAAAAGAAGA AATTCAACAG CAATTAAGGG ATATGAATGC AATTATTGTT CCACACGGTT CATTCATTGC TTATTGTGGT TTAGATAATG TGGAAGATTC ATTCAAAGTT CCAATGTTTG GAAACAGAGC TATTTTAAGA TGGGAAGCTG AAAGAGATCT CGAAGGACAG CTTTTGGGCG AAAGCGGTCT TAGAATCCCT AAAAAATACG GTGGACCTGA CGATATAGAT GGACCTGTAA TGGTTAAATT CCCTGGAGCA AGGGGGGGCA GAGGATACTT CCCATGTTCA ACAGTGGAAG AATTCTGGAG AAAAATAGCT GAATTCAAAG CTAAAGGTAT TCTTACAGAA GACGACGTTA AAAAAGCACA CATCGAAGAA TACGTTGTTG GTGCAAACTA CTGTATCCAC TACTTCTACT CGCCTTTAAA AGATCAGGTA GAATTAATGG GAATCGATAG AAGATATGAG AGCAGTATTG ACGGACTTGT CAGAGTTCCT GCAAAAGACC AGCTTGAGTT AAACGTCGAT CCATCATACG TTATCACTGG AAACTTCCCT GTTGTAATTA GGGAAAGTCT CTTACCTCAG GTATTTGATA TTGGTGACAA ATTATCAGCA AAATCAAAAG AACTCGTAAA ACCAGGAATG CTTGGACCAT TCTGTTTACA GTCATTGTGC AATGACAACC TGGAACTCGT TGTATTTGAA ATGAGTGCAA GGGTTGACGG TGGAACAAAC ACGTTTATGA ATGGAAGTCC TTACTCCTGC CTTTACACTG GCGAACCATT AAGCATGGGT CAGAGAATTG CAAAAGAAAT AAAATTAGCA CTTGAATTAG GTATGATTGA CAAAGTCTTA TCCTAA
|
Protein sequence | MIPKEEIMGI FEKYNKDEVT IVTVGSHTSL HILKGAKLEG FSTAVITTKD RAIPYKRFGV ADKFIYVDQF SDISKEEIQQ QLRDMNAIIV PHGSFIAYCG LDNVEDSFKV PMFGNRAILR WEAERDLEGQ LLGESGLRIP KKYGGPDDID GPVMVKFPGA RGGRGYFPCS TVEEFWRKIA EFKAKGILTE DDVKKAHIEE YVVGANYCIH YFYSPLKDQV ELMGIDRRYE SSIDGLVRVP AKDQLELNVD PSYVITGNFP VVIRESLLPQ VFDIGDKLSA KSKELVKPGM LGPFCLQSLC NDNLELVVFE MSARVDGGTN TFMNGSPYSC LYTGEPLSMG QRIAKEIKLA LELGMIDKVL S
|
| |