Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC6_1300 |
Symbol | purP |
ID | 5737554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C6 |
Kingdom | Archaea |
Replicon accession | NC_009975 |
Strand | + |
Start bp | 1206538 |
End bp | 1207623 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641283795 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_001549345 |
Protein GI | 159905683 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.842573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCAA AAGAAGAAAT AATGGGGATT TTTGAAAAGT ACAACAAGGA TGAAGTGACT ATTGTTACGG TAGGAAGCCA CACGTCCTTG CACATTTTAA AAGGTGCGAA ATTGGAGGGC TTTTCAACTG CAGTTATAAC AACAAGAGAT AGGGACATTC CGTACAAAAG ATTCGGGGTT GCGGACAAAT TTATCTATGT TGACAAATTT TCAGATATTT CAAAAGAAGA GATTCAACAG CAATTAAGAG ATATGAATGC AATTATTGTT CCACACGGTT CATTCATTGC TTATTGTGGT TTGGATAATG TGGAAGATAC ATTTAAAGTT CCAATGTTTG GAAACAGAGC TATTTTAAGA TGGGAAGCTG AAAGAGATTT GGAAGGACAG CTTTTGGGTG GAAGTGGTCT TCGGATCCCT AAAAAATACG GCGGACCTGA CGATATAGAT GGGCCAGTAA TGGTTAAATT TCCTGGGGCA AGAGGGGGCA GAGGATACTT CCCATGCTCA ACAGTGGAAG AATTCTGGAG AAAAATAGGC GAATTCAAAG CTAAAGGTAT CCTTACAGAA GACGACGTTA AAAAAGCACA CATCGAAGAA TATGTTGTTG GTGCAAACTA CTGTATTCAC TACTTCTACT CACCATTAAA AGACCAGGTT GAATTAATGG GGATTGACAG AAGGTACGAA AGCAGTATTG ATGGACTTGT TAGGGTTCCT GCAAAAGACC AGCTTGAATT AAGCATTGAC CCTTCATACG TTATCACAGG AAACTTCCCT GTTGTAATCA GGGAAAGTCT CTTACCTCAG GTATTTGACA TGGGTGACAA ATTAGCAACA AAAGCAAAAG AACTTGTAAA ACCAGGAATG CTTGGGCCAT TCTGCTTACA ATCATTGTGT AATGAAAATC TGGAACTCGT TGTATTCGAA ATGAGTGCAA GGGTAGATGG GGGAACAAAC ACGTTTATGA ACGGAAGCCC ATATTCATGC CTTTACACAG GAGAGCCATT AAGCATGGGT CAGAGAATTG CAAAAGAAAT AAAATTAGCG CTTGAACTTA AAATGATTGA CAAAGTCATA TCTTAA
|
Protein sequence | MIPKEEIMGI FEKYNKDEVT IVTVGSHTSL HILKGAKLEG FSTAVITTRD RDIPYKRFGV ADKFIYVDKF SDISKEEIQQ QLRDMNAIIV PHGSFIAYCG LDNVEDTFKV PMFGNRAILR WEAERDLEGQ LLGGSGLRIP KKYGGPDDID GPVMVKFPGA RGGRGYFPCS TVEEFWRKIG EFKAKGILTE DDVKKAHIEE YVVGANYCIH YFYSPLKDQV ELMGIDRRYE SSIDGLVRVP AKDQLELSID PSYVITGNFP VVIRESLLPQ VFDMGDKLAT KAKELVKPGM LGPFCLQSLC NENLELVVFE MSARVDGGTN TFMNGSPYSC LYTGEPLSMG QRIAKEIKLA LELKMIDKVI S
|
| |