Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A0288 |
Symbol | purP |
ID | 3624408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 337834 |
End bp | 338997 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637699180 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase-like protein |
Protein accession | YP_303852 |
Protein GI | 73667837 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGACA GGAACGAAAT TAAGGAAATC GTTGAAGGCT ACTACACCCA CGCTGATAAG ATAAAAGTCG GAACTATAGG CTCTCACTCA GGTCTCGATA TCTGTGACGG GGCAGTTGAA GAGGAATTCA GGACTCTCGC TGTATGCCAG GCTGGCAGGG AAAAAACATA CAGTGAATAC TTTAGGGCTC AGAGAGACCT TTCTGGAAAA GTAAAGAGAG GGATTGTTGA CGAGGCCATT GTTTTCAAAA AGTACAACGA AATTCTCCTG CCTGAAAACC AGCAGAAACT GGTTGATGAA AATGTGCTTT TTGTTCCGAA CCGATCCTTT ACTTCCTACT GCAGTATAGA TGAGATCGAA GAGAACTTCA GAGTGCCTCT TGTGGGAAGC AGGAACCTTC TCCGGAGCGA GGAACGTAGC GAGCAACAGA GCTATTACTG GATTCTGGAA AAAGCAGGAC TTCCTTTTCC GGAAAAAATC GAGTCTCCAA AAGATATCAA TGAGCTTGTA ATGGTCAAGC TCCCGCATGC AGTAAAAAAA CTCGAACGGG GATTTTTTAC CGCTTCAAGT TACAGAGAAT ATACGGAGAA ATCCGAGGCT CTAATTAAGC AGGGAGTTAT CACACGTGAG GCCCTTGAGA ATGCAAGGAT AGAGCGCTAT ATCATAGGTC CTGTATTCAA CTTTGATATG TTTTATTCCC CAATCGAGCC GAAAATGAGC AAACTGGAAC TTCTTGGGAT TGACTGGCGC TTTGAGACCA GCCTTGATGG ACATGTAAGG CTTCCTGCTC CGCAGCAAAT GTCCCTGGCC GAAAGCCAGC TAACTCCCGA ATACACGGTA TGCGGGCACA ATTCTGCAAC TCTGCGCGAG TCCCTTCTTG AAAAAGTGTT CAAAATGGGT GAAAAATATG TAGAAGCTAC CCAGGAATAT TATGCTCCGG GAATTATAGG ACCTTTCTGC CTGCAGACCT GCGTTGATAA GGATCTTAAC TTCTACATTT ATGATGTGGC CCCAAGAGTA GGCGGCGGTA CCAATGTTCA TATGTCAGTA GGTCATTCTT ACGGCAACTC GCTCTGGAGA AGACCAATGA GTACAGGAAG AAGACTGGCC TTTGAGATCA AGCGCGCCCT CGAACTGGAG AAGCTTGACG CTATCGTCAC ATAA
|
Protein sequence | MIDRNEIKEI VEGYYTHADK IKVGTIGSHS GLDICDGAVE EEFRTLAVCQ AGREKTYSEY FRAQRDLSGK VKRGIVDEAI VFKKYNEILL PENQQKLVDE NVLFVPNRSF TSYCSIDEIE ENFRVPLVGS RNLLRSEERS EQQSYYWILE KAGLPFPEKI ESPKDINELV MVKLPHAVKK LERGFFTASS YREYTEKSEA LIKQGVITRE ALENARIERY IIGPVFNFDM FYSPIEPKMS KLELLGIDWR FETSLDGHVR LPAPQQMSLA ESQLTPEYTV CGHNSATLRE SLLEKVFKMG EKYVEATQEY YAPGIIGPFC LQTCVDKDLN FYIYDVAPRV GGGTNVHMSV GHSYGNSLWR RPMSTGRRLA FEIKRALELE KLDAIVT
|
| |