Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A1334 |
Symbol | purP |
ID | 3627446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 1645399 |
End bp | 1646469 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637700225 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_304876 |
Protein GI | 73668861 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACAA AACAGCAGGT TCTTGAATTT CTGAAAAATT ACGATTTAGA AAACATTACA ATTGCAACAA TTTGCTCCCA CTCTAGCCTT CAAATTTTTG ACGGAGCTCG GAAAGAAGGC TTCAGAACTC TGGGAATCTG CGTCGGCAAG CCTCCAAAAT TTTATGAGGC TTTTCCCAAA GCAAAACCCG ATGAGTACCT TGTCGTCGAT AGCTATGCCG ATATAATGAA TAAGGCTGAG GAGCTTAGAC AGAAAAACAC CATTATTATC CCACACGGCT CAGTAGTTGC TTACCTTGGC ACGGAAAATT TTGCAGATAT GGCCGTACCT ACTTTTGGAA ACCGGGCTGT GCTCGAATGG GAATCAGACA GGGATAAAGA GCGTGAGTGG TTGCTCGGAG CAGGCATTCA TATGCCCGGA AAAGTCGATG ATCCTCACGA TATTAATGGG CCTGTAATGG TCAAGTACGA CGGGGCAAAA GGAGGAAAAG GTTTCTTCGT CGCAAAAACC TACGAAGAGT TCGAAGAACT TATAGACCGG ACCCAGAAGT ACACGATTCA GGAATTTATC ACCGGGACCC GTTATTACCT TCATTACTTC TATTCCCCTA TCCGAAACGA AGGATACACT TTAAGCAAGG GCAGCCTTGA ACTTCTGAGC ATGGACCGCA GGGTAGAATC CAATGCGGAT GAAATTTTCA GGCTCGGTTC CCCCAGAGAA CTCGTAGGAG CAGGAATCCG CCCGACATAT GTAGTCACAG GAAATATGCC CCTTGTAGCA AGAGAATCCC TCTTACCCCT CATCTTCTCT CTTGGAGAAA GGGTAGTTGA AGAATCTCTT GGCCTCTTTG GCGGGATGAT AGGGTCTTTC TGCCTTGAGA CTGTTTTTAC GGATACGCTA GAGATTAAAG TGTTTGAGAT CTCGGCCAGA ATTGTTGCAG GAACAAACCT TTACATTTCA GGTTCTCCTT ATGCCGACCT GATGGAAGAA AACCTCTCAA CCGGAAGAAG AATAGCCAGG GAAATCAAAA TCGCAGCCCA AACAGGCCAG CTGGATAAGA TAATATCCTA A
|
Protein sequence | MITKQQVLEF LKNYDLENIT IATICSHSSL QIFDGARKEG FRTLGICVGK PPKFYEAFPK AKPDEYLVVD SYADIMNKAE ELRQKNTIII PHGSVVAYLG TENFADMAVP TFGNRAVLEW ESDRDKEREW LLGAGIHMPG KVDDPHDING PVMVKYDGAK GGKGFFVAKT YEEFEELIDR TQKYTIQEFI TGTRYYLHYF YSPIRNEGYT LSKGSLELLS MDRRVESNAD EIFRLGSPRE LVGAGIRPTY VVTGNMPLVA RESLLPLIFS LGERVVEESL GLFGGMIGSF CLETVFTDTL EIKVFEISAR IVAGTNLYIS GSPYADLMEE NLSTGRRIAR EIKIAAQTGQ LDKIIS
|
| |