Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0872 |
Symbol | purP |
ID | 5774332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 766788 |
End bp | 767813 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316511 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_001582206 |
Protein GI | 161528380 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0867065 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCGA TTGCAACACT AGGTTCTCAC TGTTCACTAC AGGTTCTAAA AGGTGCAAAA GATGAAGGCT TGAAAACAAT TCTAGTTTGT GAGAAAAAAC GTGAAAAACT ATACCGAAGA TTTCCATTTA TTGATGAATT AATCATAGTA GACAAATTCA GAGAAGTTCT TGATGAAAAA GTCCAGTCAA CTCTTGAACA AAATGATGCA GTGTTGATTC CTCATGGAAC CTTAATTGCA CAGATGAGTT CTGATGAGAT TGAATCAATC AAGACCCCAA TCTTTGGTAA TAAATGGATT CTTAGATGGG AGTCAGATAG AGAGATGAAA GAAAAACTCA TGAGAGAAGC AACTCTTCCA ATGCCAAAAC CAGTTACAAA TCCTAGTGAG ATTGAAAAAT TATCAATTGT AAAAAGACAG GGTGCAGCCG GAGGTAAAGG TTACTTTATG GCAGCAAATG AGGATGATTA CAATACAAAA AGAAATCAAT TGATTTCAGA AGGAGTTATT TCTGAAGATG AAACATTGTA CATTCAAGAA TATGCTGCAG GTGTTTTAGC ATACTTGACA TTTTTCTATT CTCCACTAAA AGAAGAATTA GAATTCTATG GAGTTGATCA AAGACATGAA TCAGATATTG AAGGATTGGG AAGAATTCCA GCTGAGCAGC AAATGAAATC AAACAAAGTT CCATCATTCA ATGTCATTGG AAACAGTCCA CTAGTTTTGC GCGAATCATT GTTAGATGAA GTATACACAA TGGGTGAGAA TTTTGTTGAA GCATCAAAAA GAATTGTTGC ACCTGGAATG AACGGTCCAT TTTGTATAGA AGGAGTATAT GATGAGAATG CACAATTCAC ATCATTTGAA TTTTCAGCAA GAATTGTTGC AGGCTCTAAT ATCTACATGG ATGGCTCTCC ATACTACAAT TTACTATTCA ATGAAACCAT GAGTATGGGA AAAAGAATAG CCAGAGAAGT AAAGACAGCC GCAGAAACAA ACCAGTTAGA TAAAGTTACA ACATAA
|
Protein sequence | MASIATLGSH CSLQVLKGAK DEGLKTILVC EKKREKLYRR FPFIDELIIV DKFREVLDEK VQSTLEQNDA VLIPHGTLIA QMSSDEIESI KTPIFGNKWI LRWESDREMK EKLMREATLP MPKPVTNPSE IEKLSIVKRQ GAAGGKGYFM AANEDDYNTK RNQLISEGVI SEDETLYIQE YAAGVLAYLT FFYSPLKEEL EFYGVDQRHE SDIEGLGRIP AEQQMKSNKV PSFNVIGNSP LVLRESLLDE VYTMGENFVE ASKRIVAPGM NGPFCIEGVY DENAQFTSFE FSARIVAGSN IYMDGSPYYN LLFNETMSMG KRIAREVKTA AETNQLDKVT T
|
| |