Gene Nmar_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0872 
SymbolpurP 
ID5774332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp766788 
End bp767813 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content35% 
IMG OID641316511 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase 
Protein accessionYP_001582206 
Protein GI161528380 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0867065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCGA TTGCAACACT AGGTTCTCAC TGTTCACTAC AGGTTCTAAA AGGTGCAAAA 
GATGAAGGCT TGAAAACAAT TCTAGTTTGT GAGAAAAAAC GTGAAAAACT ATACCGAAGA
TTTCCATTTA TTGATGAATT AATCATAGTA GACAAATTCA GAGAAGTTCT TGATGAAAAA
GTCCAGTCAA CTCTTGAACA AAATGATGCA GTGTTGATTC CTCATGGAAC CTTAATTGCA
CAGATGAGTT CTGATGAGAT TGAATCAATC AAGACCCCAA TCTTTGGTAA TAAATGGATT
CTTAGATGGG AGTCAGATAG AGAGATGAAA GAAAAACTCA TGAGAGAAGC AACTCTTCCA
ATGCCAAAAC CAGTTACAAA TCCTAGTGAG ATTGAAAAAT TATCAATTGT AAAAAGACAG
GGTGCAGCCG GAGGTAAAGG TTACTTTATG GCAGCAAATG AGGATGATTA CAATACAAAA
AGAAATCAAT TGATTTCAGA AGGAGTTATT TCTGAAGATG AAACATTGTA CATTCAAGAA
TATGCTGCAG GTGTTTTAGC ATACTTGACA TTTTTCTATT CTCCACTAAA AGAAGAATTA
GAATTCTATG GAGTTGATCA AAGACATGAA TCAGATATTG AAGGATTGGG AAGAATTCCA
GCTGAGCAGC AAATGAAATC AAACAAAGTT CCATCATTCA ATGTCATTGG AAACAGTCCA
CTAGTTTTGC GCGAATCATT GTTAGATGAA GTATACACAA TGGGTGAGAA TTTTGTTGAA
GCATCAAAAA GAATTGTTGC ACCTGGAATG AACGGTCCAT TTTGTATAGA AGGAGTATAT
GATGAGAATG CACAATTCAC ATCATTTGAA TTTTCAGCAA GAATTGTTGC AGGCTCTAAT
ATCTACATGG ATGGCTCTCC ATACTACAAT TTACTATTCA ATGAAACCAT GAGTATGGGA
AAAAGAATAG CCAGAGAAGT AAAGACAGCC GCAGAAACAA ACCAGTTAGA TAAAGTTACA
ACATAA
 
Protein sequence
MASIATLGSH CSLQVLKGAK DEGLKTILVC EKKREKLYRR FPFIDELIIV DKFREVLDEK 
VQSTLEQNDA VLIPHGTLIA QMSSDEIESI KTPIFGNKWI LRWESDREMK EKLMREATLP
MPKPVTNPSE IEKLSIVKRQ GAAGGKGYFM AANEDDYNTK RNQLISEGVI SEDETLYIQE
YAAGVLAYLT FFYSPLKEEL EFYGVDQRHE SDIEGLGRIP AEQQMKSNKV PSFNVIGNSP
LVLRESLLDE VYTMGENFVE ASKRIVAPGM NGPFCIEGVY DENAQFTSFE FSARIVAGSN
IYMDGSPYYN LLFNETMSMG KRIAREVKTA AETNQLDKVT T