Gene MmarC7_0618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC7_0618 
SymbolpurP 
ID5328502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C7 
KingdomArchaea 
Replicon accessionNC_009637 
Strand
Start bp622817 
End bp623902 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content39% 
IMG OID640793143 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase 
Protein accessionYP_001329836 
Protein GI150402542 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCAA AAGAAGAAAT AATGGGGATT TTTGAAAAGT ACAACAAGGA CGAAGTGACT 
ATCGTTACGG TGGGCAGTCA CACGTCCTTG CACATCTTAA AAGGTGCAAA ATTGGAGGGC
TTTTCAACTG CAGTTATAAC AACAAGGGAT AGGGACATTC CGTACAAAAG ATTTGGGGTT
GCGGACAAAT TTATCTATGT TGACAAATTT TCAGATATTT CAAAAGAAGA AATTCAACAG
CAATTAAGGG ATATGAATGC AATTATTGTT CCACACGGTT CGTTTATAGC TTATTGTGGT
TTAGATAATG TGGAAGATAC ATTCAAAGTT CCAATGTTTG GAAACAGGGC TATTTTAAGA
TGGGAGGCTG AAAGAGATCT CGAAGGACAG CTTTTGGGCG GAAGCGGCCT TAGAATCCCT
AAAAAATACG GTGGACCTGA TGACATAGAT GGGCCAGTAA TGGTTAAATT CCCTGGAGCA
AGGGGTGGCA GAGGATACTT CCCATGCTCA ACTGTGGAAG AATTCTGGAG AAAAATAGAC
GAATTCAAAG CTAAAGGTAT TCTTACAGAA GACGATGTTG CAAAAGCACA CATCGAAGAA
TATGTTGTTG GTGCAAACTA CTGTATTCAC TACTTCTATT CACCATTAAA AGACCAGGTT
GAATTAATGG GTATTGATAG AAGATATGAA AGCAGTATTG ATGGACTTGT TAGGGTTCCT
GCAAAAGACC AGCTTGAATT AAGCATTGAC CCATCATACG TTATCACAGG AAACTTCCCT
GTTGTAATCA GAGAAAGTCT CTTGCCTCAA GTATTTGACA TGGGTGACAA ATTAGCAACA
AAAGCAAAAG AACTCGTAAA ACCGGGAATG CTTGGACCGT TCTGTTTACA GTCATTATGT
AACGAAAACC TAGAACTCGT TGTATTCGAA ATGAGTGCAA GGGTAGATGG TGGAACAAAC
ACGTTCATGA ACGGAAGCCC GTATTCATGC CTTTACACAG GAGAACCATT AAGCATGGGC
CAGAGAATTG CAAGAGAAAT AAAATTAGCA CTCGAACTCA AAATGATTGA TAAAGTTATA
TCTTAA
 
Protein sequence
MIPKEEIMGI FEKYNKDEVT IVTVGSHTSL HILKGAKLEG FSTAVITTRD RDIPYKRFGV 
ADKFIYVDKF SDISKEEIQQ QLRDMNAIIV PHGSFIAYCG LDNVEDTFKV PMFGNRAILR
WEAERDLEGQ LLGGSGLRIP KKYGGPDDID GPVMVKFPGA RGGRGYFPCS TVEEFWRKID
EFKAKGILTE DDVAKAHIEE YVVGANYCIH YFYSPLKDQV ELMGIDRRYE SSIDGLVRVP
AKDQLELSID PSYVITGNFP VVIRESLLPQ VFDMGDKLAT KAKELVKPGM LGPFCLQSLC
NENLELVVFE MSARVDGGTN TFMNGSPYSC LYTGEPLSMG QRIAREIKLA LELKMIDKVI
S