Gene MmarC6_1300 details


Gene Information

Locus tag: MmarC6_1300
Symbol: purP
ID: 5737554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcus maripaludis C6
Kingdom: Archaea
Replicon accession: NC_009975
Strand:
Start bp: 1206538
End bp: 1207623
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 39%
IMG OID: 641283795
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase
Protein accession: YP_001549345
Protein GI: 159905683
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
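
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon.

```python
# Values copied from the Gene Information record.
START_BP = 1206538
END_BP = 1207623
REPORTED_GENE_LENGTH = 1086      # bp
REPORTED_PROTEIN_LENGTH = 361    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp == REPORTED_GENE_LENGTH)
    print(protein_length(n_bp) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, both comparisons agree with the reported values (1086 bp and 361 aa).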


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.842573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCCAA AAGAAGAAAT AATGGGGATT TTTGAAAAGT ACAACAAGGA TGAAGTGACT 
ATTGTTACGG TAGGAAGCCA CACGTCCTTG CACATTTTAA AAGGTGCGAA ATTGGAGGGC
TTTTCAACTG CAGTTATAAC AACAAGAGAT AGGGACATTC CGTACAAAAG ATTCGGGGTT
GCGGACAAAT TTATCTATGT TGACAAATTT TCAGATATTT CAAAAGAAGA GATTCAACAG
CAATTAAGAG ATATGAATGC AATTATTGTT CCACACGGTT CATTCATTGC TTATTGTGGT
TTGGATAATG TGGAAGATAC ATTTAAAGTT CCAATGTTTG GAAACAGAGC TATTTTAAGA
TGGGAAGCTG AAAGAGATTT GGAAGGACAG CTTTTGGGTG GAAGTGGTCT TCGGATCCCT
AAAAAATACG GCGGACCTGA CGATATAGAT GGGCCAGTAA TGGTTAAATT TCCTGGGGCA
AGAGGGGGCA GAGGATACTT CCCATGCTCA ACAGTGGAAG AATTCTGGAG AAAAATAGGC
GAATTCAAAG CTAAAGGTAT CCTTACAGAA GACGACGTTA AAAAAGCACA CATCGAAGAA
TATGTTGTTG GTGCAAACTA CTGTATTCAC TACTTCTACT CACCATTAAA AGACCAGGTT
GAATTAATGG GGATTGACAG AAGGTACGAA AGCAGTATTG ATGGACTTGT TAGGGTTCCT
GCAAAAGACC AGCTTGAATT AAGCATTGAC CCTTCATACG TTATCACAGG AAACTTCCCT
GTTGTAATCA GGGAAAGTCT CTTACCTCAG GTATTTGACA TGGGTGACAA ATTAGCAACA
AAAGCAAAAG AACTTGTAAA ACCAGGAATG CTTGGGCCAT TCTGCTTACA ATCATTGTGT
AATGAAAATC TGGAACTCGT TGTATTCGAA ATGAGTGCAA GGGTAGATGG GGGAACAAAC
ACGTTTATGA ACGGAAGCCC ATATTCATGC CTTTACACAG GAGAGCCATT AAGCATGGGT
CAGAGAATTG CAAAAGAAAT AAAATTAGCG CTTGAACTTA AAATGATTGA CAAAGTCATA
TCTTAA
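
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before the length or GC content can be compared with the reported values (1086 bp, 39%). The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example string is only the first line of the sequence above.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence only; paste the full block to check
    # the whole gene against the reported length and GC content.
    first_line = (
        "ATGATTCCAA AAGAAGAAAT AATGGGGATT "
        "TTTGAAAAGT ACAACAAGGA TGAAGTGACT"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(100 * gc_fraction(seq), 1))
```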
 
Protein sequence
MIPKEEIMGI FEKYNKDEVT IVTVGSHTSL HILKGAKLEG FSTAVITTRD RDIPYKRFGV 
ADKFIYVDKF SDISKEEIQQ QLRDMNAIIV PHGSFIAYCG LDNVEDTFKV PMFGNRAILR
WEAERDLEGQ LLGGSGLRIP KKYGGPDDID GPVMVKFPGA RGGRGYFPCS TVEEFWRKIG
EFKAKGILTE DDVKKAHIEE YVVGANYCIH YFYSPLKDQV ELMGIDRRYE SSIDGLVRVP
AKDQLELSID PSYVITGNFP VVIRESLLPQ VFDMGDKLAT KAKELVKPGM LGPFCLQSLC
NENLELVVFE MSARVDGGTN TFMNGSPYSC LYTGEPLSMG QRIAKEIKLA LELKMIDKVI
S
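
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own pipeline: it builds the codon table from the conventional TCAG ordering, whose amino acid assignments are the same for translation table 11 as for the standard code (table 11 differs in which start codons are permitted, which this sketch does not model). The name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    first_line = (
        "ATGATTCCAA AAGAAGAAAT AATGGGGATT "
        "TTTGAAAAGT ACAACAAGGA TGAAGTGACT"
    )
    print(translate(first_line))  # MIPKEEIMGIFEKYNKDEVT
```

The first 60 bases of the gene translate to the first 20 residues of the protein sequence listed above, and the final six bases (TCTTAA) give the terminal S followed by a stop codon.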