Gene Mthe_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0184 
SymbolpurP 
ID4462751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp179435 
End bp180505 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content53% 
IMG OID639699192 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase 
Protein accessionYP_842623 
Protein GI116753505 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCGA AAGAGGAGAT TTTGGAGATA CTGAATGGGT ATGATCTGAA GAACATCACA 
ATAGCCACGG TCTGCTCCCA CAGCAGCCTT CAGATATTTC ACGGCGCGAA ACAGGAGGGC
TTCAGGACTC TCGGCATATG TATAGGTCCG CCCCCGAGGT TCTATGATGC CTTCCCTCTC
GCAAAGCCGG ATGCATTTAT ATCGCTGGAC AGTTATAAAG CCATGCTGGA TGAGAGCGAC
CGTCTGATCG ATGAGAACGC CATAATAATA CCACACGGCT CGATGGTCGA GTATCTCGGC
ATCCAGAACT TCGAAGCTCT GCCTGTGCCG ACCTTTGGGA ACAGAAGATG CCTCGCCTGG
GAGAGCGACC GCGAGATGGA GCGAGAGTGG CTTCTCAGGG CTGGTGTGAA CGTCCCCATG
AGGTTCGAGA ACGCCGAGCT GATAGACAGG CCTGTCATAG TCAAGTACCA CGGCGCCAAG
GGCGGAAAGG GGTTCTTCAT AGCAAAGAAT AAGGAAGAGT TCCAGTCGAA GATCCAGCAG
GGACAGAAAT ACACCATACA GGAGTTCATC TTAGGAACAA GATACTACAT ACATTTCTTC
TACTCTCCCA TAAGGGAGAA GGGCTACCGT CTGAGGAAGG GCACACTCGA CATGCTCGGG
ATCGACAGGC GTGTGGAATC AAATGCTGAC GAGATATTCA GGATAGGATC GGTGAACGAG
CTCGAGGCTG CAGGGATATA CCCCAGCTTC GTCGTGACCG GGAACCTGCC GCTGGTTCTG
CGAGAGTCGC TTCTTCCGAA GGTATTCGAC CTCGGCGAGA GGGTCGTCGA GACGTCGATA
GAGTTATTTG GCGGCATGGT GGGTCCGTTT AGCCTCGAGA CGATAGTGAC CGACGATCTG
GACTTCAAGG TCTTCGAGAT ATCTGCAAGA ATCGTTGCAG GTACAAACCT CTTCATAAGC
GGATCGCCAT ATTCTGATCT CATTGAGAAG GGCCTCTCCA CAGGAAGGAG AATCGCCCAG
GAGATCGCGC TCGCGAGAAG CATGGGGGCG CTTGGAGAGG TCATAAGCTG A
 
Protein sequence
MISKEEILEI LNGYDLKNIT IATVCSHSSL QIFHGAKQEG FRTLGICIGP PPRFYDAFPL 
AKPDAFISLD SYKAMLDESD RLIDENAIII PHGSMVEYLG IQNFEALPVP TFGNRRCLAW
ESDREMEREW LLRAGVNVPM RFENAELIDR PVIVKYHGAK GGKGFFIAKN KEEFQSKIQQ
GQKYTIQEFI LGTRYYIHFF YSPIREKGYR LRKGTLDMLG IDRRVESNAD EIFRIGSVNE
LEAAGIYPSF VVTGNLPLVL RESLLPKVFD LGERVVETSI ELFGGMVGPF SLETIVTDDL
DFKVFEISAR IVAGTNLFIS GSPYSDLIEK GLSTGRRIAQ EIALARSMGA LGEVIS