Gene Mthe_1099 details


Gene Information

Locus tag: Mthe_1099
Symbol: purP
ID: 4463131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1188458
End bp: 1189597
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 54%
IMG OID: 639700116
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein
Protein accession: YP_843522
Protein GI: 116754404
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
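
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1188458
end_bp = 1189597

# Assumption: coordinates are 1-based and inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1140 379
```

Under these assumptions the computed values agree with the reported gene length (1140 bp) and protein length (379 aa).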


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATCGATC GCAGCAGGAT CTATGAGATT CTGGACGGAT ATGAGATGGA TGATGTGAGA 
ATAGGCATGA TCGCATCTCA TTCCGCCCTC GATGTCGCTG ATGGCGCTGT GGAGGAGGGC
TTCAGAACCC TGGCTGTCTG CCAGGAGGGG CGGGAGAGGA CCTATGTAAA GTACTTCCGG
GCGGAACGGG CACCGGACGG AAGGATAATC ACGGGAATGA TAGATGAAGT GGTGGTCTTA
AAGAGGTTCA AGGATATCCT CGACCAGCAG GATATGCTTC TCAGAAAGAA CGTCCTTTTT
GTTCCCAACA GGTCATTCAC ATCTTACTGC GGAATAGATT CGGTGGAGGA CGAGTTCGAG
GTCCCGCTCG TGGGCAGCAG GAATCTCCTG AGATCGGAGG AGCGGGGCGA CAAGATGGAC
TACTACTGGC TCCTGGAGAA GGCAGGCCTG CCATACCCGG AACAGATAGA GCCGGATGAG
ATCGACTGTC TTGTCATAGT AAAGCTGCCA CATGCCGTCA AGACGCTTGA AAGGGGTTTC
TTCACAGCGG CATCCGCAGA GGAATACTAC GAGAAGTCGG AGCTTCTTCT CCGCCAGGGG
GTCATAGACA GCGATGGCCT CGAGCTCGCC AGGATCGAGA GGTACATAAT AGGCCCGGTG
TTCAACCTGG ACTTCTTCTA CTCGCCTCTC AGGGATCGCA TCGAGCTTCT CGGGATCGAC
TGGAGGTTTG AGACGAGCCT TGATGGGCAT GTGAGGCTTC CGGCACCACA GCAGCTCAGG
CTGAACGAGA AGCAGATAAA CCCTGAGTAC ACAGTCTGCG GCCACAACTC CGCGACGCTG
AGAGAGTCTT TGCTGGAGAA GGCTTTTGAT CTCGCTGAGA AATATGTTGC AGCGACGAAG
GAATACTATC CTCCTGGAAT AATCGGCCCG TTCTGTCTCC AGACCTGCGT TGACAAGGAC
CTGAACTTCT ACATCTACGA TGTGGCGCCC CGCATAGGCG GAGGAACAAA CATACACATG
GCCGTGGGGC ATCCGTATGG AAACGCGCTC TGGCGGACGA ACATGTCTAC TGGAAGGAGA
CTGGCTAAAG AGGTGAGGCT TGCGATAGAG AGTGATTCAC TCAGGAAGAT AGTGACCTGA
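
The gene sequence is printed in blocks of 10 bases, 6 blocks per line. To work with it programmatically, the whitespace has to be removed first. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the reported GC content. The helper names `clean_sequence` and `gc_percent` are hypothetical and introduced only for illustration; only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "TTGATCGATC GCAGCAGGAT CTATGAGATT CTGGACGGAT ATGAGATGGA TGATGTGAGA"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases per printed line
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

Passing all 19 sequence lines to `clean_sequence` instead of one should give a string whose length matches the reported gene length.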
 
Protein sequence
MIDRSRIYEI LDGYEMDDVR IGMIASHSAL DVADGAVEEG FRTLAVCQEG RERTYVKYFR 
AERAPDGRII TGMIDEVVVL KRFKDILDQQ DMLLRKNVLF VPNRSFTSYC GIDSVEDEFE
VPLVGSRNLL RSEERGDKMD YYWLLEKAGL PYPEQIEPDE IDCLVIVKLP HAVKTLERGF
FTAASAEEYY EKSELLLRQG VIDSDGLELA RIERYIIGPV FNLDFFYSPL RDRIELLGID
WRFETSLDGH VRLPAPQQLR LNEKQINPEY TVCGHNSATL RESLLEKAFD LAEKYVAATK
EYYPPGIIGP FCLQTCVDKD LNFYIYDVAP RIGGGTNIHM AVGHPYGNAL WRTNMSTGRR
LAKEVRLAIE SDSLRKIVT
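
The protein sequence begins with M even though the gene sequence begins with TTG, which normally encodes leucine. This is consistent with translation table 11, where TTG is among the accepted initiation codons and an initiating codon is read as methionine. The following sketch illustrates the idea under stated assumptions: the codon table is built from the standard amino acid assignments (which table 11 shares), only ATG, GTG and TTG are treated as initiators, and the function name `translate` is hypothetical. It is not the pipeline that produced this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only these three initiation codons are handled in this sketch.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon: end of the protein
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("TTGATCGATC GCAGCAGGAT CTATGAGATT"))  # MIDRSRIYEI
```

The output matches the first ten residues of the protein sequence above. The same function applied to the last line of the gene sequence stops at the terminal TGA codon.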