Gene Information |
Locus tag | Mthe_1099 |
Symbol | purP |
ID | 4463131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1188458 |
End bp | 1189597 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639700116 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein |
Protein accession | YP_843522 |
Protein GI | 116754404 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
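The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record, but both are consistent with the listed numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1188458
end_bp = 1189597
gene_length_bp = 1140
protein_length_aa = 379


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(start_bp, end_bp))            # 1140
print(expected_protein_length(gene_length_bp))  # 379
```

Both results agree with the Gene Length and Protein Length rows, which is what one would expect for an unspliced, non-pseudo CDS.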
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATCGATC GCAGCAGGAT CTATGAGATT CTGGACGGAT ATGAGATGGA TGATGTGAGA ATAGGCATGA TCGCATCTCA TTCCGCCCTC GATGTCGCTG ATGGCGCTGT GGAGGAGGGC TTCAGAACCC TGGCTGTCTG CCAGGAGGGG CGGGAGAGGA CCTATGTAAA GTACTTCCGG GCGGAACGGG CACCGGACGG AAGGATAATC ACGGGAATGA TAGATGAAGT GGTGGTCTTA AAGAGGTTCA AGGATATCCT CGACCAGCAG GATATGCTTC TCAGAAAGAA CGTCCTTTTT GTTCCCAACA GGTCATTCAC ATCTTACTGC GGAATAGATT CGGTGGAGGA CGAGTTCGAG GTCCCGCTCG TGGGCAGCAG GAATCTCCTG AGATCGGAGG AGCGGGGCGA CAAGATGGAC TACTACTGGC TCCTGGAGAA GGCAGGCCTG CCATACCCGG AACAGATAGA GCCGGATGAG ATCGACTGTC TTGTCATAGT AAAGCTGCCA CATGCCGTCA AGACGCTTGA AAGGGGTTTC TTCACAGCGG CATCCGCAGA GGAATACTAC GAGAAGTCGG AGCTTCTTCT CCGCCAGGGG GTCATAGACA GCGATGGCCT CGAGCTCGCC AGGATCGAGA GGTACATAAT AGGCCCGGTG TTCAACCTGG ACTTCTTCTA CTCGCCTCTC AGGGATCGCA TCGAGCTTCT CGGGATCGAC TGGAGGTTTG AGACGAGCCT TGATGGGCAT GTGAGGCTTC CGGCACCACA GCAGCTCAGG CTGAACGAGA AGCAGATAAA CCCTGAGTAC ACAGTCTGCG GCCACAACTC CGCGACGCTG AGAGAGTCTT TGCTGGAGAA GGCTTTTGAT CTCGCTGAGA AATATGTTGC AGCGACGAAG GAATACTATC CTCCTGGAAT AATCGGCCCG TTCTGTCTCC AGACCTGCGT TGACAAGGAC CTGAACTTCT ACATCTACGA TGTGGCGCCC CGCATAGGCG GAGGAACAAA CATACACATG GCCGTGGGGC ATCCGTATGG AAACGCGCTC TGGCGGACGA ACATGTCTAC TGGAAGGAGA CTGGCTAAAG AGGTGAGGCT TGCGATAGAG AGTGATTCAC TCAGGAAGAT AGTGACCTGA
|
Protein sequence | MIDRSRIYEI LDGYEMDDVR IGMIASHSAL DVADGAVEEG FRTLAVCQEG RERTYVKYFR AERAPDGRII TGMIDEVVVL KRFKDILDQQ DMLLRKNVLF VPNRSFTSYC GIDSVEDEFE VPLVGSRNLL RSEERGDKMD YYWLLEKAGL PYPEQIEPDE IDCLVIVKLP HAVKTLERGF FTAASAEEYY EKSELLLRQG VIDSDGLELA RIERYIIGPV FNLDFFYSPL RDRIELLGID WRFETSLDGH VRLPAPQQLR LNEKQINPEY TVCGHNSATL RESLLEKAFD LAEKYVAATK EYYPPGIIGP FCLQTCVDKD LNFYIYDVAP RIGGGTNIHM AVGHPYGNAL WRTNMSTGRR LAKEVRLAIE SDSLRKIVT
|
| |
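The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This fits translation table 11, which accepts TTG as an alternative initiation codon read as methionine. The sketch below is a minimal pure-Python illustration, not the database's own pipeline: it strips the 10-base grouping, translates only the first 30 bases of the gene sequence, and includes a small GC-content helper. The function names and the hard-coded start codon set (taken from the NCBI definition of table 11) are assumptions of this example.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Initiation codons permitted by translation table 11 (assumption: NCBI definition).
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, reading a valid table 11 start codon as M; stop at '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in TABLE11_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence listed above.
prefix = "TTGATCGATC GCAGCAGGAT CTATGAGATT"
print(translate(prefix))  # MIDRSRIYEI
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` in the same way would allow the whole protein sequence and the GC content row to be checked.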