Gene Information |
Locus tag | Mthe_0184 |
Symbol | purP |
ID | 4462751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 179435 |
End bp | 180505 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639699192 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_842623 |
Protein GI | 116753505 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTCGA AAGAGGAGAT TTTGGAGATA CTGAATGGGT ATGATCTGAA GAACATCACA ATAGCCACGG TCTGCTCCCA CAGCAGCCTT CAGATATTTC ACGGCGCGAA ACAGGAGGGC TTCAGGACTC TCGGCATATG TATAGGTCCG CCCCCGAGGT TCTATGATGC CTTCCCTCTC GCAAAGCCGG ATGCATTTAT ATCGCTGGAC AGTTATAAAG CCATGCTGGA TGAGAGCGAC CGTCTGATCG ATGAGAACGC CATAATAATA CCACACGGCT CGATGGTCGA GTATCTCGGC ATCCAGAACT TCGAAGCTCT GCCTGTGCCG ACCTTTGGGA ACAGAAGATG CCTCGCCTGG GAGAGCGACC GCGAGATGGA GCGAGAGTGG CTTCTCAGGG CTGGTGTGAA CGTCCCCATG AGGTTCGAGA ACGCCGAGCT GATAGACAGG CCTGTCATAG TCAAGTACCA CGGCGCCAAG GGCGGAAAGG GGTTCTTCAT AGCAAAGAAT AAGGAAGAGT TCCAGTCGAA GATCCAGCAG GGACAGAAAT ACACCATACA GGAGTTCATC TTAGGAACAA GATACTACAT ACATTTCTTC TACTCTCCCA TAAGGGAGAA GGGCTACCGT CTGAGGAAGG GCACACTCGA CATGCTCGGG ATCGACAGGC GTGTGGAATC AAATGCTGAC GAGATATTCA GGATAGGATC GGTGAACGAG CTCGAGGCTG CAGGGATATA CCCCAGCTTC GTCGTGACCG GGAACCTGCC GCTGGTTCTG CGAGAGTCGC TTCTTCCGAA GGTATTCGAC CTCGGCGAGA GGGTCGTCGA GACGTCGATA GAGTTATTTG GCGGCATGGT GGGTCCGTTT AGCCTCGAGA CGATAGTGAC CGACGATCTG GACTTCAAGG TCTTCGAGAT ATCTGCAAGA ATCGTTGCAG GTACAAACCT CTTCATAAGC GGATCGCCAT ATTCTGATCT CATTGAGAAG GGCCTCTCCA CAGGAAGGAG AATCGCCCAG GAGATCGCGC TCGCGAGAAG CATGGGGGCG CTTGGAGAGG TCATAAGCTG A
|
Protein sequence | MISKEEILEI LNGYDLKNIT IATVCSHSSL QIFHGAKQEG FRTLGICIGP PPRFYDAFPL AKPDAFISLD SYKAMLDESD RLIDENAIII PHGSMVEYLG IQNFEALPVP TFGNRRCLAW ESDREMEREW LLRAGVNVPM RFENAELIDR PVIVKYHGAK GGKGFFIAKN KEEFQSKIQQ GQKYTIQEFI LGTRYYIHFF YSPIREKGYR LRKGTLDMLG IDRRVESNAD EIFRIGSVNE LEAAGIYPSF VVTGNLPLVL RESLLPKVFD LGERVVETSI ELFGGMVGPF SLETIVTDDL DFKVFEISAR IVAGTNLFIS GSPYSDLIEK GLSTGRRIAQ EIALARSMGA LGEVIS
|
| |
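The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene sequence includes the terminal stop codon (the sequence above ends in TGA). The helper names `gene_length`, `protein_length`, and `gc_percent` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace from 10-base grouping is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information table for Mthe_0184.
length = gene_length(179435, 180505)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields. `gc_percent` can be applied to the Gene sequence field as pasted, since it strips the spaces between the 10-base groups; the record reports a rounded value of 53%.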