Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_1029 |
Symbol | purP |
ID | 3998769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | - |
Start bp | 1111845 |
End bp | 1113002 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637958805 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase-like protein |
Protein accession | YP_565714 |
Protein GI | 91773022 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACA GGAAAGAGAT TATTGAGATT GCGGAGAGCT ATTATACTGA TGACATAAAG ATCGGTACAG TTGCTTCTCA TTCAGGATTG GATGTATTTG ACGGTGCCAT CGAGGAAAAT TTCGAGACCT TTGCAATATG CCAGGCGGGT CGTGAAAAAA CATATACCGA GTACTTCAAG TCAAAAAGGG ATGCCAATGG CAATGTTGTG CGCGGTATCG TTGATGATCA TGTTGTATAT GATAAATTCA ATGAGCTCAT GCTTCCAGAG AACCAGCAGA AGCTTGTGGA CGACAATGTT CTTTTCATAC CTAACAGATC CTTTACTTCT TACTGTGACA TCGATGAGGT CGAGAACGAT TTCCGTGTAC CAATGGTCGG AAGCAGGAAC ATGCTCCGAA GTGAGGAGCG CGGTATGGAC CAGGATTATT ACTGGCTTCT TGAGAAGGCT GGTCTCCCAT TCCCTGAAAG GATAAACGAT CCTGAAGACA TTGATGAGCT TGTAATGGTA AAGCTCCCTC ATGCAGTAAA GAAACTTGAG CGTGGGTTCT TCACTGCCGG AACTTACAGT GAATATGTGG AGAAGTCCGA GTCCCTTATC AAACAGGGTG TAATTACAAG GGAAGCCCTT GCGGAAGCAA GGATCGAGCG CTATATTATT GGTCCGGTCT TCAATTTTGA TATGTTCCAT TCTCCTATCG AGGAAGAAAT GAACAAGACC GAGATCCTTG GTGTTGACTG GAGGTTCGAG ACAAGTCTGG ACGGTTATGT CAGGCTTCCG GCACCACAGC AGATGAATCT CGCAGAGCAT CAGTTAACTC CTGAGTACAC AGTATGTGGT CACAATTCTG CAACACTTCG CGAGTCTCTC CTTGAGGAGG TTTTCAAACT TTCAGAAATG TATATCAAAG CATCCAAGGA GTTCTATGAC CCCGGGGTCA TTGGTCCTTT CTGTCTTCAG ACATGCATTG ATAAAGATCT GAACTTCCAC ATTTATGATG TTGCTCCACG GGTTGGCGGT GGGACGAACG TTCACATGTC TGTTGGCCAT CCATATGCTA ATACATTATG GCGTAAACCT ATGAGTACTG GAAGGCGCGT TGCCTTTGAG GTACGTCGTG CTATTGAATC CGGGCAATTA GATAAGATCA TAACATGA
|
Protein sequence | MIDRKEIIEI AESYYTDDIK IGTVASHSGL DVFDGAIEEN FETFAICQAG REKTYTEYFK SKRDANGNVV RGIVDDHVVY DKFNELMLPE NQQKLVDDNV LFIPNRSFTS YCDIDEVEND FRVPMVGSRN MLRSEERGMD QDYYWLLEKA GLPFPERIND PEDIDELVMV KLPHAVKKLE RGFFTAGTYS EYVEKSESLI KQGVITREAL AEARIERYII GPVFNFDMFH SPIEEEMNKT EILGVDWRFE TSLDGYVRLP APQQMNLAEH QLTPEYTVCG HNSATLRESL LEEVFKLSEM YIKASKEFYD PGVIGPFCLQ TCIDKDLNFH IYDVAPRVGG GTNVHMSVGH PYANTLWRKP MSTGRRVAFE VRRAIESGQL DKIIT
|
| |