Gene Information |
Locus tag | Mbur_1626 |
Symbol | purP |
ID | 3997260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 1705351 |
End bp | 1706418 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637959383 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_566280 |
Protein GI | 91773588 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
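The coordinate and length fields in the table above agree with one another, which a little arithmetic can confirm. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1705351
end_bp = 1706418

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1068 355
```

Under these assumptions the computed values match the listed Gene Length (1068 bp) and Protein Length (355 aa).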
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0313609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATCAA AACAACAAAT CTCTGAGATA ATCAGCAACT ACGATCTAAA TGATCTTGCT ATTGCAACTG TTTGCTCACA CTCAAGCCTC CAGATATTCG ATGGAGCACG AAAAGAGGGA CTAAGAACAA TAGGGATTTG TGTGGGTCAG CCACCACGTT TTTATGATGC ATTTCCAAAA GCAAAACCTG ACGAGTATAT TGTCGTGGAA AGCTATTCCG ACATACCAAA GATAGCGGAA GAACTTGTCA GAAAGAATGC TATTGTAATA CCTCACGGTT CTTTTGTTGA ATATATGGGT ACAGAGAGTT TCGCCGAGTT AGCCGTACCA ACCTTTGGTA ACAGGGAAGT GCTTGAATGG GAGTCAGACA GGGACAAGGA AAGAGAATGG CTTGAAGGCG CAGGTATCCA CATGCCAAAG ATCGTCGATC CTGAAAAGAT CGAAAGCCCG GTAATGGTCA AGTACCATGG TGCAAAGGGT GGCAGGGGAT TCTTTATTGC CAAGGACTAT GAAGAGTTCA AGCAATATAT TGACCCTAAC GAAAAACACA CAGTACAGGA ATTCATTGTA GGTACACGTT ATTATCTCCA CTTCTTCTAC TCCCCAATAA GAGAAGAAGG ATACAAGTTA AGCGAAGGAA TACTTGAGAT GCTCAGCATG GACCGCAGGG TAGAATCGAA CGCTGACGAG ATATTCAGGC TCGGATCCCC GAAGGAACTT GAAGACGCAG GCATTCACCC TACATATGTA GTTACAGGAA ATGTCCCACT TGTCGCAAGA GAATCCCTTT TGCCACGCAT ATTCGCACTG GGGGAAAAGG TTGTTGAGGA ATCCCTCGGC CTGTTTGGCG GTATGATCGG ACCATTCTGT CTTGAGACCG TTTTTACCGA CAAACTTGAG ATCAAGGTCT TTGAGATCTC AGCCCGTATA GTTGCAGGTA CCAATCTCTA CACATCAGGC TCTCCCTACT CAGACATGAT CGAGGAAAAT CTTTCCACAG GAAAGAGGAT CGCACAGGAA ATAAAATTGG GAGCCAAGAC AGGCAAGCTG GACCTTATTT TGTCATAA
|
Protein sequence | MISKQQISEI ISNYDLNDLA IATVCSHSSL QIFDGARKEG LRTIGICVGQ PPRFYDAFPK AKPDEYIVVE SYSDIPKIAE ELVRKNAIVI PHGSFVEYMG TESFAELAVP TFGNREVLEW ESDRDKEREW LEGAGIHMPK IVDPEKIESP VMVKYHGAKG GRGFFIAKDY EEFKQYIDPN EKHTVQEFIV GTRYYLHFFY SPIREEGYKL SEGILEMLSM DRRVESNADE IFRLGSPKEL EDAGIHPTYV VTGNVPLVAR ESLLPRIFAL GEKVVEESLG LFGGMIGPFC LETVFTDKLE IKVFEISARI VAGTNLYTSG SPYSDMIEEN LSTGKRIAQE IKLGAKTGKL DLILS
|
| |
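As a rough cross-check of the sequence fields, the gene sequence can be translated and its GC fraction computed. The sketch below is plain Python with hypothetical helper names (`clean`, `gc_fraction`, `translate`). It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted) and that the space-separated blocks of ten characters are display formatting only. For brevity, only the first 30 bases of the gene sequence are used.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGATATCAA AACAACAAAT CTCTGAGATA"
print(translate(prefix))  # MISKQQISEI
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.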