Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0016 |
Symbol | purP |
ID | 5105155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 13486 |
End bp | 14529 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640505909 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase-like protein |
Protein accession | YP_001190117 |
Protein GI | 146302801 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000276238 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00501926 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATAG CGTCGGTAGC CAGCCACTCA GCCCTGGACG TGTTTGATGG TGCTAAGGAC GAGGGATTTA GTACAATAGC CCTTTGTAAA AAGGGGAGAG AAAGACCATA TAAGGAATTC AAGAGAATAG TTGACACTTG CATCATCCTC AACGATTTTA AGGAGATATC CAGTGACGAG ATTCAGACGA GACTGATCGA GCAAGACGCG CTAATAGTCC CGAACCGAAG TATGGCAGTG TATCTAGGTT ACGATGCCAT AGAGGCCATG AAGGTCAAGT TCTTCGGTAA TAGAATGATG CTTAGATGGG AGGAGAGGAC TGGTGAAAAG AATTACTACA GGATACTTGA CGAGGCTAAA ATAAGGAGGC CTAAGACCCT AAGGCCTGAG GAAGTTGATA GACCTGTTAT AGTCAAAATA CCCGAGGCTA AGAGAAAGGT AGAAAGGGGA TTTTTCTTCG CAGCTAATAG GGAAGATTTT CAGGAAAAGT TGAGGAAACT GGAGAAGGAT GGGATAATAG ATCAACAGGG AATATCGCAA ATGGTCATAG AGGAGTTCAT TTTTGGGGCT TATTTCAATA TCAACTACTT TTACTCGCCA ATATTTGATA GAGTTGAAAT AATTAGCGTA GATAGAAGGA TACAAAGCGA TTGGGACTCA TTTTACAGAC TTCCCTCTGA CATACAGAGT AAGCTGGGGA GATATCCAAG ACTAATTGAG GTGGGCCACG AGCCTGCGAC CATCAGGGAA AGCATGCTCG AAAAGCTGTT CGACGCGGGC TACTCGTTCG TAGAGACTAC AAGAAAGCTG GAAAAGGGGG GTATAATTGG GCCTTTCACC CTTCAACTTG CAGTTACTCC AGACTTAGAC ATTGTGGTCT TCGACGTTGC CCCAAGGATA GGTGGCGGAA CTAACGCGTA CATGGGAATA GGAAGTCAAT ACTCTAAATT ATACTTCGGC AAGCCCATCA GTCTAGGAAG AAGAATAGCC ATAGAAATCA AGGAAGCTAT ACAGAAGAAG GAGCTGGACA AGGTTATAAG CTAG
|
Protein sequence | MKIASVASHS ALDVFDGAKD EGFSTIALCK KGRERPYKEF KRIVDTCIIL NDFKEISSDE IQTRLIEQDA LIVPNRSMAV YLGYDAIEAM KVKFFGNRMM LRWEERTGEK NYYRILDEAK IRRPKTLRPE EVDRPVIVKI PEAKRKVERG FFFAANREDF QEKLRKLEKD GIIDQQGISQ MVIEEFIFGA YFNINYFYSP IFDRVEIISV DRRIQSDWDS FYRLPSDIQS KLGRYPRLIE VGHEPATIRE SMLEKLFDAG YSFVETTRKL EKGGIIGPFT LQLAVTPDLD IVVFDVAPRI GGGTNAYMGI GSQYSKLYFG KPISLGRRIA IEIKEAIQKK ELDKVIS
|
| |