Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0017 |
Symbol | purP |
ID | 5105156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 14526 |
End bp | 15524 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640505910 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_001190118 |
Protein GI | 146302802 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.117033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00153786 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCTATTAC TCACTCTAGC AAGCCATTCC TCGCTTCAGA TTCTTCACGG GGCCAAAAGG GAAGGCTTCG AGACTGGGAT AGTTGTAAAT GGCAAGAGGG AAGGTTTTTA CAGAAGATTC AGCTTCATAG ATAACTTCTA TGTGTATTCC AGTGAAGATG AGGCTGTGGA AAAGATCAAT GGCACTTCCA ACTCCGTTTT TGTACCACAT GGAAGTCTCA TCGAGTACAT CGGCATGGAG AGGGTTTCAC GGATAAGGAC GCCCATTTTC GGTAACAGGA ACTTGTTTCC ATGGGAATCA AATCAATCCA AAAAAATGAA ATTATTGGAG CTAAGCAATA TAAAAACACC GATGAAGTTT GAAAACCCTG AGGATGTCGA TAGAATGGTT ATCGTGAAAT TGCCAGGTGC CAAGGGTGGA AGAGGTTACT TCATAGGTAG GAATAAACAG GAGGTCAAGG AGGGGATAAG GAGGCTCCAG GAAAAAGGGC TGATCAACAA CGTAGAGGAA TTAATAATAC AGGAATACGT GATTGGGATT CCTATGTACT TTCAATTCTT CTACAGTCCA ATGCTACAGC GTGTGGAGAT GACTGGGATT GATATCAGGT ATGAGACCAA CGTTGACGGT CTGAGGAGAC TTCCAAGTGA TATTAAAGCC GAGCCCACGT TCGTGGTCGC AGGCAACATC CCAGCGGTAG CCAGAGAGAG TATATTGCCT TCAGTATATG AATACGCAGA AAATTTTGTT AAAACTACAA AAAATGTGGT GCCTCCCGGG GCCATAGGTC CCTTCTGCCT TGAATCCATA GTTACGGATA CCCTTGATGT GGTTGTCTTT GAGTTCTCGG GAAGGATAGT GGCCGGAACT AACCTTTATG TGGATGGGAG TCCCTACAGT TGGCTCTATT GGGATGAACC CATGAGCGTC GGAAGAAGGA TAGGGAGAGA AATAAATCTA GCAATCCAAA AGAACAGGTT AAATGAGGTG ACCACATGA
|
Protein sequence | MLLLTLASHS SLQILHGAKR EGFETGIVVN GKREGFYRRF SFIDNFYVYS SEDEAVEKIN GTSNSVFVPH GSLIEYIGME RVSRIRTPIF GNRNLFPWES NQSKKMKLLE LSNIKTPMKF ENPEDVDRMV IVKLPGAKGG RGYFIGRNKQ EVKEGIRRLQ EKGLINNVEE LIIQEYVIGI PMYFQFFYSP MLQRVEMTGI DIRYETNVDG LRRLPSDIKA EPTFVVAGNI PAVARESILP SVYEYAENFV KTTKNVVPPG AIGPFCLESI VTDTLDVVVF EFSGRIVAGT NLYVDGSPYS WLYWDEPMSV GRRIGREINL AIQKNRLNEV TT
|
| |