Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_17811 |
Symbol | purM |
ID | 4911847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1502580 |
End bp | 1503623 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640161383 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001092005 |
Protein GI | 126697119 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.225927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTACA AAACATCAGG TGTTGATATA GAAGCTGGGC GAGAATTTGT TTCCGAAATT AAACAGGCAG TTGAAGCAAC TCATACATCT AATGTGATTG AGGGTATTGG CGGGTTCGGA GGTTTGTTTA GAATTCCTAT CGATAGTTTT AAAAAACCAG TTCTTGTTTC AGGGACTGAT GGTGTTGGAA CAAAATTAGA ATTAGCTCAA AGTAAAAACT TTCACTTTGA GGTTGGCATT GATTTAGTTG CTATGTGCAT GAACGATATT ATTACTAGTG GTGCAAGACC TTTATTTTTT CTAGATTATA TTGCTACAGG TAAGCTTGAT AAGAAACAAT TATTGAGGGT TGTTCAGGGA ATTTCACATG GATGCGGAGA AAATAACTGT TCATTACTTG GCGGAGAAAC TGCTGAAATG CCTGGATTTT ATTCAAAAAA TAAGTATGAT CTTGCAGGAT TTTGTGTTGG AATAGTTGAT GAGGATAAGC TTATTAATGG TAAAAAAGTC TCTGAAAATG ACTTAATAAT TGCTTTAAAA AGTAATGGAG TTCATAGTAA CGGCTTTAGT TTAGTAAGAA AAATTATTCA AAACAATAAT CAAATAGATA AAGAATTTGA AAAAGTTTCT CACTTAAATT TTTATGATGA GTTATTAAAA CCTACAAAAA TTTACAATAA TGTGATTAAC CAAATGTTAT CTGAAGATAT TGAAATTAAA GCAATGTCTC ATATAACTGG AGGAGGAATT CCAGAAAATT TACCAAGATG TATTCCTTCT GATTTTATTC CTTATATCAA TACCAGTTCT TGGCAAATAC CTACTTTATT TAAATTCTTA AAAGAGAAAG GATCTATTCC TGAAAGAGAT TTTTGGAATA CTTTTAATCT TGGAGTTGGA TTTTGTTTAA TTATTGATAA ACAATTTAAG GACGCGATAT TAAGTATCTG CAAAGATTAT GGTATAGATA GTTGGGAAAT TGGAAAGATA GTTCGAAAAA ATGATTCAAC AATTAGTAAA TTTTTACCAG AAATTTTAAC TTAA
|
Protein sequence | MDYKTSGVDI EAGREFVSEI KQAVEATHTS NVIEGIGGFG GLFRIPIDSF KKPVLVSGTD GVGTKLELAQ SKNFHFEVGI DLVAMCMNDI ITSGARPLFF LDYIATGKLD KKQLLRVVQG ISHGCGENNC SLLGGETAEM PGFYSKNKYD LAGFCVGIVD EDKLINGKKV SENDLIIALK SNGVHSNGFS LVRKIIQNNN QIDKEFEKVS HLNFYDELLK PTKIYNNVIN QMLSEDIEIK AMSHITGGGI PENLPRCIPS DFIPYINTSS WQIPTLFKFL KEKGSIPERD FWNTFNLGVG FCLIIDKQFK DAILSICKDY GIDSWEIGKI VRKNDSTISK FLPEILT
|
| |