Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_17981 |
Symbol | purM |
ID | 4718533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1530614 |
End bp | 1531657 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640079529 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001010188 |
Protein GI | 123969330 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.742703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTACA AAACATCAGG TGTTGATATA GAAGCTGGGC GAGAATTTGT TTCCGAAATT AAACAGGCAG TTGAAGGAAC ACATACATCT AATGTGATTG AGGGTATTGG TGGGTTTGGA GGTTTGTTTA GAATTCCTAT CGATAGTTTT AAAAAACCAG TTCTTGTTTC AGGAACTGAT GGTGTTGGAA CAAAATTAGA ATTAGCCCAA AGTAAAAACT TCCACTTTGA GGTTGGTATT GATTTAGTTG CTATGTGTAT GAATGATATT ATTACAAGTG GGGCAAAACC TTTATTTTTT TTGGATTATA TTGCTACTGG TAAACTTGAT AAGAATCAAT TATTGAGGGT TGTTAAGGGA ATTTCACATG GCTGCGGAGA AAATAACTGT TCATTACTCG GCGGAGAAAC TGCTGAGATG CCTGGATTTT ATTCAAAAAA TAAGTATGAT CTTGCAGGAT TTTGTGTTGG AATAGTTGAT GAGGATAAGC TTATTAATGG TAAAAAAGTA TCTGAAAATG ACTTAATAAT TGCTTTAAAA AGTAATGGAG TTCATAGTAA CGGCTTTAGT TTAGTAAGAA AAATTATTCA AAATAATAAT CAAATAGATA AAGAATTTGA AAAAGTTTTT CACTTAAATT TTTATGATGA GTTATTGAAA CCTACAAAAA TTTATAATAA TGTGATTAAC CAAATGTTAA CTGAAAATAT AGAAATTAAA GCAATGTCTC ATATTACTGG GGGAGGAATT CCAGAAAATT TACCAAGATG TATGCCTTCT GATTTTATTC CTTATGTAGA TACCGGTTCT TGGGAAATAC CTATTATATT TAAATTCCTC AAAGAGAAAG GATCGATTCC TGAAAAAGAT TTTTGGAATA CTTTCAATCT TGGAGTAGGA TTTTGTTTAA TTATTGATAA ACAATTTAAG GATCCGATAT TAAATATCTG TAAAGATAAT GAAATAGATA GTTGGGAAAT TGGAAAGATA GTTCGAAAAA ATGATTCAAC GATTAGTAAA TTTTTGCCAG AAATTTTAAC TTAA
|
Protein sequence | MDYKTSGVDI EAGREFVSEI KQAVEGTHTS NVIEGIGGFG GLFRIPIDSF KKPVLVSGTD GVGTKLELAQ SKNFHFEVGI DLVAMCMNDI ITSGAKPLFF LDYIATGKLD KNQLLRVVKG ISHGCGENNC SLLGGETAEM PGFYSKNKYD LAGFCVGIVD EDKLINGKKV SENDLIIALK SNGVHSNGFS LVRKIIQNNN QIDKEFEKVF HLNFYDELLK PTKIYNNVIN QMLTENIEIK AMSHITGGGI PENLPRCMPS DFIPYVDTGS WEIPIIFKFL KEKGSIPEKD FWNTFNLGVG FCLIIDKQFK DPILNICKDN EIDSWEIGKI VRKNDSTISK FLPEILT
|
| |