Gene Information |
Locus tag | P9515_07561 |
Symbol | purK |
ID | 4719508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 685655 |
End bp | 686845 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640080435 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001011072 |
Protein GI | 123965991 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
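
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (which the listed values imply) and that the final codon of the CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(685655, 686845)
print(length, protein_length_aa(length))  # prints: 1191 396
```

Both results agree with the Gene Length and Protein Length fields of this record.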
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.645989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAA AAAGAAATAT AAAAAATATT AGTAAAAATT ATTCACTAGG AATAATTGGA GGAGGTCAAT TAGCCTTAAT GCTTACTGAA GCGGCAAACA AAAGAGGAAT AAAAGTATGC GTTCAAACTA AATCTTCCAA TGACCCAGCA GGCTCAAAAG CAGATTGTGT TATTGAGGCT GATCCTCTTA AGATAAAAGG AAATAAGGAT CTAATTAAAA AGTGCGAAAA AATTATTTTT GAAAATGAAT GGATTAGAAT CGAAAAATTA AATTTAATTG AATCTAATAA TATTTTTGTA CCAAGCCTGC AATCGATTCA ACCTTTGGTA GATAGAATTT CTCAAAAAAA ATTAATAGAG AAAATGGGCC TGCCCTCACC AAGATGGATT TCAATAAAGG ATTTTAAAAT TCTTGAGCAT AAAGAAATTG AGGATTGGAA TTTTCCCTTA ATGGTTAAAT CTCTTAAAGG AGGATATGAC GGTAAAGGAA ATAAAAAAAT TAATAACAAA GAAGATTTGA ATTCCTTTTT GGTTGGAGCG GAATCAGACG ATTGGCTAAT TGAAGAATGG ATTGACTATA AAAAGGAGCT TGCTCTGGTT GGATCAAGAG ATTTTGATGG CAAAATAAGG CTATTTCCCA TTGTTGAGAC ATTTCAAAAA AATAACGTTT GTGATTGGGT TTTGTCTCCA GCTGAAATTA ATTATGACTT GAAAACTTTT GTGATTAATA TTTTTTCCTC AATAGTAAAT GAACTCAATT ATGTTGGAGT AATGGGAATT GAATTCTTTT ATGGTGATAA AGGACTATTA ATTAATGAAA TTGCCCCTAG AACTCATAAT TCTGCTCACT TTTCTATAGA GGCTTGTACT TCAAGCCAGT TCGATCAATA TATTTGTATA TCTTCAGGTG CCAAGCCCCC AGATATTAAT TTGAATTCTC ATGGCTCCTT AATGATAAAT TTATTAGGTT TGAAAAAAGA TTTCCCTTTG TCAATAGAAA AAAGGATAGA GTTTTTAACA CAAATTAAGG GGTCTAACCT TCATTGGTAT GGAAAATCTA AAGAAAGTGT TGGACGGAAA ATGGGTCATA TAACTTTTTT ACTGAATGAG AATAATTATT TGAAAAGGAA TGAAAAATCA AGAGAAATAT TAAATAAGGT AAGAGAGATT TGGCCATCTC CAAATGAATA A
|
Protein sequence | MTQKRNIKNI SKNYSLGIIG GGQLALMLTE AANKRGIKVC VQTKSSNDPA GSKADCVIEA DPLKIKGNKD LIKKCEKIIF ENEWIRIEKL NLIESNNIFV PSLQSIQPLV DRISQKKLIE KMGLPSPRWI SIKDFKILEH KEIEDWNFPL MVKSLKGGYD GKGNKKINNK EDLNSFLVGA ESDDWLIEEW IDYKKELALV GSRDFDGKIR LFPIVETFQK NNVCDWVLSP AEINYDLKTF VINIFSSIVN ELNYVGVMGI EFFYGDKGLL INEIAPRTHN SAHFSIEACT SSQFDQYICI SSGAKPPDIN LNSHGSLMIN LLGLKKDFPL SIEKRIEFLT QIKGSNLHWY GKSKESVGRK MGHITFLLNE NNYLKRNEKS REILNKVREI WPSPNE
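
The gene and protein sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned, translated, and scored for GC content follows. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 and last 21 bases of the record are used so that the example stays short.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as table 1; they differ in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove display whitespace and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_30 = "ATGACTCAAA AAAGAAATAT AAAAAATATT"  # start of the gene sequence
last_21 = "TGGCCATCTC CAAATGAATA A"            # end of the gene sequence, in frame

print(translate(first_30))  # prints: MTQKRNIKNI
print(translate(last_21))   # prints: WPSPNE
```

The two translated fragments match the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.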
|
| |