Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_07361 |
Symbol | purK |
ID | 4912440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 656308 |
End bp | 657498 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640160318 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001090960 |
Protein GI | 126696074 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.291651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAA AAAAAAATAT AAACGATTTT AAGAAAAATT ATTCCCTGGG AATAATTGGA GGTGGTCAAT TGGCTTTGAT GTTAACCGAG GCAGCAAAAA AAAGAGATCT TGAAGTATGT GTGCAAACAA AATCTTGTGA TGATCCTGCT GGGTTAAAAG CAGATCATGT CATAGAAGCT GATCCTTTAA AGATAAGAGG TAATAAATCA TTAATTAATG AGTGTGAAAA AATAATTTTT GAAAATGAAT GGATAAAAAT TGATAAATTA AATTTAATTG ACAATAAAGA TATTTTTGTT CCAAGCCTTA ATGCAATTAA GCCATTAGTA GATAGGTTTT CTCAAAAAAA ATTAATAGAA AGAATGAATA TTCCCTGCCC AAAATGGATA AGTATTGAAG ATTTTAAAAA TCTCTCGGAT CAGGAAATCA ATAATTGGAC TTTTCCTCTA ATGGCAAAAT CAAATAAAGG TGGATATGAC GGCAAAGGGA ACAGAAAAAT AAAGACAAAA GAAGATTTAG ATTCTTTTTT AAAAGAGAAC AATTCTAATG AATGGTTAAT AGAAGAATGG ATAGAGTATG AAAAAGAACT GGCTCTTGTT GGTTCGAGAG ATAGGACCGG TAAGATAAGA TTCTTTCCAA TAGTTGAGAC GTTCCAATCA AACCACGTTT GTGATTGGGT TCTTGCCCCT GGAACAAATG AATATGATTT GAACTTATTT GCAATAAATA TTTTCTCTTC AATAGTCAAT GAACTTAATT ACGTTGGAGT TTTAGCTATT GAATTCTTCT ATGGAGATAA TGGTCTTTTA ATTAATGAAA TAGCTCCTAG AACACATAAC TCAGCTCATT TCTCTATTGA AGCTTGCACT TCAAGTCAGT TTGATCAATA TGTTTGCATT TCTTCTGGGA TAATGCCACC TGAAATTAAA ATGAACTGTG AAGGTGCAAT TATGATAAAT TTACTGGGTT TAAAAAAGAA TTTCCCAATC TCAATGGAAA CCAGAATTAA AATGTTATCT GAAATTGAGG GTTCTAATAT TCATTGTTAT GGCAAATCTC GCGAAATTCT GGGAAGAAAA ATGGCTCACA TCACATTTTT ATTAAATGGT AAAACGCATT TAGAAAGATA TGATGAAGCT CAAATTTTAT TAACTATGGT AAGAGACATT TGGCCATCTC CAAATGCATA A
|
Protein sequence | MSLKKNINDF KKNYSLGIIG GGQLALMLTE AAKKRDLEVC VQTKSCDDPA GLKADHVIEA DPLKIRGNKS LINECEKIIF ENEWIKIDKL NLIDNKDIFV PSLNAIKPLV DRFSQKKLIE RMNIPCPKWI SIEDFKNLSD QEINNWTFPL MAKSNKGGYD GKGNRKIKTK EDLDSFLKEN NSNEWLIEEW IEYEKELALV GSRDRTGKIR FFPIVETFQS NHVCDWVLAP GTNEYDLNLF AINIFSSIVN ELNYVGVLAI EFFYGDNGLL INEIAPRTHN SAHFSIEACT SSQFDQYVCI SSGIMPPEIK MNCEGAIMIN LLGLKKNFPI SMETRIKMLS EIEGSNIHCY GKSREILGRK MAHITFLLNG KTHLERYDEA QILLTMVRDI WPSPNA
|
| |