Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_07381 |
Symbol | purK |
ID | 4717443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 657471 |
End bp | 658661 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640078452 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001009131 |
Protein GI | 123968273 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.210398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTA AAAAAAATAT AAACGATATC AAGAAAAATT ATTCCCTAGG AATAATTGGA GGTGGTCAAC TGGCACTGAT GTTAACCGAG GCAGCAAAAA AAAGAGATTT AGAAGTATGT GTGCAAACAA AATCTTGTGA TGATCCTGCT GGCTCAAAAG CAGATCATGT CATAGAAGCT GATCCTTTAA AGATAAGAGG AAATAAATCA TTAATTAATG AGTGTGAAAA AATAATTTTT GAAAATGAAT GGATAAAAAT TGATAAATTA AATTTAATTG GCAATGAAGA TATTTTTGTT CCAAGCCTTA ATGCAATTAA GCCATTAGTA GATAGGTTTT CTCAAAAAAA ATTAATAGAT AGAATGAATA TTCCCTGTCC AAAATGGATA AGTATTGAAG ATTTTAAAAA TCTCTCGGAT GAGGAAATCA ATAATTGGAC TTTTCCTCTA ATGGCAAAAT CAAATAAAGG TGGATACGAC GGCAAAGGGA ACAGAAAAAT AAAGACAAAA GAAGATTTAG ATTCTTTTTT AACAGAGAAT AATTCTGATG AATGGTTAAT AGAAGAATGG ATAGAGTATG AAAAAGAACT GGCTCTTGTT GGTTCGAGAG ATAGGACCGG TAAGATAAGA TTCTTTCCAA TAGTTGAGAC GTTCCAATCA AACCATGTTT GTGATTGGGT TCTTGCACCT GGATCAAATG AATATGATTT GAACTTATTT GCAATAAATA TTTTCTCTTC AATAGTCAAT GAACTTAATT ACGTTGGAGT TTTAGCTATT GAATTCTTCT ATGGAGATAA TGGTCTTTTA ATTAATGAAA TAGCTCCTAG AACACATAAC TCAGCTCATT TCTCTATTGA AGCTTGCACA TCAAGTCAGT TTGATCAATA TGTTTGCATT TCTTCTGGGA TAATACCACC TGAAATTAAA ATGAACTGTG AAGGTGCAAT TATGATAAAT CTACTGGGGT TAAAAAAGAA TTTCCCAATC TCAATGGAAA CCAGAATTAA AATGTTATCT GAAATTGAGG GTTCTAATAT TCATTGTTAT GGCAAATCTC GCGAAATTCT TGGACGAAAA ATGGCTCACA TCACATTTTT ATTAAATGGT AAAACGCATT CAGAAAGATA TGATGAAGCT CAAATTTTAT TAACTATGGT AAGAGACATT TGGCCATCTC CAAATGCATA A
|
Protein sequence | MSFKKNINDI KKNYSLGIIG GGQLALMLTE AAKKRDLEVC VQTKSCDDPA GSKADHVIEA DPLKIRGNKS LINECEKIIF ENEWIKIDKL NLIGNEDIFV PSLNAIKPLV DRFSQKKLID RMNIPCPKWI SIEDFKNLSD EEINNWTFPL MAKSNKGGYD GKGNRKIKTK EDLDSFLTEN NSDEWLIEEW IEYEKELALV GSRDRTGKIR FFPIVETFQS NHVCDWVLAP GSNEYDLNLF AINIFSSIVN ELNYVGVLAI EFFYGDNGLL INEIAPRTHN SAHFSIEACT SSQFDQYVCI SSGIIPPEIK MNCEGAIMIN LLGLKKNFPI SMETRIKMLS EIEGSNIHCY GKSREILGRK MAHITFLLNG KTHSERYDEA QILLTMVRDI WPSPNA
|
| |