Gene Information |
Locus tag | P9303_29941 |
Symbol | ppk |
ID | 4776344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2643852 |
End bp | 2645990 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640088518 |
Product | polyphosphate kinase |
Protein accession | YP_001018989 |
Protein GI | 124024682 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0855] Polyphosphate kinase |
TIGRFAM ID | |
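The coordinate, gene-length, and protein-length fields above are tied together by simple arithmetic, which the following Python sketch checks. It assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2643852, 2645990)
print(length, protein_length(length))  # 2139 712
```

Under those assumptions the record is internally consistent: 2139 bp corresponds to 713 codons, one of which is the stop codon.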
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.500566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATC CAGCGGTAGC TTCGGAGCAC TACATCAACC GCGAGCTCAG CTGGATCGCC TTCAACGAAA GGGTCCTTGC TCAGGCACTG AATACGCGCA CCCCGCTGCT GGAACAAGCC AAGTTCAGTG CCATCTTCAG CAACAACCTC GACGAATTCT TCATGGTTCG CGTGGCCTCC CTCAAAGCGC AAGTGGAAGC CGGCATTACC AAAACCAGCG CAGATGGTCT AACCCCTATT CAGCAGCTCC TCACGATCCG AGATCACCTT GTTCCTCTCA TCGAGCAACA ACAAGATCAC TACCGTAAAC ACCTCAAAAA CCAGTTAGTC GAGCAAGGTG TTCACCTACT CGACTATGAG CAACTCAATC AAAAAGAACG CCTCTGGGTT GACAATTATT TCCAGACAGC CATTTTCCCG GTGCTGACAC CACTAGCCGT GGATCAAGCC CATCCCTTCC CTTTCGTTAG CAATCTCAGC CTCAACATTG CGACCCTGAT CCTCGATCCG GAAACAGGCC AACAACAATT CGCCCGGGTC AAGATTCCAC AAAAAACGAT GCCTCGTTTT GTAGAGATAC CTCCTGATCT AAGTGGCATC AATCCCAAGC CGGTTCACAC AGCAGTTCCA TTGGAGCAAG TGGTGGCCTT CAACCTCAAA TTGCTCTTCC CCGGAATGAA GATTGAAGAG CACTACTTCT TCCGAGTCAC CCGCGATGCC GACCTTGAAC TCAGGGACCT AGAAGCCGAC GATCTCATGA GCGCCATGGA ACAGGGCCTT CACAAGCGGC GGATGGGCGG AGAAGTGGTG AGACTCGAAG TGACAAACGA AATGCCCCAG AGAGTCGTTG AGATGCTGAT TGAAGGTATG GCTGTTGAAG AAAACGACCT TTATCGCATC GAGGGACTAC TAGGCCTCGA TGATCTATTC GGTCTCATGC GTTTGCCCCT GGAGCAACTC AAAGACCAAC CCCACATTGG CCTGACCGCC AAAGTTCTCT CCCGCAGCCA GCGCAGAATG CTTGAAGACG AATCAATCAA AGAAGAAGAA TTCAAAAGCA TCTTCTCGGT GATTCGCCGC AAAGACATTC TCCTTCACCA CCCATACGAA CTGTTCGCGA CCTCTGTAGA GGAATTCATC AATCAAGCAG CAGACGATCC CCTCGTCATG GGAATCAAGA TCACGCTCTA TCGGACATCC AAGGATTCCC CAATCATTGC GGCCCTGATC CGTGCAGCAG AACACGGTAA GCAGGTCATG GCTCTGGTTG AACTCAAGGC ACGCTTCGAT GAAGGCAACA ATATTCAATG GGCCCGTCAT CTAGAGCGAT CGGGCGTTCA CGTTGTCTAT GGCGTTTTAG GACTAAAAAC ACACACAAAA ACCATCTTGG TCGTTCGCAA AGAAAAAGAG CGCCTACGCA GCTACGTGCA CATCGGCACA GGTAACTACA ACTCGAAGAC ATCACGTCTC TACACCGACC TTGGCCTGCT CTCTGCAAGA CCGGAACTCA GCCAAGATCT AGTTGAACTA TTCAATTATC TCACTGGCTT TTCAAAGCAA CAAAGTTTCC GTCGACTGCT GGTGGCGCCT GTCACCCTAC GCAAGGGGAT GGAATCACTG ATCCTCCGCG AAATCGAACA CGCCCGCGAA GGTAGAGGCG GACACATCCG CGCCAAGATG AATGCTCTGG TGGATCCAGC CATCATTAGC CTGCTCTACG AAGCTTCCCA AGTTGGAGTT CGCATTGAAC TGATCATCCG CGGTATGTGC TGTCTCTACC CAGGGCGAAA AGGGTTCAGC 
GAAAACATCA GCGTGATCAG CATCATCGGC CGGTTCCTGG AACACTCCCG TATCTTCTGG TTTGCCAATG ACAACAACCC AGAGGTTTAT ATCGGCAGCG CAGACTTGAT GCCTCGAAAC TTAGACAGAC GCGTGGAAGC CGTTACTCCA ATTGAAGAAC CAGAGCAAAA GGAACACCTA GAGCGACTGC TGAACCTCTA CCTAAACGAC AACCGGGAAG CCTGGGATAT GCAGAGCGAT GGCAGCTTTT TACAGCGCCA ACCCAATCCC AATAGTGAAG AACATCGTGC ACAGCAACAG CTGATCAACC TCTGGCAACA AGGCATCCCT GCAGGGTGA
|
Protein sequence | MSNPAVASEH YINRELSWIA FNERVLAQAL NTRTPLLEQA KFSAIFSNNL DEFFMVRVAS LKAQVEAGIT KTSADGLTPI QQLLTIRDHL VPLIEQQQDH YRKHLKNQLV EQGVHLLDYE QLNQKERLWV DNYFQTAIFP VLTPLAVDQA HPFPFVSNLS LNIATLILDP ETGQQQFARV KIPQKTMPRF VEIPPDLSGI NPKPVHTAVP LEQVVAFNLK LLFPGMKIEE HYFFRVTRDA DLELRDLEAD DLMSAMEQGL HKRRMGGEVV RLEVTNEMPQ RVVEMLIEGM AVEENDLYRI EGLLGLDDLF GLMRLPLEQL KDQPHIGLTA KVLSRSQRRM LEDESIKEEE FKSIFSVIRR KDILLHHPYE LFATSVEEFI NQAADDPLVM GIKITLYRTS KDSPIIAALI RAAEHGKQVM ALVELKARFD EGNNIQWARH LERSGVHVVY GVLGLKTHTK TILVVRKEKE RLRSYVHIGT GNYNSKTSRL YTDLGLLSAR PELSQDLVEL FNYLTGFSKQ QSFRRLLVAP VTLRKGMESL ILREIEHARE GRGGHIRAKM NALVDPAIIS LLYEASQVGV RIELIIRGMC CLYPGRKGFS ENISVISIIG RFLEHSRIFW FANDNNPEVY IGSADLMPRN LDRRVEAVTP IEEPEQKEHL ERLLNLYLND NREAWDMQSD GSFLQRQPNP NSEEHRAQQQ LINLWQQGIP AG
|
| |
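The gene and protein sequences above can be cross-checked by translating the DNA. The Python sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it assumes the sequence contains only A, C, G, and T. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGAGTAATC CAGCGGTAGC TTCGGAGCAC"))  # MSNPAVASEH
print(translate("GCAGGGTGA"))  # AG
```

The two printed fragments match the start (`MSNPAVASEH`) and end (`AG`) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.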