Gene P9303_29941 details

Gene Information

Locus tag: P9303_29941
Symbol: ppk
ID: 4776344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2643852
End bp: 2645990
Gene Length: 2139 bp
Protein Length: 712 aa
Translation table: 11
GC content: 52%
IMG OID: 640088518
Product: polyphosphate kinase
Protein accession: YP_001018989
Protein GI: 124024682
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0855] Polyphosphate kinase
TIGRFAM ID:
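
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2643852
END_BP = 2645990


def gene_length(start_bp, end_bp):
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2139 712
```

Both results agree with the Gene Length (2139 bp) and Protein Length (712 aa) fields.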


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.500566
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAATC CAGCGGTAGC TTCGGAGCAC TACATCAACC GCGAGCTCAG CTGGATCGCC 
TTCAACGAAA GGGTCCTTGC TCAGGCACTG AATACGCGCA CCCCGCTGCT GGAACAAGCC
AAGTTCAGTG CCATCTTCAG CAACAACCTC GACGAATTCT TCATGGTTCG CGTGGCCTCC
CTCAAAGCGC AAGTGGAAGC CGGCATTACC AAAACCAGCG CAGATGGTCT AACCCCTATT
CAGCAGCTCC TCACGATCCG AGATCACCTT GTTCCTCTCA TCGAGCAACA ACAAGATCAC
TACCGTAAAC ACCTCAAAAA CCAGTTAGTC GAGCAAGGTG TTCACCTACT CGACTATGAG
CAACTCAATC AAAAAGAACG CCTCTGGGTT GACAATTATT TCCAGACAGC CATTTTCCCG
GTGCTGACAC CACTAGCCGT GGATCAAGCC CATCCCTTCC CTTTCGTTAG CAATCTCAGC
CTCAACATTG CGACCCTGAT CCTCGATCCG GAAACAGGCC AACAACAATT CGCCCGGGTC
AAGATTCCAC AAAAAACGAT GCCTCGTTTT GTAGAGATAC CTCCTGATCT AAGTGGCATC
AATCCCAAGC CGGTTCACAC AGCAGTTCCA TTGGAGCAAG TGGTGGCCTT CAACCTCAAA
TTGCTCTTCC CCGGAATGAA GATTGAAGAG CACTACTTCT TCCGAGTCAC CCGCGATGCC
GACCTTGAAC TCAGGGACCT AGAAGCCGAC GATCTCATGA GCGCCATGGA ACAGGGCCTT
CACAAGCGGC GGATGGGCGG AGAAGTGGTG AGACTCGAAG TGACAAACGA AATGCCCCAG
AGAGTCGTTG AGATGCTGAT TGAAGGTATG GCTGTTGAAG AAAACGACCT TTATCGCATC
GAGGGACTAC TAGGCCTCGA TGATCTATTC GGTCTCATGC GTTTGCCCCT GGAGCAACTC
AAAGACCAAC CCCACATTGG CCTGACCGCC AAAGTTCTCT CCCGCAGCCA GCGCAGAATG
CTTGAAGACG AATCAATCAA AGAAGAAGAA TTCAAAAGCA TCTTCTCGGT GATTCGCCGC
AAAGACATTC TCCTTCACCA CCCATACGAA CTGTTCGCGA CCTCTGTAGA GGAATTCATC
AATCAAGCAG CAGACGATCC CCTCGTCATG GGAATCAAGA TCACGCTCTA TCGGACATCC
AAGGATTCCC CAATCATTGC GGCCCTGATC CGTGCAGCAG AACACGGTAA GCAGGTCATG
GCTCTGGTTG AACTCAAGGC ACGCTTCGAT GAAGGCAACA ATATTCAATG GGCCCGTCAT
CTAGAGCGAT CGGGCGTTCA CGTTGTCTAT GGCGTTTTAG GACTAAAAAC ACACACAAAA
ACCATCTTGG TCGTTCGCAA AGAAAAAGAG CGCCTACGCA GCTACGTGCA CATCGGCACA
GGTAACTACA ACTCGAAGAC ATCACGTCTC TACACCGACC TTGGCCTGCT CTCTGCAAGA
CCGGAACTCA GCCAAGATCT AGTTGAACTA TTCAATTATC TCACTGGCTT TTCAAAGCAA
CAAAGTTTCC GTCGACTGCT GGTGGCGCCT GTCACCCTAC GCAAGGGGAT GGAATCACTG
ATCCTCCGCG AAATCGAACA CGCCCGCGAA GGTAGAGGCG GACACATCCG CGCCAAGATG
AATGCTCTGG TGGATCCAGC CATCATTAGC CTGCTCTACG AAGCTTCCCA AGTTGGAGTT
CGCATTGAAC TGATCATCCG CGGTATGTGC TGTCTCTACC CAGGGCGAAA AGGGTTCAGC
GAAAACATCA GCGTGATCAG CATCATCGGC CGGTTCCTGG AACACTCCCG TATCTTCTGG
TTTGCCAATG ACAACAACCC AGAGGTTTAT ATCGGCAGCG CAGACTTGAT GCCTCGAAAC
TTAGACAGAC GCGTGGAAGC CGTTACTCCA ATTGAAGAAC CAGAGCAAAA GGAACACCTA
GAGCGACTGC TGAACCTCTA CCTAAACGAC AACCGGGAAG CCTGGGATAT GCAGAGCGAT
GGCAGCTTTT TACAGCGCCA ACCCAATCCC AATAGTGAAG AACATCGTGC ACAGCAACAG
CTGATCAACC TCTGGCAACA AGGCATCCCT GCAGGGTGA
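
The record reports a GC content of 52% for this gene. As a rough sketch of how such a figure can be computed from a sequence laid out in space-separated 10-base blocks like the one above, the helper below strips whitespace and counts G and C. The function name is hypothetical, and the example uses only the first row of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_percent(sequence):
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied from this record.
FIRST_ROW = "ATGAGTAATC CAGCGGTAGC TTCGGAGCAC TACATCAACC GCGAGCTCAG CTGGATCGCC"
print(round(gc_percent(FIRST_ROW), 1))
```

Passing the full 2139-base sequence to the same function should give a value comparable to the GC content field, subject to how the source database rounds it.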
 
Protein sequence
MSNPAVASEH YINRELSWIA FNERVLAQAL NTRTPLLEQA KFSAIFSNNL DEFFMVRVAS 
LKAQVEAGIT KTSADGLTPI QQLLTIRDHL VPLIEQQQDH YRKHLKNQLV EQGVHLLDYE
QLNQKERLWV DNYFQTAIFP VLTPLAVDQA HPFPFVSNLS LNIATLILDP ETGQQQFARV
KIPQKTMPRF VEIPPDLSGI NPKPVHTAVP LEQVVAFNLK LLFPGMKIEE HYFFRVTRDA
DLELRDLEAD DLMSAMEQGL HKRRMGGEVV RLEVTNEMPQ RVVEMLIEGM AVEENDLYRI
EGLLGLDDLF GLMRLPLEQL KDQPHIGLTA KVLSRSQRRM LEDESIKEEE FKSIFSVIRR
KDILLHHPYE LFATSVEEFI NQAADDPLVM GIKITLYRTS KDSPIIAALI RAAEHGKQVM
ALVELKARFD EGNNIQWARH LERSGVHVVY GVLGLKTHTK TILVVRKEKE RLRSYVHIGT
GNYNSKTSRL YTDLGLLSAR PELSQDLVEL FNYLTGFSKQ QSFRRLLVAP VTLRKGMESL
ILREIEHARE GRGGHIRAKM NALVDPAIIS LLYEASQVGV RIELIIRGMC CLYPGRKGFS
ENISVISIIG RFLEHSRIFW FANDNNPEVY IGSADLMPRN LDRRVEAVTP IEEPEQKEHL
ERLLNLYLND NREAWDMQSD GSFLQRQPNP NSEEHRAQQQ LINLWQQGIP AG
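
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship with a hand-built codon table; it relies on the fact that table 11 assigns sense codons the same amino acids as the standard code (the two differ in permitted start codons), and it treats the first codon like any other, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and final row of the gene sequence, copied from this record.
print(translate("ATGAGTAATC CAGCGGTAGC TTCGGAGCAC"))  # prints: MSNPAVASEH
print(translate("CTGATCAACC TCTGGCAACA AGGCATCCCT GCAGGGTGA"))  # prints: LINLWQQGIPAG
```

The two outputs match the first ten and last twelve residues of the protein sequence listed above.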