Gene A9601_19061 details

Gene Information

Locus tag: A9601_19061
Symbol: ppk
ID: 4718645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1642332
End bp: 1644410
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 29%
IMG OID: 640079641
Product: polyphosphate kinase
Protein accession: YP_001010296
Protein GI: 123969438
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0855] Polyphosphate kinase
TIGRFAM ID:
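
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1642332
end_bp = 1644410
gene_length_bp = 2079
protein_length_aa = 692


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2079
print(expected_protein_length(gene_length_bp))  # 692
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption.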


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.547171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGCC AGGCTGATGT TTTTATTAAT AGAGAATTAA GTTGGATTGA ATTTAATAAG 
AGAGTTCTCC TTACTGGAAT GGAAAAGGAG TACAAAATCC TAGACAAAGT AAAATTTTGT
TCAATTTTTA GTAATAACCT AGATGAATTT TTTATGGTAA GAGTAGCTTC ATTAAAGGCT
CAAGTTGAAG CAGAAATTAC TAAAAAAAGT ATTGACGGAC TTACCCCTAA AGAGCAATTA
AAAAAAATCA ATAATGAAAT AAAGAAGTTA ACTATTCTCC AAGAAAACTA TGTAAATAAT
GAATTAAAAA ATGAATTAAA AGAAAAAGGG GTAATTTTAA AAAAATATAA GGAACTAAGT
GATAATCAAA GAAATTGGTG TAATAACTTC TTTACAACAT CTATTTTTCC TTTATTAACT
CCATTAGTTG TTGATCCGGC ACATCCATTT CCTTTTATAA GTAATTTAAG TCTAAATTTA
GCAGCTTTAA TGAAGGATGA GGAGAATTCT AAAAATCAGT TTGTCAGAGT AAAAATACCA
ACAAAAAATA TACCCCGATT TATAAGAATT CCCAATGAAA TTACTCAACT TAGTGATGAA
AGTTCTCACT ATTTCATAAG TGTTGAAGAT TTAATTGGGA ATAATATAAA TACTTTATTT
AACGGAATGG AATGTATAAA TTACTCTTTT TTTAGAGTGA CAAGAGATGC AGATTTAGAA
TTAAAAGAAC TTGAAGCTGA TGATCTACTT TTAGCTGTTG AACAAAGTTT GCAAAAAAGA
AGATTAGGTG GAGACGTAGT TAGATTAGAA GTGGAGTCAG ATATGCCAGA AAATATTCTA
AAGTTACTTA TTGAAAGTAT CTCAATACAG AAAGAATATA TATACTTTTG CAAAAGTTTA
TTAGGCCTAG ACGATTTAAA TCAGCTTACA AAAATTGATA GAGATGATTT AAAAGAAAAT
CTACTAATTG GAAAAACTCA CCCAGAATTA AAACATTTAG ATTTGCCTTC AAACAAAAAC
CCTAATTCTA TTTTCAAGAT ACTTAGGAAA AAAAATATTC TGCTTCATCA TCCCTATGAC
CTATTTAAAA CTTCAGTTGA AGAATTTATA AACAGAGCAG CTGATGATCC ACTTGTAATG
GCTATAAAAA TTACTTTATA TCGAGTTTCC CAAGATTCGC CTATAATTGC AGCTTTAATG
AGAGCTGCAG AGAATGGTAA AGAAGTAATG ACTCTTGTTG AACTAAAAGC AAGATTTGAT
GAAGACAATA ATATTCAATG GGCCAAACAA CTTGAACAAG CTGGCATTCA TGTTGTATAT
GGAATCATCG GATTTAAAAC ACATACAAAA ATTGCCTTAA CAGTTAGAAA AGAGAAAGGA
CGATTAAGAA ATTATTTTCA TATTGGAACA GGAAATTATA ACTCTAATAC TTCAAAGTTT
TATACAGATT TAGGATTACT TTCAACGGAT CCTGAAATTG CTTCAGATTT ACTTGAGTTA
TTTAACTACT TATCTGGTTT CTCTAAACAA AAAAGTTATC AAAAGTTATT AGTTTCTCCC
TCATCGATGC GAGAGAAATT TATATTTCTG ATAAAGAGAG AAATTAAAAA TGCAGAGGAA
GGCAAAAAAG CCGAAATAAT CGCAAAAATG AATTCTTTAG TAGACCCAGA AATAATTAAA
CTGCTTTATT TAGCTTCAGA CTCAGGTGTA AAAATTAGCC TCATCATAAG AGGTATTTGT
TGCCTATATC CCCAAAGAAA AAATTTAAGT GAAAATATTA AAGTTATAAG CATTATTGGC
CATTTTCTTG AACACTCAAG AATTTTTTGG TTTTGTAATA ACGGGGATAA TGAGGTTTTT
ATAGGGAGTG CAGATTGGAT GAGAAGAAAT CTTGATAGAA GAATAGAAGC TGTTACGCCT
ATAGAGGATT ATGAATTGAA ATCTAAAATA TACACGCTTT TGCAAACTTA CATTAACGAT
AATTACTTTT CTTGGATAAT GAAAGATGAT GGTTCATATT CGAAATATGA ATTAGATTCA
TCGCATAATC GTTCGCAAAT TGACCTCATA GAAAAATAA
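
The gene sequence is shown in space-separated blocks of 10 bases, so the whitespace has to be stripped before any analysis. As a minimal sketch, the hypothetical helper `gc_percent` below computes GC content as the share of G and C bases in the cleaned sequence; how the database computed its reported 29% is not stated in this record, so this is one plausible definition rather than the site's method.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; the full gene is needed
# to compare against the GC content field of the record.
first_line = "ATGAAACGCC AGGCTGATGT TTTTATTAAT AGAGAATTAA GTTGGATTGA ATTTAATAAG"
print(len(clean_sequence(first_line)))
print(gc_percent(first_line))
```

Passing all of the gene sequence lines as one string gives the whole-gene value to compare against the GC content field.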
 
Protein sequence
MKRQADVFIN RELSWIEFNK RVLLTGMEKE YKILDKVKFC SIFSNNLDEF FMVRVASLKA 
QVEAEITKKS IDGLTPKEQL KKINNEIKKL TILQENYVNN ELKNELKEKG VILKKYKELS
DNQRNWCNNF FTTSIFPLLT PLVVDPAHPF PFISNLSLNL AALMKDEENS KNQFVRVKIP
TKNIPRFIRI PNEITQLSDE SSHYFISVED LIGNNINTLF NGMECINYSF FRVTRDADLE
LKELEADDLL LAVEQSLQKR RLGGDVVRLE VESDMPENIL KLLIESISIQ KEYIYFCKSL
LGLDDLNQLT KIDRDDLKEN LLIGKTHPEL KHLDLPSNKN PNSIFKILRK KNILLHHPYD
LFKTSVEEFI NRAADDPLVM AIKITLYRVS QDSPIIAALM RAAENGKEVM TLVELKARFD
EDNNIQWAKQ LEQAGIHVVY GIIGFKTHTK IALTVRKEKG RLRNYFHIGT GNYNSNTSKF
YTDLGLLSTD PEIASDLLEL FNYLSGFSKQ KSYQKLLVSP SSMREKFIFL IKREIKNAEE
GKKAEIIAKM NSLVDPEIIK LLYLASDSGV KISLIIRGIC CLYPQRKNLS ENIKVISIIG
HFLEHSRIFW FCNNGDNEVF IGSADWMRRN LDRRIEAVTP IEDYELKSKI YTLLQTYIND
NYFSWIMKDD GSYSKYELDS SHNRSQIDLI EK
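
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is dependency-free and uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The function name `translate` and the use of `X` for unrecognized codons are choices made for this example.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
first_line = "ATGAAACGCC AGGCTGATGT TTTTATTAAT AGAGAATTAA GTTGGATTGA ATTTAATAAG"
last_line = "TCGCATAATC GTTCGCAAAT TGACCTCATA GAAAAATAA"
print(translate(first_line))  # MKRQADVFINRELSWIEFNK
print(translate(last_line))   # SHNRSQIDLIEK
```

The two outputs match the first 20 and last 12 residues of the protein sequence. The last line can be translated on its own because the 34 preceding lines hold 60 bases each, so it starts on a codon boundary.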