Gene P9301_09471 details


Gene Information

Locus tag: P9301_09471
Symbol: pykF
ID: 4911747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 815554
End bp: 817344
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 34%
IMG OID: 640160530
Product: pyruvate kinase
Protein accession: YP_001091171
Protein GI: 126696285
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0469] Pyruvate kinase
TIGRFAM ID: [TIGR01064] pyruvate kinase
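
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and assuming the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record's Start bp and End bp fields.
start_bp = 815554
end_bp = 817344

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1791
print(protein_length)  # 596
```

Both results agree with the Gene Length and Protein Length fields of the record.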


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAATA TTGATTTAAA AAGAAGAACA AAAATAGTAG CAACTATTGG CCCTGCAACT 
CAATCTGAAG AGATAATTAC AAATTTAATT AAAGCTGGAG TAACAACATT CAGATTAAAT
TTCTCACATG GCGATCATAA AGATCATGCT GATAGAATAA AAACCATAAG GGAAGTATCA
AAAAAGTTAG ATATAGATAT TGGGATATTG CAAGATCTAC AAGGACCTAA AATAAGATTA
GGGCGCTTTA AAGATGGGCC AGTAAAAGTT AAAAAAGGCG ATAAATTCAC ACTTACATCA
AATGAAGTCG AATGTACGAA TACTATTGCA AATGTTACCT ACGAAAAACT TTCTCAAGAA
GTTAGCGAAG GAAAAAGAAT ACTTTTAGAT GATGGAAAAA TAGAAATGAT TGTAGAAAAA
GTTGATACAA AAGCTAATAA TTTGGAGTGC ATGGTAACTG TAGGAGGGGT TCTTTCAAAC
AATAAAGGTG TTAATTTTCC AGATGTTCAA TTATCAGTAA AAGCATTAAC AGAAAAAGAT
AAAAAGGATT TAAAATTTGG ATTATCTGAA GGAGTTGATT GGATCGCACT AAGTTTTGTA
AGAAATCCAT CCGATATAAA TGAGATAAAA GATTTAATAA ATAAAAATGG TCATTCAACT
CCTGTAGTCG CAAAAATAGA AAAATTTGAA GCAATCGATC AGATCGATAC AGTATTACCC
TTATGTGATG GGGTTATGGT TGCAAGAGGT GATTTGGGAG TAGAAATGCC TGCCGAAGAA
GTTCCTCTTT TACAAAAGGA ATTAATAAGA AAAGCTAATT CATTAGGTAT CCCAATAATT
ACAGCGACTC AAATGCTTGA TTCAATGGCT TCTAACCCAA GACCAACCAG GGCCGAAGTT
AGTGATGTTG CAAATGCAAT TCTGGATGGT ACAGATGCAG TAATGCTTTC AAACGAAACT
GCAGTTGGCG ATTATCCTGT GGAGGCAGTT GAAACGATGG CAACTATAGC AAGAAGAATT
GAAAGGGATT ATCCACTTAA GGCTATTGAA AGCCACTTAC CCAGTACGAT CCCAAATGCT
ATTAGCGCAG CAGTAAGCAA TATAGCTAGA CAACTTGAAG CAGGAGCTAT AATCCCTTTA
ACTAAATCAG GTTCTACCGC TCGAAATGTA AGTAAGTTCA GACCACCAAC ACCCATCTTG
GCAACTACTA CAGAAAGAAG TGTAGCGAGA AGATTGCAAC TTGTTTGGGG AGTTACTCCA
ATAGTAGTTA AAAATGATGA AAGAACAGCA AAAACTTTTA GTTTAGCTAT GCAAATTGCT
CAAGAGATGG GGATCCTTAA TCAAGGAGAT TTAGTAGTTC AAACCGCAGG TACATTAACT
GGAATTAGTG GCTCTACAGA TTTAATAAAA GTCGGTTTAG TAAGAAGGAT TGTATCAAGA
GGAATTTCAA TAGGGGAAAT CGGTGTTACA GGTAAAGCAA GAATAATTAA AAATAATCTT
GATATATCTT TAATTTGCCC AGGTGAAATA TTATTTGTTC CGAAGGAATT AATGAAAAAT
ATTCCACTGA GTAAAAATAT TGCAGGCATT GTTACGAACC AAAATGTAAA TGATGTTTAT
GCTTTTTTTA ACAAAAATAA TAAAAAGATT TCTACAATTT GTAATTTAGA AAATATGGAT
AGTCATCAAA TCAGTAATGG AGATCTCATT ACTCTCCAGC TTAATGAAGG TGTCATATAC
ATGGGCCAAA TTGAAGATGA TGATGCAATA GATAAATATA AATATGTCTA G
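
The GC content field can be recomputed from the gene sequence. The helper below is a hypothetical sketch, not the database's own method: it assumes GC content is simply the fraction of G and C bases, and it ignores the spaces and line breaks used in the listing. It is applied here to the first row only, so its result will differ from the whole-gene value reported in the record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied from the listing.
first_row = "ATGTCGAATA TTGATTTAAA AAGAAGAACA AAAATAGTAG CAACTATTGG CCCTGCAACT"
print(round(gc_percent(first_row), 1))
```

Passing the full 1791 bp sequence as one string (with or without the spacing) gives the gene-level figure to compare against the record.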
 
Protein sequence
MSNIDLKRRT KIVATIGPAT QSEEIITNLI KAGVTTFRLN FSHGDHKDHA DRIKTIREVS 
KKLDIDIGIL QDLQGPKIRL GRFKDGPVKV KKGDKFTLTS NEVECTNTIA NVTYEKLSQE
VSEGKRILLD DGKIEMIVEK VDTKANNLEC MVTVGGVLSN NKGVNFPDVQ LSVKALTEKD
KKDLKFGLSE GVDWIALSFV RNPSDINEIK DLINKNGHST PVVAKIEKFE AIDQIDTVLP
LCDGVMVARG DLGVEMPAEE VPLLQKELIR KANSLGIPII TATQMLDSMA SNPRPTRAEV
SDVANAILDG TDAVMLSNET AVGDYPVEAV ETMATIARRI ERDYPLKAIE SHLPSTIPNA
ISAAVSNIAR QLEAGAIIPL TKSGSTARNV SKFRPPTPIL ATTTERSVAR RLQLVWGVTP
IVVKNDERTA KTFSLAMQIA QEMGILNQGD LVVQTAGTLT GISGSTDLIK VGLVRRIVSR
GISIGEIGVT GKARIIKNNL DISLICPGEI LFVPKELMKN IPLSKNIAGI VTNQNVNDVY
AFFNKNNKKI STICNLENMD SHQISNGDLI TLQLNEGVIY MGQIEDDDAI DKYKYV
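
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative. It translates the first three blocks and the last row of the gene sequence.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCGAATA TTGATTTAAA AAGAAGAACA"))  # MSNIDLKRRT

# Last row of the gene sequence, which starts in frame and ends with TAG.
print(translate("ATGGGCCAAA TTGAAGATGA TGATGCAATA GATAAATATA AATATGTCTA G"))
# MGQIEDDDAIDKYKYV
```

Both outputs match the corresponding start and end of the protein sequence listed above.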