Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_02131 |
Symbol | pgk |
ID | 4716897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 198727 |
End bp | 199935 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640077912 |
Product | phosphoglycerate kinase |
Protein accession | YP_001008608 |
Protein GI | 123967750 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0126] 3-phosphoglycerate kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0158378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAT TATCTCTTTC CAGTCTTGAT AAGACACATT TAGAAGGAAA AAAAGTTCTT GTAAGAGTAG ATTTTAATGT TCCATTAAAT GAAGACGGTC AAATAACCGA CGATACGCGT ATTCGTGCAG CGATCCCAAC TATTGAATAT CTTATTAATC ATTCTGCAAA AGTCATTTTA GCTGCTCATT TTGGTAGACC GAAGGGTCAG GTAAATGAAA AAATGAGATT AACTCCAGTA GCAGCAAGAT TAAGTGAATT GTTGGGGCAA AATGTTGCTC TTACTAACAG TTGTATTGGT GATGAAGCAG TTGCACAATC AAATAGCTTA TCTAATGGAG ATGTTCTTTT ACTTGAGAAT GTTCGTTTTT TTGGTGAAGA GGAGAAGAAC GACCTGGAGT TTGCTGAAAA ATTAGCATCA CATGCAGATA TGTATGTAAA TGATGCTTTC GGTGCTGCTC ATAGAGCGCA TGCTTCAACT CAGGGTGTTA CAAATTATTT AAGTCCCTCA GTAGCTGGAT TCCTTTTAGA AAAAGAATTG AAATACCTAC AAGGAGCAGT AGATTCCCCA AATCGTCCAT TGGCAGCAAT AGTTGGAGGG TCAAAGGTTA GTAGCAAAAT AGGAGTACTT GATTCTTTAC TAGATAAGTG TGACAAAATC ATGATTGGTG GAGGTATGAT TTTCACTTTT TATAAAGCTA GAGGTTTAGA TGTCGGAAAG AGCCTTGTAG AAGAAGATAA ACTCGAGCTT GCTAAAGATT TAGAAGCAAA AGCAAAAGCA AAAGGAGTAG AGTTATTATT ACCCACTGAT GTTGTTTTGG CTGATGAATT TTCTCCTGAC GCCAATAGTA AAATATCTCA AATTGATGCA ATTAGTGGGA ATTGGATGGG TCTAGATATT GGTCCAGATT CTATTAAAGT TTTTCAGAAT GCTCTTGCAG AATGTAAGAC AATTATTTGG AATGGTCCAA TGGGAGTTTT TGAATTTGAT AAATTTGCAG ATGGTACAAA TGCAATAGCT ACGACTCTTG CGGACTTAAG TGCTTTTTCT GAAGTTTGTA CAATAATTGG TGGTGGAGAT TCAGTTGCAG CAGTTGAAAA AGCAGGATTA GCTGAGAAAA TGTCTCATAT ATCTACCGGA GGTGGGGCTA GTTTGGAACT TTTAGAAGGT AAAACTTTAC CAGGTGTGGC TGCGTTAAAC GACGCTTAG
|
Protein sequence | MSKLSLSSLD KTHLEGKKVL VRVDFNVPLN EDGQITDDTR IRAAIPTIEY LINHSAKVIL AAHFGRPKGQ VNEKMRLTPV AARLSELLGQ NVALTNSCIG DEAVAQSNSL SNGDVLLLEN VRFFGEEEKN DLEFAEKLAS HADMYVNDAF GAAHRAHAST QGVTNYLSPS VAGFLLEKEL KYLQGAVDSP NRPLAAIVGG SKVSSKIGVL DSLLDKCDKI MIGGGMIFTF YKARGLDVGK SLVEEDKLEL AKDLEAKAKA KGVELLLPTD VVLADEFSPD ANSKISQIDA ISGNWMGLDI GPDSIKVFQN ALAECKTIIW NGPMGVFEFD KFADGTNAIA TTLADLSAFS EVCTIIGGGD SVAAVEKAGL AEKMSHISTG GGASLELLEG KTLPGVAALN DA
|
| |