Gene Information |
Locus tag | P9301_02151 |
Symbol | pgk |
ID | 4912091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 199705 |
End bp | 200913 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640159781 |
Product | phosphoglycerate kinase |
Protein accession | YP_001090439 |
Protein GI | 126695553 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0126] 3-phosphoglycerate kinase |
TIGRFAM ID | |
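
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 199705
end_bp = 200913
gene_length_bp = 1209
protein_length_aa = 402

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both endpoints.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which is not counted
# in the protein length.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 1209 0 402
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the residue count matches the listed protein length.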
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.772299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAT TATCTCTTTC CAGTCTTGAT AAGACACATT TAGAAGGAAA AAAAGTTCTT GTAAGAGTAG ATTTTAATGT TCCATTAAAT GAAGATGGCC AAATAACCGA CGATACTCGT ATTCGTGCAG CGATCCCAAC TATTGAATAT CTTGTTAATC ATTCCGCAAA AGTTATTTTA GCTGCTCATT TTGGTAGACC AAAGGGTCAG GTAAATGAAA AAATGAGATT AACTCCAGTA GCAGCAAGAT TAAGTGAATT GTTGGGGCAA AGTGTTGCTC TCACTAACAG TTGTATTGGT GATGAAGCAG TTGCAAAATC AAATAGCTTA TCTAATAGAG ATGTTCTTTT ACTTGAAAAT GTTCGTTTCT TTGGTGAAGA GGAAAAGAAC GACCTGGAGT TTGCTCAAAA ATTAGCGTCA CATGCAGATA TGTATGTAAA TGATGCTTTC GGTGCTGCTC ATAGAGCGCA TGCTTCAACT CAGGGTGTTA CAAATTATTT AAGTCCCTCA GTAGCTGGAT TCCTTTTAGA AAAAGAATTG AAATACCTAC AAGGAGCTGT AGATTCCCCA AATCGTCCAT TGGCAGCAAT AGTTGGAGGA TCAAAGGTTA GTAGCAAAAT AGGAGTACTT GATTCTTTAC TAGATAAATG CGACAAAATC ATGATTGGTG GAGGTATGAT TTTTACTTTT TATAAAGCTA GAGGTTTAGA TGTAGGAAAG AGCCTAGTAG AGGAAGATAA ACTCGAGCTT GCGAAAGATT TAGAGGCAAA AGCAAAAGCA AAAGGAGTCG AATTATTATT ACCTACTGAT GTCGTTTTAG CTAATGAATT TTCGCCTGAT GCCGAAAGTA AAATATCTCA AATTGATTCA ATTAGTGGCA ATTGGATGGG TCTTGATATT GGTCCAGATT CCATTAAAGT TTTTCAGAAT GCTCTTGCTG AATGTAAGAC AATTATTTGG AATGGCCCAA TGGGAGTTTT TGAATTTGAT AAATTTGCAG ACGGTACAAA TGCAATAGCT ACGACTCTTG CGGACTTAAG TGCTTTTTCT GAGGTATGTA CAATAATTGG TGGTGGAGAT TCAGTTGCAG CAGTTGAAAA AGCAGGATTA GCAGAGAAAA TGTCTCATAT ATCTACTGGA GGTGGAGCTA GTTTAGAACT TCTAGAAGGT AAAATTCTTC CTGGTGTAGC TGCTTTAAAC GACGCTTAA
Protein sequence | MSKLSLSSLD KTHLEGKKVL VRVDFNVPLN EDGQITDDTR IRAAIPTIEY LVNHSAKVIL AAHFGRPKGQ VNEKMRLTPV AARLSELLGQ SVALTNSCIG DEAVAKSNSL SNRDVLLLEN VRFFGEEEKN DLEFAQKLAS HADMYVNDAF GAAHRAHAST QGVTNYLSPS VAGFLLEKEL KYLQGAVDSP NRPLAAIVGG SKVSSKIGVL DSLLDKCDKI MIGGGMIFTF YKARGLDVGK SLVEEDKLEL AKDLEAKAKA KGVELLLPTD VVLANEFSPD AESKISQIDS ISGNWMGLDI GPDSIKVFQN ALAECKTIIW NGPMGVFEFD KFADGTNAIA TTLADLSAFS EVCTIIGGGD SVAAVEKAGL AEKMSHISTG GGASLELLEG KILPGVAALN DA
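
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotides. The sketch below is illustrative only: it uses just the first 30 and last 9 bases of the gene sequence (copied from the record) to stay short, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. To reproduce the GC content figure, paste the full gene sequence into `gc_fraction`.

```python
# Excerpts copied from the Gene sequence field: the first 30 bases
# (with the record's 10-base grouping) and the last 9 bases.
HEAD = "ATGTCAAAAT TATCTCTTTC CAGTCTTGAT"
TAIL = "GACGCTTAA"

# Codon table in the conventional TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove whitespace (e.g. 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


print(translate(HEAD))  # MSKLSLSSLD
print(translate(TAIL))  # DA*
```

The translated excerpts agree with the first ten residues and the last two residues of the protein sequence, followed by the stop codon.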