Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_06521 |
Symbol | glk |
ID | 4717354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 575153 |
End bp | 576187 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640078365 |
Product | putative glucokinase |
Protein accession | YP_001009045 |
Protein GI | 123968187 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0837] Glucokinase |
TIGRFAM ID | [TIGR00749] glucokinase, proteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTC TCGCTTGTGA TTTAGGAGGT ACAAAGGTTC TATTAGGAAT TTTCAAAAAA GGAACAAATA ATAATTCGCC TAAGTTAATA TTTAAAAAGA AATATATATC GTCTGATTGG GATTCTTTTG AACTAATCCT AGAAGATTTT ATCAAAAAAG AATGTAAGAA TATTACTCAT CCTTCTTCTG CATGTTTCGC TGTAGCTGGT CCTTTATCTA AAAACAACGC AAAAATCGTT AACTTGTCAT GGAATATTTC TGGAAATGAT TTACAGAACA AATTTAATTT TAAAAACTGC GAGCTAATAA ATGATTTCGC TGTACAAATT TATGGAATAC CTTTTTTAAA AAAAAATCAA TATTCTACTA TCCAAAATGG ATCCAATTCT GAAAATACTA ATAATGATTT GCATGCCATT GTTGGAGCGG GGACTGGCTT GGGGATTGCA AGAGGAATAA TATCAGGGGA AAAGGTAAAA GTTTTAGCTA GTGAAGGTGG TCATGTAGAG TACTCCCCAA AGTCAAAATT AGAATGGGAT TTGAAAATTT GGCTTAAGAA TTACCTAAAA GTTGAAAGGA TATCTTGTGA AAGGATTATT AGCGGCACTG GTTTATCAAG AATTGCCGAA TGGAGACTAA GCAAACCTGA TGCCCAAAAC CATCCTTTAC AAAAATATTT AAAAAAAATT AAAATTTTTG ATGCAGCGAG AAAAGAACTA CCTGAAAAAA TTTGTAATCT TTCTAAAGAA GGTGATCAGG TAATGATTGA AGTTGAGAGG ATTTGGTTAG GTGCTTATGC CTCTTTATTG GGAGATGTTG CTCTTCAAGA ATTGTGCTTT GGTGGATTAT GGATTTCTGG AGGAACAGCG TCAAAACATT TCAAAAACTT TAAATCAGAC TTATTTTTAA AACAATTTTT CGACAAGGGA AGATTAAAAG ATATTCTTAA AACAATACCT ATAAAAGTAA TTTTAGATGA AGAGTTTGGA CTTTTTAGTG CAGCCTGCAG AGCAAAAATG CTTTTAAAAA CTTAA
|
Protein sequence | MNFLACDLGG TKVLLGIFKK GTNNNSPKLI FKKKYISSDW DSFELILEDF IKKECKNITH PSSACFAVAG PLSKNNAKIV NLSWNISGND LQNKFNFKNC ELINDFAVQI YGIPFLKKNQ YSTIQNGSNS ENTNNDLHAI VGAGTGLGIA RGIISGEKVK VLASEGGHVE YSPKSKLEWD LKIWLKNYLK VERISCERII SGTGLSRIAE WRLSKPDAQN HPLQKYLKKI KIFDAARKEL PEKICNLSKE GDQVMIEVER IWLGAYASLL GDVALQELCF GGLWISGGTA SKHFKNFKSD LFLKQFFDKG RLKDILKTIP IKVILDEEFG LFSAACRAKM LLKT
|
| |