Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_18591 |
Symbol | glk |
ID | 4776090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1621167 |
End bp | 1622228 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087368 |
Product | putative glucokinase |
Protein accession | YP_001017866 |
Protein GI | 124023559 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0837] Glucokinase |
TIGRFAM ID | [TIGR00749] glucokinase, proteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.620136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTCCC TGAGAACCTT TCTGGCAGGT GATTTGGGGG GCACCAAAAC CTTGCTTGCG CTCTATAGCT GGGACGAAAA GCAACTCAAG CAGCAGCACC GGCGGAGGTA TCTATCCAAT CAGTGGACTT CGCTTGAACC CATGCTGAGC GACTTCATCG CCCATTTACC AGGGGAGATG GAGCAACCCA ATAACGGCTG CATCGCTGTT GCAGGACCGG TTCGTCATGG TGAGGCACGT ATCACCAACC TGCCCTGGAG CCTGAAGGAG AAGGACCTTT GCGCAGCCAC GGGACTGAAG CATTTGGAAC TGATTAATGA CTTTGGCGTG CTGATTTACG GCCTACCCTT TCTCAACGAC TCGCAGCAGG TGGAGCTTCA GCTTCCACAG CAGCATTTGT CTGGGCAAGG ACCAATTGCA GTTCTGGGGG CAGGTACTGG ACTTGGAATG GCCCGTGGCC TGCCCACAAA AGATGGGATG GTGGCGCTGC CCAGTGAAGG CGGACACCGT GAATTTGCTC CTCGCAGTGA ATGTGAGTGG CAACTTTGTG AGTGGCTCAA GGCCGACTTG CAACTCGAAC GCCTTTCATT GGAACGTGTT GTCAGTGGCA CCGGCCTGGG TCATGTGGCG CGCTGGCGGC TACAGCACAG TGACGCAGAT GGTCATCCTC TGCGAGATCT GGCCGATGCT TGGCGCCATG GTGCTGATGA TCATTCCGAC CACTTGGATC TGCCTGCTCT TGCTAGTCAA GCCGCAAGCG AAGGCGATTC AATTCTTCAA GAAGCCTTAC AGCTCTGGCT AGCTGCTTAC GGTTCTGCTG CGGGAGATTT AGCTCTGCAG GAACTCTGTG TCGGAGGCCT CTGGGTGGGT GGGGGTACCG CTGCGAAGCA GCTTCAAGGT CTTCGCTCAA GCACCTTTCT TGAAGCCTTC CGCAACAAGG GTCGCTTCCG TCCGTTTCTA GAGCAATTGC CAGTGATGGC AGTGATCGAT CCCGAGGTGG GCCTGTTCAG TGCAGCCTGC AGAGCACACA TGCTTGCTGA GCAAGGTGGG ACACTGACCT AA
|
Protein sequence | MPSLRTFLAG DLGGTKTLLA LYSWDEKQLK QQHRRRYLSN QWTSLEPMLS DFIAHLPGEM EQPNNGCIAV AGPVRHGEAR ITNLPWSLKE KDLCAATGLK HLELINDFGV LIYGLPFLND SQQVELQLPQ QHLSGQGPIA VLGAGTGLGM ARGLPTKDGM VALPSEGGHR EFAPRSECEW QLCEWLKADL QLERLSLERV VSGTGLGHVA RWRLQHSDAD GHPLRDLADA WRHGADDHSD HLDLPALASQ AASEGDSILQ EALQLWLAAY GSAAGDLALQ ELCVGGLWVG GGTAAKQLQG LRSSTFLEAF RNKGRFRPFL EQLPVMAVID PEVGLFSAAC RAHMLAEQGG TLT
|
| |