Gene P9303_18591 details

Gene Information

Locus tag: P9303_18591
Symbol: glk
ID: 4776090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1621167
End bp: 1622228
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 56%
IMG OID: 640087368
Product: putative glucokinase
Protein accession: YP_001017866
Protein GI: 124023559
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0837] Glucokinase
TIGRFAM ID: [TIGR00749] glucokinase, proteobacterial type
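
The start, end, gene length, and protein length fields can be checked against one another. The sketch below is only a consistency check, not part of the record. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 1621167
end_bp = 1622228

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon excluded.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1062 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields.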


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.620136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTCCC TGAGAACCTT TCTGGCAGGT GATTTGGGGG GCACCAAAAC CTTGCTTGCG 
CTCTATAGCT GGGACGAAAA GCAACTCAAG CAGCAGCACC GGCGGAGGTA TCTATCCAAT
CAGTGGACTT CGCTTGAACC CATGCTGAGC GACTTCATCG CCCATTTACC AGGGGAGATG
GAGCAACCCA ATAACGGCTG CATCGCTGTT GCAGGACCGG TTCGTCATGG TGAGGCACGT
ATCACCAACC TGCCCTGGAG CCTGAAGGAG AAGGACCTTT GCGCAGCCAC GGGACTGAAG
CATTTGGAAC TGATTAATGA CTTTGGCGTG CTGATTTACG GCCTACCCTT TCTCAACGAC
TCGCAGCAGG TGGAGCTTCA GCTTCCACAG CAGCATTTGT CTGGGCAAGG ACCAATTGCA
GTTCTGGGGG CAGGTACTGG ACTTGGAATG GCCCGTGGCC TGCCCACAAA AGATGGGATG
GTGGCGCTGC CCAGTGAAGG CGGACACCGT GAATTTGCTC CTCGCAGTGA ATGTGAGTGG
CAACTTTGTG AGTGGCTCAA GGCCGACTTG CAACTCGAAC GCCTTTCATT GGAACGTGTT
GTCAGTGGCA CCGGCCTGGG TCATGTGGCG CGCTGGCGGC TACAGCACAG TGACGCAGAT
GGTCATCCTC TGCGAGATCT GGCCGATGCT TGGCGCCATG GTGCTGATGA TCATTCCGAC
CACTTGGATC TGCCTGCTCT TGCTAGTCAA GCCGCAAGCG AAGGCGATTC AATTCTTCAA
GAAGCCTTAC AGCTCTGGCT AGCTGCTTAC GGTTCTGCTG CGGGAGATTT AGCTCTGCAG
GAACTCTGTG TCGGAGGCCT CTGGGTGGGT GGGGGTACCG CTGCGAAGCA GCTTCAAGGT
CTTCGCTCAA GCACCTTTCT TGAAGCCTTC CGCAACAAGG GTCGCTTCCG TCCGTTTCTA
GAGCAATTGC CAGTGATGGC AGTGATCGAT CCCGAGGTGG GCCTGTTCAG TGCAGCCTGC
AGAGCACACA TGCTTGCTGA GCAAGGTGGG ACACTGACCT AA
 
Protein sequence
MPSLRTFLAG DLGGTKTLLA LYSWDEKQLK QQHRRRYLSN QWTSLEPMLS DFIAHLPGEM 
EQPNNGCIAV AGPVRHGEAR ITNLPWSLKE KDLCAATGLK HLELINDFGV LIYGLPFLND
SQQVELQLPQ QHLSGQGPIA VLGAGTGLGM ARGLPTKDGM VALPSEGGHR EFAPRSECEW
QLCEWLKADL QLERLSLERV VSGTGLGHVA RWRLQHSDAD GHPLRDLADA WRHGADDHSD
HLDLPALASQ AASEGDSILQ EALQLWLAAY GSAAGDLALQ ELCVGGLWVG GGTAAKQLQG
LRSSTFLEAF RNKGRFRPFL EQLPVMAVID PEVGLFSAAC RAHMLAEQGG TLT