Gene NATL1_06521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06521 
Symbolglk 
ID4779656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp597149 
End bp598192 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content35% 
IMG OID640083930 
Productputative glucokinase 
Protein accessionYP_001014479 
Protein GI124025363 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0837] Glucokinase 
TIGRFAM ID[TIGR00749] glucokinase, proteobacterial type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTAC TTGCTGGAGA CCTTGGGGGA ACTAAAACAA TATTAGCTAT TTATTCAAAC 
GAGAACTATC CAAAAAAAAT ATTTGAGAGG TACTATATTT CATCAGAATG GAAATCTTTT
TACTCATTAT TTGAAGATTT TATTAAACAT TTACCAGATC ATATATCACT GCCTCAATAT
GGTTCTATTG GTGTAGCCGG GCCAATACAG AATCAGGAGG TTAAGATTAC AAATCTTGGC
TGGGATATTG AATCAAAAAA GTTATCTCTA CTTTCAAAAA TAAATAATAT TGAATTAATA
AATGATTTTT CAGTTTTAAT CTATGGAATA CCATTCTTCA ACAGAAACCA ATATGAAGTA
ATACAAGGGA CATTAAATTC TGATTACAAA AACGATCAAA AATTAATTGC AATTATTGGA
GCTGGTACTG GCTTAGGAAT GTCCAGAGGC TTGATAACCC CTAAAAGCAT TTCTATATTT
CCAAGTGAAG GAGGGCATCG AGAATTTTCC CCAAGAACAG AAAACGAATG GGCATTAGTC
AAATGGCTAA AAAAGAAGTT AAATATTCAA AGAATATCCA TTGAAAGAAT TGTTAGTGGT
ACTGGCCTTG GCATGATTGC CAGATGGAAA TTGGATGATC CAATAAATGA AAGCCATCCA
CTTCAGGTAA TTTTAAAAAA TATGGATAGT GACAAATCAG ATTCCACAGA TTTACCCGCA
CTTGTTTGGG AAAAAGCAAA AAACGGAGAC AAATTAATGA CTGAAGCATT GCAACTATGG
CTAAATGCTT ATGGGTCTGC AGCTGGAGAC CTTGCTTTAC AAGAACTTTG CTCTTCAGGG
TTATGGATTT CAGGTGGAAC AGCCGCAAAA AACCTCGATG GAATAAACTC TTCTAACTTC
CTAAATGCAT TTAGTAATAA GGGTCGCTTT CAATCTTATT TAAAGGAAAT CCCATTGATT
GTTCTTAAAG ATCCAGAAGC GACATTATTC AGTTCAGCTT GCAGAGCACG CTTAAGTGCC
GAATCAAATG GGAGACTTAG CTAA
 
Protein sequence
MNLLAGDLGG TKTILAIYSN ENYPKKIFER YYISSEWKSF YSLFEDFIKH LPDHISLPQY 
GSIGVAGPIQ NQEVKITNLG WDIESKKLSL LSKINNIELI NDFSVLIYGI PFFNRNQYEV
IQGTLNSDYK NDQKLIAIIG AGTGLGMSRG LITPKSISIF PSEGGHREFS PRTENEWALV
KWLKKKLNIQ RISIERIVSG TGLGMIARWK LDDPINESHP LQVILKNMDS DKSDSTDLPA
LVWEKAKNGD KLMTEALQLW LNAYGSAAGD LALQELCSSG LWISGGTAAK NLDGINSSNF
LNAFSNKGRF QSYLKEIPLI VLKDPEATLF SSACRARLSA ESNGRLS