Gene NATL1_17101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17101 
Symbol 
ID4780155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1395291 
End bp1396982 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content38% 
IMG OID640084994 
Productfused sugar kinase/uncharacterized domain-containing protein 
Protein accessionYP_001015530 
Protein GI124026415 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.343407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGCAG TCTTTTTAAT GTTTTCTCGA CTGTTTCTTC TGAAGGCAAA TGATTTAAAA 
CTTTTTTTGT TTCTCTTGTT TCCACATTCA TGGGCATTGA TGATAGTAGC TTCTATTCAT
AGTGGGTTTG GTTGCTTTTT GATTTTGAAT TGGCCTCAAT CTGATTCGGA ACATTTGATG
GTTTCTTCAG AGCAGATGCA AAACATAGAA AAAGAAATGT TTTCTATGGG TATGCCTGTT
GAAGCTCTGA TGGAAAAAGT GGGAATTGGT ATTTCTTCAT GGATATTAGA CAGGCATGGG
TTGATTGAAA ATGGTGCAAT CGTTTTGGTA GGGCCTGGTC ACAATGGAGG CGATGGACTT
GTTGTTGCGA GAGAACTTTA TATGGCAGGC GTTGATATCT CTATTTGGTG TCCATTCCCA
TTGAAAAAAC AATTAACACA AAAACATTTT GATTATGCAA TGCAGATTGG GATTGAAAAC
TTGGAGCGTA AACCTGATTC AAATTCAGAT TTATTATGGA TTGAGGCTTT ATTTGGATTA
GGACAATCAA GAATTATTTC TGATGAAATA CTTCGCCTAT TGAATTCGAA GAAGACATTT
AGTCCTGAAA AGTTAATTAG TATTGATGTC CCCGCAGGAT TAGACTCAGA TAATGGAAAT
ATAGTATCAA ATACACCATG CAAAGCCAGT TCTTCGTTAA CCTTGGGTTT ATTTAAGTCG
GGATTAATTC AAGATTCAGC TATTGATTAT GTAGGTAATT TAGAGAGAGT AGATATTGGA
ATTCCAGATA AGATATTAGC TGGTTTTCCT GAAACACAGC CTCTAAGGAT TTCTTTTTCA
GATTTGTCTA CTTTTGTTTG GCCCATGCTT AGTAAAAGTA AAAGTAAATA TCAGAGAGGA
AGAGTTCTGG TGATTGCAGG GAGTGAGAAA TACAGGGGAG CAGCATCCCT TGCTTTGAAT
GGAGCTTTAG CAAGTGGGGT AGGTAGTGTT AGTGCTTTTT TGCCCAATTC CGTTTCATCT
GCTCTTTGGA TTACTCACCC TGAAGTCTTG TTACTTGGAG ATCTAAATGC TTTTCAAGAC
GGTTCTTCAG ATTTCTCCAA AGTTTTAAAT GAAGTTGACT TGAATAGGTT TGACTCGATT
TTGCTCGGAC CAGGATTGGG GATGGCAGAA GAAAAAGATT GTTTTGGTTC TGACTTGCAG
GATTTCAAAG GCCTACTTGT TCTTGATGCT GATGCAATTA ATAGGCTGTC AATAACATCA
AAAGGTTGGG AGTGGCTAAA TGATAGGGAA GGTCCCACTT GGCTTACCCC TCATCTTGAA
GAATTTAAAA GGTTATTTCC TTTAATTGAT TGCTCGAATC CATTGAAGGC TGGAATTGAA
GCTGCAAAAA CTTGCAGTTC TACTGTCTTA TTGAAATGTG CTCATAGTGT TATCTCTGAT
CCAGAAGGTA AAACTTGGCA AATAGGGCAA GTGAATTCAA GTGTTGCAAG AACTGGCCTT
GGCGATGTTT TGGCTGGGTT TGTTTCAGGG ATGGGTGCTT CCGGACTGGC AAGTGATAAA
AAATTGGATT CAAGTTTGCT TGCTGCGTCA GCTTTGATGC ATGCATATGC AGGGGCTTTT
TCTATAAAAG GGAGCACAGC GAGCACTATT TGCACTTTTC TTGGTGAATT AATAAAAAAG
GAAAGCACTT GA
 
Protein sequence
MFAVFLMFSR LFLLKANDLK LFLFLLFPHS WALMIVASIH SGFGCFLILN WPQSDSEHLM 
VSSEQMQNIE KEMFSMGMPV EALMEKVGIG ISSWILDRHG LIENGAIVLV GPGHNGGDGL
VVARELYMAG VDISIWCPFP LKKQLTQKHF DYAMQIGIEN LERKPDSNSD LLWIEALFGL
GQSRIISDEI LRLLNSKKTF SPEKLISIDV PAGLDSDNGN IVSNTPCKAS SSLTLGLFKS
GLIQDSAIDY VGNLERVDIG IPDKILAGFP ETQPLRISFS DLSTFVWPML SKSKSKYQRG
RVLVIAGSEK YRGAASLALN GALASGVGSV SAFLPNSVSS ALWITHPEVL LLGDLNAFQD
GSSDFSKVLN EVDLNRFDSI LLGPGLGMAE EKDCFGSDLQ DFKGLLVLDA DAINRLSITS
KGWEWLNDRE GPTWLTPHLE EFKRLFPLID CSNPLKAGIE AAKTCSSTVL LKCAHSVISD
PEGKTWQIGQ VNSSVARTGL GDVLAGFVSG MGASGLASDK KLDSSLLAAS ALMHAYAGAF
SIKGSTASTI CTFLGELIKK EST