Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_17101 |
Symbol | |
ID | 4780155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1395291 |
End bp | 1396982 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640084994 |
Product | fused sugar kinase/uncharacterized domain-containing protein |
Protein accession | YP_001015530 |
Protein GI | 124026415 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.343407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGCAG TCTTTTTAAT GTTTTCTCGA CTGTTTCTTC TGAAGGCAAA TGATTTAAAA CTTTTTTTGT TTCTCTTGTT TCCACATTCA TGGGCATTGA TGATAGTAGC TTCTATTCAT AGTGGGTTTG GTTGCTTTTT GATTTTGAAT TGGCCTCAAT CTGATTCGGA ACATTTGATG GTTTCTTCAG AGCAGATGCA AAACATAGAA AAAGAAATGT TTTCTATGGG TATGCCTGTT GAAGCTCTGA TGGAAAAAGT GGGAATTGGT ATTTCTTCAT GGATATTAGA CAGGCATGGG TTGATTGAAA ATGGTGCAAT CGTTTTGGTA GGGCCTGGTC ACAATGGAGG CGATGGACTT GTTGTTGCGA GAGAACTTTA TATGGCAGGC GTTGATATCT CTATTTGGTG TCCATTCCCA TTGAAAAAAC AATTAACACA AAAACATTTT GATTATGCAA TGCAGATTGG GATTGAAAAC TTGGAGCGTA AACCTGATTC AAATTCAGAT TTATTATGGA TTGAGGCTTT ATTTGGATTA GGACAATCAA GAATTATTTC TGATGAAATA CTTCGCCTAT TGAATTCGAA GAAGACATTT AGTCCTGAAA AGTTAATTAG TATTGATGTC CCCGCAGGAT TAGACTCAGA TAATGGAAAT ATAGTATCAA ATACACCATG CAAAGCCAGT TCTTCGTTAA CCTTGGGTTT ATTTAAGTCG GGATTAATTC AAGATTCAGC TATTGATTAT GTAGGTAATT TAGAGAGAGT AGATATTGGA ATTCCAGATA AGATATTAGC TGGTTTTCCT GAAACACAGC CTCTAAGGAT TTCTTTTTCA GATTTGTCTA CTTTTGTTTG GCCCATGCTT AGTAAAAGTA AAAGTAAATA TCAGAGAGGA AGAGTTCTGG TGATTGCAGG GAGTGAGAAA TACAGGGGAG CAGCATCCCT TGCTTTGAAT GGAGCTTTAG CAAGTGGGGT AGGTAGTGTT AGTGCTTTTT TGCCCAATTC CGTTTCATCT GCTCTTTGGA TTACTCACCC TGAAGTCTTG TTACTTGGAG ATCTAAATGC TTTTCAAGAC GGTTCTTCAG ATTTCTCCAA AGTTTTAAAT GAAGTTGACT TGAATAGGTT TGACTCGATT TTGCTCGGAC CAGGATTGGG GATGGCAGAA GAAAAAGATT GTTTTGGTTC TGACTTGCAG GATTTCAAAG GCCTACTTGT TCTTGATGCT GATGCAATTA ATAGGCTGTC AATAACATCA AAAGGTTGGG AGTGGCTAAA TGATAGGGAA GGTCCCACTT GGCTTACCCC TCATCTTGAA GAATTTAAAA GGTTATTTCC TTTAATTGAT TGCTCGAATC CATTGAAGGC TGGAATTGAA GCTGCAAAAA CTTGCAGTTC TACTGTCTTA TTGAAATGTG CTCATAGTGT TATCTCTGAT CCAGAAGGTA AAACTTGGCA AATAGGGCAA GTGAATTCAA GTGTTGCAAG AACTGGCCTT GGCGATGTTT TGGCTGGGTT TGTTTCAGGG ATGGGTGCTT CCGGACTGGC AAGTGATAAA AAATTGGATT CAAGTTTGCT TGCTGCGTCA GCTTTGATGC ATGCATATGC AGGGGCTTTT TCTATAAAAG GGAGCACAGC GAGCACTATT TGCACTTTTC TTGGTGAATT AATAAAAAAG GAAAGCACTT GA
|
Protein sequence | MFAVFLMFSR LFLLKANDLK LFLFLLFPHS WALMIVASIH SGFGCFLILN WPQSDSEHLM VSSEQMQNIE KEMFSMGMPV EALMEKVGIG ISSWILDRHG LIENGAIVLV GPGHNGGDGL VVARELYMAG VDISIWCPFP LKKQLTQKHF DYAMQIGIEN LERKPDSNSD LLWIEALFGL GQSRIISDEI LRLLNSKKTF SPEKLISIDV PAGLDSDNGN IVSNTPCKAS SSLTLGLFKS GLIQDSAIDY VGNLERVDIG IPDKILAGFP ETQPLRISFS DLSTFVWPML SKSKSKYQRG RVLVIAGSEK YRGAASLALN GALASGVGSV SAFLPNSVSS ALWITHPEVL LLGDLNAFQD GSSDFSKVLN EVDLNRFDSI LLGPGLGMAE EKDCFGSDLQ DFKGLLVLDA DAINRLSITS KGWEWLNDRE GPTWLTPHLE EFKRLFPLID CSNPLKAGIE AAKTCSSTVL LKCAHSVISD PEGKTWQIGQ VNSSVARTGL GDVLAGFVSG MGASGLASDK KLDSSLLAAS ALMHAYAGAF SIKGSTASTI CTFLGELIKK EST
|
| |