Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_18371 |
Symbol | |
ID | 4718574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1567711 |
End bp | 1568709 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640079570 |
Product | putative carbohydrate kinase |
Protein accession | YP_001010227 |
Protein GI | 123969369 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.455738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAAAGGTTAT ATGTATTGGT GAGGCTTTAA TAGACAGAAT CAGAAATAAG TCAAATAAGG GATTTACAGA TTTTTTGGGT GGTGCGCCGG CTAATGTCGC TTGTGCATTG AGAAAATTAA AAATAGATTC AATTTTTATG GGAAGTTTGG GTAATGATGA TTATGGAAAA AAATTTATTT CGCAATTTGA TCAATTGGGA GTTAATTTAG ATTTCTTGCA ATTAAATAAT GATTCATCTA CTCGTGTGGT TAATGTAAAT AGGGATCAAT TTGGAGATCG TTTTTTTTCA GGCTTTGAGG AAAGTTCTCA TGCATGCTAT GCAGACGAAG TTCTTAGCAA GAAATTAATA GAAAAAGAAA TTTTAAATTT GGAGAAATCT TTTCTAGAAA TAAAATATTT GGTAACAGGA ACGAACTTAT TATCATCTCC AATATCAGCA GAGACTATTT TTTTTCTTAT TGAACAGGCT AATAAATTTG AAGTCAAAAT AGTTATTGAT TTGAATTGGA GAGAGGTTTT TTGGGATCAT GCAAGTTTCT CATCAGAAAT TAGTAAAGCC GCGAGAGTTA ATTTAATCAA GAATTTTTTA AATTATGCAA ATGTTTTAAA GCTTGCGAAG GAAGAAGCAA TTTTGTTTTT TGAGGATGAA AACCCCTTGC TAATATCTCA ACGACTGTCT AATAGACCAG ATGTAATAAT AACTGATGGA AAAAATCCTG TTTTATGGAG CATCAACGGA TTTCAGGGAA TTACCGAAAC TCCTACTTCA CAAAAAATTG TTGATACAAC CGGGGCAGGC GATGCTTTTC TAGCTGGCTT TATTTCAAAA TTAATTTCTT CTGGCTATCC TACAAGTGAT TTAGAGATAG AAGATTGCAT TAAGTTCGCA GGTGTTTGTG GATTATTAAC TTGTCTTGGT GAAGGCGCTA TCGAGCAACA GCCATATTAT GAAAAGGTTA ATAAATTTTT GGGATCTCTT ATTTCGTAG
|
Protein sequence | MKKKKVICIG EALIDRIRNK SNKGFTDFLG GAPANVACAL RKLKIDSIFM GSLGNDDYGK KFISQFDQLG VNLDFLQLNN DSSTRVVNVN RDQFGDRFFS GFEESSHACY ADEVLSKKLI EKEILNLEKS FLEIKYLVTG TNLLSSPISA ETIFFLIEQA NKFEVKIVID LNWREVFWDH ASFSSEISKA ARVNLIKNFL NYANVLKLAK EEAILFFEDE NPLLISQRLS NRPDVIITDG KNPVLWSING FQGITETPTS QKIVDTTGAG DAFLAGFISK LISSGYPTSD LEIEDCIKFA GVCGLLTCLG EGAIEQQPYY EKVNKFLGSL IS
|
| |