Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_14891 |
Symbol | |
ID | 4718210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1270719 |
End bp | 1272284 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640079210 |
Product | fused sugar kinase/uncharacterized domain-containing protein |
Protein accession | YP_001009879 |
Protein GI | 123969021 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAA TTGTATGGCC AACAATTGAC TCTGAACATT TAATTGTTGA TTCAAAGCAA ATGCTGGTAT TAGAGAAAGA AATGTTTTCT GATGGGATGC CAAAAGAAGC ATTGATGGAA AAAGCTGGTA TCCAAATTAG TAGATGGCTC TTAAAAAAGA AACCTCTTCT CAAGCATGGA ATAACTTTTT TTATAGGTCC TGGGCATAAT GGCGGGGATG GTGCAGTAAT AGCTAGAGAA CTTTTTTTGA GAGGTTTTAA TGTTCAGGTA TGGTGTCCAT TCCCGATAAA AAAAACATTA ACAAATAACC ACCTTAATTA TCTTACATCT ATTGGTGTCA CAAAATTAGT AGAGCCTCCT GATGCAAATG GGAAAGAGCT TTGGATTGAT GCAGTTTTTG GTAACAATCA AACAAGAAAA GTTGATGATA AATTAATTAA ACTGTTTAAT CAAAAATTTC ATAAAAAACA TGGAAAGGTA ATAAGTATTG ATATTCCAAC AGGATTATGT CCTGATAAAG GAGAGCCTTT TTTAGATAAC GCAGTAAAGG CAGATTATAC CTTGACCATT GGTCTTTATA AGATTGGGTT GACCCAAGAT TCTGCTTTGC CTTTTATTGG AGAATTGCAC CATATAGATG TTGGGATACC TACTAGTAAA CTGTCCAAAG TTGATAAAAA GATTTTTAAG GTTACTTACA AAGACATTAA ACATATTGAT TTACCTTCTT TACCAAAAAA TTACAACAAA TATCAAAGAG GTAGGACATT ACTAATAGCA GGAAGTGAAA AATATCCTGG TGCCGCATAC TTGGCATTAA AAGGGGCTAT GTCAAGTGGA GCAGGCTTTA TCTCTGCGGT CCTTCCTGGG ATAGTTTCTG AATCTATTTG GCAAGTTGCC CCAGAAATAG TTTTGAAAGA AACTATGCAA TCTAACCAAA ATGGCAATGC ATCTTTATTT AGTGCATTAA GGAATATTGA TTTAAGTGCA TTTGATTCAT TAGCTGTCGG TCCAGGAATA GGAATTGATA ATGATGATTG GCAAAAATCA AAAGACTATC TTATTGATTT TGGAGGATTA TTGATCTTGG ATGCAGATGC ACTTAATAGA ATTTCGGAAT CTAAGTTGGG GGCAAATTTC TTTTTAGAGA GAAAGTTTAA GACATGGATT ACACCACATA GCAAAGAATT TGGAAGATTA TTTCCCAATA TCAATTCTCA TACCAATGTT GGGCTTGCCC GTGAAGCGGC AAAAGAATTC AATATCAGTA TTTTGTTAAA AGGAGCTAAC AGTGTAGTTG CTGATAATAG AAAAGTATGG CAACTTTTTG GAACCGATTC TCAGACAGCT CGAGCTGGAT TGGGAGATCT TTTATCTGGT TTTGTAGCTG GTAGTTCTGC AATTGATTCA ACCTTGTGTA GAAATATATC AACAGATTTT TTTGCTAAGT ATGTACTTCT GCATTCATTT GCTGCATCAA AGTGCAAAAA AGGGTCAAAT GCTTCTGTTA TTGGTGACGA ATTGTCAAAA TTAATGAGAA ATATAAAAAC GAGACAAATA TCTTGA
|
Protein sequence | MNEIVWPTID SEHLIVDSKQ MLVLEKEMFS DGMPKEALME KAGIQISRWL LKKKPLLKHG ITFFIGPGHN GGDGAVIARE LFLRGFNVQV WCPFPIKKTL TNNHLNYLTS IGVTKLVEPP DANGKELWID AVFGNNQTRK VDDKLIKLFN QKFHKKHGKV ISIDIPTGLC PDKGEPFLDN AVKADYTLTI GLYKIGLTQD SALPFIGELH HIDVGIPTSK LSKVDKKIFK VTYKDIKHID LPSLPKNYNK YQRGRTLLIA GSEKYPGAAY LALKGAMSSG AGFISAVLPG IVSESIWQVA PEIVLKETMQ SNQNGNASLF SALRNIDLSA FDSLAVGPGI GIDNDDWQKS KDYLIDFGGL LILDADALNR ISESKLGANF FLERKFKTWI TPHSKEFGRL FPNINSHTNV GLAREAAKEF NISILLKGAN SVVADNRKVW QLFGTDSQTA RAGLGDLLSG FVAGSSAIDS TLCRNISTDF FAKYVLLHSF AASKCKKGSN ASVIGDELSK LMRNIKTRQI S
|
| |