Gene Information |
Locus tag | P9301_14751 |
Symbol | |
ID | 4911040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1243824 |
End bp | 1245389 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640161067 |
Product | fused sugar kinase/uncharacterized domain-containing protein |
Protein accession | YP_001091699 |
Protein GI | 126696813 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
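
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above; the names are illustrative only.
START_BP = 1243824
END_BP = 1245389


def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with one
    stop codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1566 521
```

Under those assumptions the result agrees with the Gene Length (1566 bp) and Protein Length (521 aa) fields.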
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAA TTGTATGGCC AACAATTGAT TCTAAACATT TAATTGTTGA TTCAAAGCAA ATGTTGATAG TAGAGAAAGA AATGTTTTCT GATGGAATGC CACAAGAAGC ATTGATGGAA AAAGCTGGTA TCCAAATTAG TAGATGGCTT CTAAAAAAGA AACCTCTTCT CAAGAATGGA ATAACTGTTT TTATAGGTCC TGGGCATAAT GGCGGGGATG GTGCAGTGAT AGCTAGAGAG CTTTTTTTGA AAGGTTTTTA TGTTCAAGTA TGGTGTCCAT TCCCGATAAA AAAAACACTA ACAAATGACC ACCTTAATTA TCTTACATCT ATTGGTGTCA CAAAATTAGT AGAGCCTCCT GATGCAAATG GGAAAGATCT TTGGATTGAT GCAGTTTTTG GTAATAATCA AACAAGAAAA GTCGATGATA GATTAATTAA ACTGTTTAAT CAAAAATTTC ATAACAAATA TAGCAAGGTA ATAAGTATTG ATATTCCAAC AGGATTATGT CCTGATAAAG GACAGCCTTT TTTAGATAAA GCTGTAAAGG CAGACTATAC CTTGGCAATT GGTCTTAATA AGATTGGGTT AACTCAAGAT TCTGCATTGC CTTTTATTGG AGAATTGCAC CATATAGATG TTGGGGTACC TATTAGTAAA CTGTCAAAAG TTGATAAAAA GATTTTTAAG GTTACTTACA AAGATTTAAA AAATATTGAT TTACCTTCTT TACCAAAAAA TTCCAACAAA TATAAAAGAG GTAGAACATT ATTAATAGCT GGAAGTGAAA AATATCCTGG CGCTGCATAC TTAGCATTAA AAGGCGCGAT ATCAAGTGGA GCAGGTTATG TCTCCGCTGT CCTTCCTGAG GTAGTTGCTG AATCTATTTG GCAAGTTGCT CCAGAAATAG TTTTAAAGGA TATTATGCAG TCTAATCAGA ATGGAAATGC ATCTTTATTT GGTGCATTAA AAAATATTGA TTTAAGTGCA TTTGATTCAG TAGCTGTCGG CCCAGGAATA GGAATTGATA ATGATGATTG GCAAAAATCA AAAGACTATC TTTTGGATTA TGGAGGGTTA TTGATCTTGG ATGCAGATGC ACTTAATAGA ATTTCGGAAT CTAAGTTAGG GGCAAATTTC TTTTTAGAGA GAAAATTTAA AACATGGATT ACACCTCATA GCAAAGAATT TCGAAGGTTA TTCCCTAATA TCAATTCTTC TACCAATTTA GGGCTAGCTC TTGATGCAGC AAAAGAATTT AACATTGGTA TTTTATTTAA GGGAGCTAAT AGCATAGTTG CTGATAGTAA AAAAGTTTGG CAACTTTTCG GAACCGATTC ACAGACAGCT CGAGCTGGAT TAGGTGATCT TTTATCTGGA TTTATAGCAG GGAGTTCTGC GATTGATTTA ACCTTTTGTA GAAACATAAC AGCAGAATAT TTTGCTAAAT ACGTACTTTT GCATTCATTT GCTGCATCAA AGTGCAAAAA AGGGTCAAAT GCATCTGCTA TTGGTGACGA ATTGTCAAAA TTAATGAGAA ATAAAAAAAT GAGACAAATA TCTTGA
|
Protein sequence | MNEIVWPTID SKHLIVDSKQ MLIVEKEMFS DGMPQEALME KAGIQISRWL LKKKPLLKNG ITVFIGPGHN GGDGAVIARE LFLKGFYVQV WCPFPIKKTL TNDHLNYLTS IGVTKLVEPP DANGKDLWID AVFGNNQTRK VDDRLIKLFN QKFHNKYSKV ISIDIPTGLC PDKGQPFLDK AVKADYTLAI GLNKIGLTQD SALPFIGELH HIDVGVPISK LSKVDKKIFK VTYKDLKNID LPSLPKNSNK YKRGRTLLIA GSEKYPGAAY LALKGAISSG AGYVSAVLPE VVAESIWQVA PEIVLKDIMQ SNQNGNASLF GALKNIDLSA FDSVAVGPGI GIDNDDWQKS KDYLLDYGGL LILDADALNR ISESKLGANF FLERKFKTWI TPHSKEFRRL FPNINSSTNL GLALDAAKEF NIGILFKGAN SIVADSKKVW QLFGTDSQTA RAGLGDLLSG FIAGSSAIDL TFCRNITAEY FAKYVLLHSF AASKCKKGSN ASAIGDELSK LMRNKKMRQI S
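
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the code below strips that grouping, computes GC content, and translates a coding sequence using the amino acid assignments of NCBI translation table 11 (the table named in the record). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only short excerpts of the gene sequence (its first 30 and last 6 bases) are used as input. The displayed gene sequence is assumed to be given in coding orientation, which is consistent with it starting with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest, order TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


head = "ATGAATGAAA TTGTATGGCC AACAATTGAT"  # first 30 bases of the gene
tail = "TCTTGA"                            # last 6 bases of the gene

print(translate(head))  # MNEIVWPTID
print(translate(tail))  # S
print(gc_percent(head))
```

The two translated excerpts match the first ten residues and the final residue of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would allow a comparison with the GC content field of the record.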