Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_05061 |
Symbol | |
ID | 5730439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 473248 |
End bp | 474294 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641284865 |
Product | carbohydrate kinase |
Protein accession | YP_001550391 |
Protein GI | 159903047 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0313242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAACAAA AAGGAATCAA CAATCATGAA AAAGAGATAG ACGTTGTAGC GATAGGGAAT GCAATAGTAG ATGTTTTGGT ATACGAAAGT GATTCATTTC TTGAGAAAAA TTCATTAACA AAAGGGAGTA TGGCTCTCAT CGATGAAGAT GAAGCTAATA AGTTATACAA GAGCTGTGGC CCTGGTCTAG AGACCTCTGG AGGTTCAGCA GCCAATACTA TGGCAGGCTT ATCACAATTA GGTGGCAAGG CAGGATTTAT TGGACGAGTA AAAAAAGACC AACTTGGAGA AATTTTTACC CATGATATTT GCTCTACAGG AGCCATCTAT ACAACTCCAG CGATAGTAAA AGGTCCATCT ACTGCAAGAT GTTTTATTTT TGTCACCCCT GATGCTCAGC GAACAATGTG TACTTTTCTT GGTGCATCAG TTTTTCTAAA CCCAGCAGAT TTAGATCTTT CATTAGTTAG AAAAACAAAA GTCCTCTATC TTGAGGGCTA TCTATGGGAC CATGATGAAG CAAAGAATGC ATTCATCACT TCAGCAAAAG AGTGTAAATT AGCTGGAGGT AAAGTAGCTT TATCTTTATC AGATTCTTTT TGCATAGACC GTCACAGAGA AAGCTTTCAA AACTTAGTCG AGAATCATGT AGATATACTT TTCGCAAATG AATCCGAGAT AATCTCATTA TATGAGAGTA ATGATTTCGA ATCAGCAAAA AACATTATAA AAGGAAAGTG TGAAGTCTCA GTTCTTACAA GAGGTAAAGA CGGATCATTA ATCCTTCATA GAAGTAAAGA ATATATAGTT AGGCCTTATA AGTTAGGAGA ACTATTAGAT ACAACTGGTG CAGGCGATAT ATATGCAGCT GGCTTTCTTT ATGGCTATAC AAACAATAAA GACCTCTACA CATGTGGGAA AATCGGTTCT TTCTGTGCAG GTCACATTGT CACTCAATTA GGGCCTCGTT CTCGCGAATC TCTTGTAAAG CTTTTAGACG AGCAGTTACA TCTAAAAGAT GTTAATAATC CAAAAGAGAT TGATTAA
|
Protein sequence | MEQKGINNHE KEIDVVAIGN AIVDVLVYES DSFLEKNSLT KGSMALIDED EANKLYKSCG PGLETSGGSA ANTMAGLSQL GGKAGFIGRV KKDQLGEIFT HDICSTGAIY TTPAIVKGPS TARCFIFVTP DAQRTMCTFL GASVFLNPAD LDLSLVRKTK VLYLEGYLWD HDEAKNAFIT SAKECKLAGG KVALSLSDSF CIDRHRESFQ NLVENHVDIL FANESEIISL YESNDFESAK NIIKGKCEVS VLTRGKDGSL ILHRSKEYIV RPYKLGELLD TTGAGDIYAA GFLYGYTNNK DLYTCGKIGS FCAGHIVTQL GPRSRESLVK LLDEQLHLKD VNNPKEID
|
| |