Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_13371 |
Symbol | |
ID | 5731847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1205095 |
End bp | 1206669 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641285708 |
Product | fused sugar kinase/uncharacterized domain-containing protein |
Protein accession | YP_001551222 |
Protein GI | 159903878 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.873266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACTGGC CTAAATCAGA TGCAGATCAT TTAATTGCCT CAACTGCTCA GATGAGAAGC TTTGAAAACA ATCTCATCAA TAGAGGCATG CCTGTTGAGG CATTAATGGA AAAAGTAGGG CAAAAACTGA AAGATTGGTT TTTGGAAAGG CCTGAATTAC TTTTAAACGG TGTATTGATT TTAGTTGGTC CAGGGCACAA TGGAGGGGAC GGGCTGGTTT TAGCTAGAGA ATTGTTCTTG GAGAACATAG ATGTATCTAT TTGGTGTCCT TTGCCAATTA CGCAATCTTT AACTCGGCGT CATCAGGCTT ATTCGCATTG GCTGGGTATT AAGGAATTGC ATGAAGAACC TAATGTTTAT TGTAAGACTC TTTGGATTGA CGCCCTTTTT GGACTTGGAC AGACCCGTCC TCTTCCTGAG AAAATAGCAA ATCTTTTAAA GGGTAGAGAG GAAAAACAAC CAAGAAGATT AATCTCTATT GATGTTCCAT CTGGGATTTG TTCAGATACA GGAAAGACTT TCTCAAACGT TGCAGCTACA GCTTCTTATA CCTTGACTAT TGGCTTGAAT AAGCAGGGAT TGATTCAAGA CATTGCTATC CCATATGTGG GTGAATTAAT AAGACTTGAT ATTGGTATTC CACAGAAAGT TCTCAAGAAA GATAAGCAAA TTAACTTCTT CAAACTTGCA GGTGATGATC TTTTTTCGAT CCCTTGGCCT GAGCCTTCTT CAATTTCTTC TAAATATCAG CGTGGAAGAC TTTTGGTTGT TGCCGGCAGT GACAAGTATA AAGGTGCTGC TTTACTTGCT TTACAAGGCG CATTAGCAAG TGGTGTGGGC AGTATTAAGG CGCTTTTGCC CCAAAAAATG GCAGATAATA TTTGGCAAGT TTCTCCAGAA ATTGTTGTTT CAGGAATATT GGGAAATTCA ATTGATGGAG AATCTGAGAT AGGGGAGGCG ATCTTAAAAC ACAAATTAGA CCGTCTAGAC TCATTATTAA TTGGTCCTGG TTTAGCTTTG GCTAACGAAA AGTGGAGTGA TATTGCTTGC CAGCTTGAAA AATTTCCAGG GTTATTAGTT CTTGATGCTG ATGCTTTAAA TAGATTAGCT TTTTCTACTG AGAGCTGGGA GTGGATTAAT AAGCGTGAAG GACCAACATG GCTGACTCCT CATTTACAGG AGTTCTATCG TTTATTCCCA AATCTTAAGG ACTTATCACC TTTGGATGCA GCGCCTAAAG CGGCAAAGAT GACTGGGGCA AATGTATTGC TTAAAGGGGC ACATAGCCTT ATAGCATCCC CCTCTGGGGT GAGATGGCAA TTGGTGTCTA CATCTAGCTG TGCTGCGCGT GCAGGTTTAG GGGATGTTTT GGCTGGATTT GTTGCTGGAG TAGGTGCAAT AGGGTTTTCT ATAGGAGGAG CAATGAATGA TGAATTACTT GCTTTAGCAG CATTGCTTCA CGCGGAAGCG GCAAAAAGTT GCAAAAAAGG AAGTAAAGCA AGCTCAATTG CATTGGCTTT AGAAAAACTT GTAAGACACA TTCAGTGTCC TGAATGTGTT CAAAGAAACA TTTAA
|
Protein sequence | MNWPKSDADH LIASTAQMRS FENNLINRGM PVEALMEKVG QKLKDWFLER PELLLNGVLI LVGPGHNGGD GLVLARELFL ENIDVSIWCP LPITQSLTRR HQAYSHWLGI KELHEEPNVY CKTLWIDALF GLGQTRPLPE KIANLLKGRE EKQPRRLISI DVPSGICSDT GKTFSNVAAT ASYTLTIGLN KQGLIQDIAI PYVGELIRLD IGIPQKVLKK DKQINFFKLA GDDLFSIPWP EPSSISSKYQ RGRLLVVAGS DKYKGAALLA LQGALASGVG SIKALLPQKM ADNIWQVSPE IVVSGILGNS IDGESEIGEA ILKHKLDRLD SLLIGPGLAL ANEKWSDIAC QLEKFPGLLV LDADALNRLA FSTESWEWIN KREGPTWLTP HLQEFYRLFP NLKDLSPLDA APKAAKMTGA NVLLKGAHSL IASPSGVRWQ LVSTSSCAAR AGLGDVLAGF VAGVGAIGFS IGGAMNDELL ALAALLHAEA AKSCKKGSKA SSIALALEKL VRHIQCPECV QRNI
|
| |