Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0723 |
Symbol | |
ID | 4242513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1170383 |
End bp | 1171918 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638106015 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_720628 |
Protein GI | 113474567 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.191783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.257286 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATA TCAAAGAAAA ATTTATAGTT ACTGCTGCAC AAATGCGACA AATAGAGGAA CGTATATTTA TTGCGGGAAT GCCTGTAGCA GCATTAATGG AAAAGGTTGC AGGATTAGTA ACACGAAAAA TCTCAGAATT ATTAAATTCT AATATTAGTA AAATAGGTGT ATTAGTAGGT CCTGGTCATA ATGGTGGAGA TGCTTTGGTC ATAGCCAGAG AATTATATTT TCAAGGTTAT GAAATAATAG TTTACTGCCC TTTTGTAAAC CTGAAAGAAC TAACAAATGC TCATGCCAAA TATCTCCAAT ATTTAGGAGT TAATTTTGTC AAAAATATTT CCCTGCTCAA AAATTGTGCT TTGCTCATAG ATGGTTTATT TGGTTTTGGT TTAGAACGGG AAATATCTGG AGATTTAGCT GAAAGTATCA GTCAAATTAA TAGTTGGCAA AAACAGGTAT TTAGTATTGA CTTACCTTCG GGAATACATA CAGATACAGG AGAAGTTTTA GGAACAAGTA TTTGTGCTAC CTACACATTT TGTTTGGGTC TGTGGAAACA GGCATTTTTA CAAGACAAAG CTCTAAAATA TTTTGGTAAA TCTGAATTAA TTGACTTTGA TATTCCCCTA GCAGATATTA CAGAAGTTTT AGGAGAATTT CCAACAATTG AGAGAATTAC AAAAACATCT GCTCTGGAAA ATTTACCTCT ACCTCGTCCC CCAGCAACCT ATAAATATAC AAACGGTAAT TTATTATTAA TTGTAGGCTC CCATCGTTAT AGTGGCGCGG CAGTTTTAAC AGGATTAGGT TCTAGAGTAA CAGGTGTAGG AATGTTATCA ATTGCTGTAC CAAAATCAAT TAAACCTATA CTAAATAGTC ATTTACCAGA AGCACTAATT ATTGGTTGCC CTGAAACAGA AACTGGAACA ATAAAACAAT TACCAGAAGA TATAGATTTA AGTAAATTTG AAGCTATTGC TTGTGGACCT GGTTTAACTA TAGAACCTAC ATCTATAATA GAAAAAGTAT TATCTGTAAA TTGTCCTCTA GTTTTAGATG CTGATGCTTT AAATATTTGT GGAAATTTAG CAAATAATTC TTTAGTAAGT CAACGTCAAG CACCAACAAT TATTACTCCC CACCCAGGAG AGTTTAAACG TCTATTTCCT CATCTAGTAG ATCAACTGAA TAATCGGATA TTAGCTGCTG AAAAAGCTAC TAAAAATAGT AACGTTATAA TAGTTTTAAA AGGGGCAAAA ACAATTATTG CTAATAGCCA AAAAATATTA ATAAATCCAG AAAGCACTCC CGCATTAGCT AGAGGGGGAA GTGGAGATGT ACTTACAGGA TTAGTTGGAG GTTTATTAGC TATTAATTCA ACTCAAACTC TACCATTCAA TGCTGTAAAA ACAGCAGTTT GGTGGCATGC TCAAGCAGCA ATATTAGCAG TAAAAGAACG AACAGAGTTA GGAGTAGATG CTTTTACTTT AACTCAATAT TTAATACCTG CGCTAAAAAA ATATAACTTT TGCTAA
|
Protein sequence | MNNIKEKFIV TAAQMRQIEE RIFIAGMPVA ALMEKVAGLV TRKISELLNS NISKIGVLVG PGHNGGDALV IARELYFQGY EIIVYCPFVN LKELTNAHAK YLQYLGVNFV KNISLLKNCA LLIDGLFGFG LEREISGDLA ESISQINSWQ KQVFSIDLPS GIHTDTGEVL GTSICATYTF CLGLWKQAFL QDKALKYFGK SELIDFDIPL ADITEVLGEF PTIERITKTS ALENLPLPRP PATYKYTNGN LLLIVGSHRY SGAAVLTGLG SRVTGVGMLS IAVPKSIKPI LNSHLPEALI IGCPETETGT IKQLPEDIDL SKFEAIACGP GLTIEPTSII EKVLSVNCPL VLDADALNIC GNLANNSLVS QRQAPTIITP HPGEFKRLFP HLVDQLNNRI LAAEKATKNS NVIIVLKGAK TIIANSQKIL INPESTPALA RGGSGDVLTG LVGGLLAINS TQTLPFNAVK TAVWWHAQAA ILAVKERTEL GVDAFTLTQY LIPALKKYNF C
|
| |