Gene Tery_0723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0723 
Symbol 
ID4242513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1170383 
End bp1171918 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content33% 
IMG OID638106015 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_720628 
Protein GI113474567 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.191783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.257286 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATA TCAAAGAAAA ATTTATAGTT ACTGCTGCAC AAATGCGACA AATAGAGGAA 
CGTATATTTA TTGCGGGAAT GCCTGTAGCA GCATTAATGG AAAAGGTTGC AGGATTAGTA
ACACGAAAAA TCTCAGAATT ATTAAATTCT AATATTAGTA AAATAGGTGT ATTAGTAGGT
CCTGGTCATA ATGGTGGAGA TGCTTTGGTC ATAGCCAGAG AATTATATTT TCAAGGTTAT
GAAATAATAG TTTACTGCCC TTTTGTAAAC CTGAAAGAAC TAACAAATGC TCATGCCAAA
TATCTCCAAT ATTTAGGAGT TAATTTTGTC AAAAATATTT CCCTGCTCAA AAATTGTGCT
TTGCTCATAG ATGGTTTATT TGGTTTTGGT TTAGAACGGG AAATATCTGG AGATTTAGCT
GAAAGTATCA GTCAAATTAA TAGTTGGCAA AAACAGGTAT TTAGTATTGA CTTACCTTCG
GGAATACATA CAGATACAGG AGAAGTTTTA GGAACAAGTA TTTGTGCTAC CTACACATTT
TGTTTGGGTC TGTGGAAACA GGCATTTTTA CAAGACAAAG CTCTAAAATA TTTTGGTAAA
TCTGAATTAA TTGACTTTGA TATTCCCCTA GCAGATATTA CAGAAGTTTT AGGAGAATTT
CCAACAATTG AGAGAATTAC AAAAACATCT GCTCTGGAAA ATTTACCTCT ACCTCGTCCC
CCAGCAACCT ATAAATATAC AAACGGTAAT TTATTATTAA TTGTAGGCTC CCATCGTTAT
AGTGGCGCGG CAGTTTTAAC AGGATTAGGT TCTAGAGTAA CAGGTGTAGG AATGTTATCA
ATTGCTGTAC CAAAATCAAT TAAACCTATA CTAAATAGTC ATTTACCAGA AGCACTAATT
ATTGGTTGCC CTGAAACAGA AACTGGAACA ATAAAACAAT TACCAGAAGA TATAGATTTA
AGTAAATTTG AAGCTATTGC TTGTGGACCT GGTTTAACTA TAGAACCTAC ATCTATAATA
GAAAAAGTAT TATCTGTAAA TTGTCCTCTA GTTTTAGATG CTGATGCTTT AAATATTTGT
GGAAATTTAG CAAATAATTC TTTAGTAAGT CAACGTCAAG CACCAACAAT TATTACTCCC
CACCCAGGAG AGTTTAAACG TCTATTTCCT CATCTAGTAG ATCAACTGAA TAATCGGATA
TTAGCTGCTG AAAAAGCTAC TAAAAATAGT AACGTTATAA TAGTTTTAAA AGGGGCAAAA
ACAATTATTG CTAATAGCCA AAAAATATTA ATAAATCCAG AAAGCACTCC CGCATTAGCT
AGAGGGGGAA GTGGAGATGT ACTTACAGGA TTAGTTGGAG GTTTATTAGC TATTAATTCA
ACTCAAACTC TACCATTCAA TGCTGTAAAA ACAGCAGTTT GGTGGCATGC TCAAGCAGCA
ATATTAGCAG TAAAAGAACG AACAGAGTTA GGAGTAGATG CTTTTACTTT AACTCAATAT
TTAATACCTG CGCTAAAAAA ATATAACTTT TGCTAA
 
Protein sequence
MNNIKEKFIV TAAQMRQIEE RIFIAGMPVA ALMEKVAGLV TRKISELLNS NISKIGVLVG 
PGHNGGDALV IARELYFQGY EIIVYCPFVN LKELTNAHAK YLQYLGVNFV KNISLLKNCA
LLIDGLFGFG LEREISGDLA ESISQINSWQ KQVFSIDLPS GIHTDTGEVL GTSICATYTF
CLGLWKQAFL QDKALKYFGK SELIDFDIPL ADITEVLGEF PTIERITKTS ALENLPLPRP
PATYKYTNGN LLLIVGSHRY SGAAVLTGLG SRVTGVGMLS IAVPKSIKPI LNSHLPEALI
IGCPETETGT IKQLPEDIDL SKFEAIACGP GLTIEPTSII EKVLSVNCPL VLDADALNIC
GNLANNSLVS QRQAPTIITP HPGEFKRLFP HLVDQLNNRI LAAEKATKNS NVIIVLKGAK
TIIANSQKIL INPESTPALA RGGSGDVLTG LVGGLLAINS TQTLPFNAVK TAVWWHAQAA
ILAVKERTEL GVDAFTLTQY LIPALKKYNF C