Gene EcSMS35_0975 details


Gene Information

Locus tag: EcSMS35_0975
Symbol:
ID: 6146203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 985210
End bp: 986109
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 50%
IMG OID: 641615862
Product: lipid kinase
Protein accession: YP_001743054
Protein GI: 170683733
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID: [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family
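
The start, end, gene length and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but that convention reproduces the listed 900 bp) and that the protein length excludes the stop codon. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
start_bp = 985210
end_bp = 986109


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(start_bp, end_bp)
print(gene_len)                  # 900
print(protein_length(gene_len))  # 299
```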


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.435416
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAT TTCCCGCCAG CTTACTGATT CTTAATGGCA AAAGTACTGA CAATCTACCC 
CTGCGCGAAG CAATTATGCT GTTGCGTGAG GAAGGAATGA CGATCCATGT GCGGGTCACC
TGGGAGAAAG GCGATGCCGC ACGATATGTA GAGGAGGCCC GGAAGTTGGG CGTCGCAACG
GTGATTGCCG GTGGTGGCGA TGGCACCATT AATGAAGTTT CTACGGCGTT GATTCAGTGT
GAGGGGGATG ACATACCCGC GCTGGGAATT TTGCCATTAG GAACCGCCAA TGATTTTGCC
ACCAGTGTAG GGATTCCTGA GGCACTGGAT AAGGCGCTGA AACTGGCAAT TGCCGGTAAC
GCCATTGCGA TAGATATGGC GCAGGTCAAC AAACAAACCT GTTTTATTAA TATGGCGACA
GGCGGATTTG GGACGCGTAT TACCACAGAA ACGCCGGAAA AATTAAAAGC CGCGCTGGGT
GGCGTCTCTT ACATCATTCA TGGCTTAATG CGCATGGATA CTCTGCAACC GGACCGTTGT
GAAATCCGCG GTGAAAACTT TCACTGGCAA GGTGACGCCC TGGTCATTGG TATTGGTAAC
GGGCGTCAGG CTGGTGGCGG TCAGCAATTG TGCCCGAACG CGTTAATTAA CGATGGCTTG
CTGCAACTGC GCATTTTTAC CGGCGATGAA ATTCTTCCGG CTCTCGTATC AACCTTAAAA
TCTGACGAAG ATAACCCGAA TATTATCGAA GGCGCTTCGT CGTGGTTTGA TATACAAGCC
CCACACGAAA TCACTTTTAA TCTTGATGGC GAACCGTTGA GCGGACAAAA CTTTCATATT
GAAATACTTC CGGCGGCGTT GCGTTGTCGA TTACCACCGG ATTGTCCATT ATTGCGTTAA
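
The gene sequence above is printed in blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helper below (a hypothetical name, not part of any database tool) strips the spacing and reports the GC percentage of whatever sequence it is given; running it on the full 900 bp sequence is one way to compare against the GC content listed in the record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Example on the first ten-base block of the gene sequence only.
print(gc_percent("ATGGCAGAAT"))  # 40.0
```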
 
Protein sequence
MAEFPASLLI LNGKSTDNLP LREAIMLLRE EGMTIHVRVT WEKGDAARYV EEARKLGVAT 
VIAGGGDGTI NEVSTALIQC EGDDIPALGI LPLGTANDFA TSVGIPEALD KALKLAIAGN
AIAIDMAQVN KQTCFINMAT GGFGTRITTE TPEKLKAALG GVSYIIHGLM RMDTLQPDRC
EIRGENFHWQ GDALVIGIGN GRQAGGGQQL CPNALINDGL LQLRIFTGDE ILPALVSTLK
SDEDNPNIIE GASSWFDIQA PHEITFNLDG EPLSGQNFHI EILPAALRCR LPPDCPLLR
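
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table from the compact NCBI ordering of bases (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGGCAGAAT TTCCCGCCAG CTTACTGATT"))  # MAEFPASLLI
```

The output matches the first ten residues of the protein sequence above; passing the full gene sequence to the same function is a way to check the remaining residues.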