Gene Information |
Locus tag | EcSMS35_0975 |
Symbol | |
ID | 6146203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 985210 |
End bp | 986109 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641615862 |
Product | lipid kinase |
Protein accession | YP_001743054 |
Protein GI | 170683733 |
COG category | [I] Lipid transport and metabolism [R] General function prediction only |
COG ID | [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase |
TIGRFAM ID | [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family |
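
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 985210
end_bp = 986109
gene_length_bp = 900
protein_length_aa = 299


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

The strand field does not affect either check, because both depend only on the span of the feature.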
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.435416 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAAT TTCCCGCCAG CTTACTGATT CTTAATGGCA AAAGTACTGA CAATCTACCC CTGCGCGAAG CAATTATGCT GTTGCGTGAG GAAGGAATGA CGATCCATGT GCGGGTCACC TGGGAGAAAG GCGATGCCGC ACGATATGTA GAGGAGGCCC GGAAGTTGGG CGTCGCAACG GTGATTGCCG GTGGTGGCGA TGGCACCATT AATGAAGTTT CTACGGCGTT GATTCAGTGT GAGGGGGATG ACATACCCGC GCTGGGAATT TTGCCATTAG GAACCGCCAA TGATTTTGCC ACCAGTGTAG GGATTCCTGA GGCACTGGAT AAGGCGCTGA AACTGGCAAT TGCCGGTAAC GCCATTGCGA TAGATATGGC GCAGGTCAAC AAACAAACCT GTTTTATTAA TATGGCGACA GGCGGATTTG GGACGCGTAT TACCACAGAA ACGCCGGAAA AATTAAAAGC CGCGCTGGGT GGCGTCTCTT ACATCATTCA TGGCTTAATG CGCATGGATA CTCTGCAACC GGACCGTTGT GAAATCCGCG GTGAAAACTT TCACTGGCAA GGTGACGCCC TGGTCATTGG TATTGGTAAC GGGCGTCAGG CTGGTGGCGG TCAGCAATTG TGCCCGAACG CGTTAATTAA CGATGGCTTG CTGCAACTGC GCATTTTTAC CGGCGATGAA ATTCTTCCGG CTCTCGTATC AACCTTAAAA TCTGACGAAG ATAACCCGAA TATTATCGAA GGCGCTTCGT CGTGGTTTGA TATACAAGCC CCACACGAAA TCACTTTTAA TCTTGATGGC GAACCGTTGA GCGGACAAAA CTTTCATATT GAAATACTTC CGGCGGCGTT GCGTTGTCGA TTACCACCGG ATTGTCCATT ATTGCGTTAA
|
Protein sequence | MAEFPASLLI LNGKSTDNLP LREAIMLLRE EGMTIHVRVT WEKGDAARYV EEARKLGVAT VIAGGGDGTI NEVSTALIQC EGDDIPALGI LPLGTANDFA TSVGIPEALD KALKLAIAGN AIAIDMAQVN KQTCFINMAT GGFGTRITTE TPEKLKAALG GVSYIIHGLM RMDTLQPDRC EIRGENFHWQ GDALVIGIGN GRQAGGGQQL CPNALINDGL LQLRIFTGDE ILPALVSTLK SDEDNPNIIE GASSWFDIQA PHEITFNLDG EPLSGQNFHI EILPAALRCR LPPDCPLLR
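
The relationship between the two sequences above can be sketched in a few lines. The gene sequence is already listed in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is applied here. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed, so the standard codon layout is used. The function name `translate_cds` is hypothetical, and only the first 30 bases of the gene sequence are shown for brevity.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so that sequences printed in blocks of ten
    bases, as in the record above, can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
first_blocks = "ATGGCAGAAT TTCCCGCCAG CTTACTGATT"
print(translate_cds(first_blocks))  # MAEFPASLLI
```

The printed residues match the first block of the protein sequence listed in the record. Passing the full gene sequence instead of `first_blocks` would allow the whole translation to be compared in the same way.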