Gene ECH74115_3068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3068 
Symbol 
ID6967255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2840311 
End bp2841210 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content50% 
IMG OID643386900 
Productlipid kinase 
Protein accessionYP_002271368 
Protein GI209396727 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID[TIGR00147] lipid kinase, YegS/Rv2252/BmrU family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.389752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000297552 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGAAT TTCCCGCCAG CTTACTGATT CTTAATGGCA AAAGTACTGA CAATCTACCC 
TTGCGCGAAG CAATTATGCT GTTGCGTGAG GAAGGAATGA CGATCCATGT GCGGGTCACC
TGGGAGAAAG GCGATGCCGC ACGATATGTA GAGGAGGCCC GGAAGTTGGG CGTCGCAACG
GTGATTGCCG GTGGTGGCGA TGGCACCATT AATGAAGTTT CTACGGCGTT GATTCAGTGT
GAGGGGGATG ACATACCCGC GCTGGGAATT TTGCCATTAG GAACCGCCAA TGATTTTGCC
ACCAGTGTAG GGATTCCTGA GGCACTGGAT AAGGCGCTGA AACTGGCAAT TGCCGGTAAC
GCCATTGCGA TAGATATGGC GCAGGTCAAC AAACAAACCT GTTTTATTAA TATGGCGACA
GGCGGATTTG GGACGCGTAT TACCACAGAA ACGCCGGAAA AATTAAAAGC CGCGCTGGGT
GGCGTCTCTT ACATCATTCA TGGCTTAATG CGCATGGATA CTCTGCAACC GGACCGTTGT
GAAATCCGCG GTGAAAACTT TCACTGGCAA GGTGACGCCC TGGTCATTGG TATTGGTAAC
GGGCGTCAGG CCGGTGGCGG TCAACAATTG TGCCCGAACG CGTTAATTAA CGATGGCTTG
CTGCAACTGC GCATTTTTAC CGGCGATGAA ATTCTTCCGG CTCTCGTATC AACCTTAAAA
TCTGACGAAG ATAACCCGAA TATTATCGAA GGCGCTTCGT CGTGGTTTGA TATACAAGCC
CCACACGAAA TCACTTTTAA TCTTGATGGC GAACCGTTGA GTGGGCAAAA CTTCCATATT
GAAATACTTC CGGCGGCGTT GCGTTGTCGA TTACCACCAG ATTGTCCATT ATTGCGTTAA
 
Protein sequence
MAEFPASLLI LNGKSTDNLP LREAIMLLRE EGMTIHVRVT WEKGDAARYV EEARKLGVAT 
VIAGGGDGTI NEVSTALIQC EGDDIPALGI LPLGTANDFA TSVGIPEALD KALKLAIAGN
AIAIDMAQVN KQTCFINMAT GGFGTRITTE TPEKLKAALG GVSYIIHGLM RMDTLQPDRC
EIRGENFHWQ GDALVIGIGN GRQAGGGQQL CPNALINDGL LQLRIFTGDE ILPALVSTLK
SDEDNPNIIE GASSWFDIQA PHEITFNLDG EPLSGQNFHI EILPAALRCR LPPDCPLLR