Gene EcHS_A2225 details

Gene Information

Locus tag: EcHS_A2225
Symbol:
ID: 5592643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2212324
End bp: 2213223
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 50%
IMG OID: 640921355
Product: lipid kinase
Protein accession: YP_001458891
Protein GI: 157161573
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID: [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family
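
The start, end, gene length and protein length fields above are consistent with one another. A minimal Python sketch of the arithmetic follows. It assumes the coordinates are 1-based and inclusive, and that the final codon is an untranslated stop codon; the record does not state either convention, so both are assumptions.

```python
# Values copied from the record above.
start_bp = 2212324
end_bp = 2213223

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 900 bp
print(protein_length, "aa")  # 299 aa
```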


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGCAGAAT TTCCCGCCAG CTTACTGATT CTTAATGGCA AAAGTACTGA CAATCTACCC 
TTGCGCGAAG CAATTATGCT GTTGCGTGAG GAAGGAATGA CGATCCATGT GCGGGTCACC
TGGGAGAAAG GCGATGCCGC ACGATATGTA GAGGAGGCCC GGAAGTTGGG CGTCGCAACG
GTGATTGCCG GTGGTGGCGA TGGCACCATT AATGAAGTTT CTACGGCGTT GATTCAGTGT
GAGGGGGATG ACATACCCGC GCTGGGAATT TTGCCATTAG GAACCGCCAA TGATTTTGCC
ACCAGTGTAG GGATTCCTGA GGCACTGGAT AAGGCGCTGA AACTGGCAAT TGCCGGTGAC
GCCATTGCGA TAGATATGGC GCAGGTCAAC AAACAAACCT GTTTTATTAA TATGGCGACA
GGCGGATTTG GGACGCGTAT TACCACAGAA ACGCCGGAAA AATTAAAAGC CGCGCTGGGT
AGCGTCTCTT ACATCATTCA TGGCTTAATG CGTATGGATA CTCTGCAACC GGACCGTTGT
GAAATCCGCG GTGAAAACTT TCACTGGCAA GGTGACGCCC TGGTCATTGG TATTGGTAAC
GGGCGTCAGG CCGGTGGCGG TCAGCAATTG TGTCCGAACG CGTTAATTAA CGATGGCTTG
CTGCAACTGC GCATTTTTAC CGGCGATGAA ATACTTCCGG CTCTCGTATC AACCTTAAAA
TCTGACGAAG ATAACCCGAA TATTATCGAA GGCGCTTCGT CGTGGTTTGA TATTCAGGCA
CCACACGACA TCACCTTTAA TCTTGATGGC GAACCGTTGA GTGGGCAAAA TTTTCATATT
GAAATACTTC CGGCAGCGTT GCGTTGTCGA TTACCACCAG ATTGTCCACT GTTGCGTTAA
 
Protein sequence
MAEFPASLLI LNGKSTDNLP LREAIMLLRE EGMTIHVRVT WEKGDAARYV EEARKLGVAT 
VIAGGGDGTI NEVSTALIQC EGDDIPALGI LPLGTANDFA TSVGIPEALD KALKLAIAGD
AIAIDMAQVN KQTCFINMAT GGFGTRITTE TPEKLKAALG SVSYIIHGLM RMDTLQPDRC
EIRGENFHWQ GDALVIGIGN GRQAGGGQQL CPNALINDGL LQLRIFTGDE ILPALVSTLK
SDEDNPNIIE GASSWFDIQA PHDITFNLDG EPLSGQNFHI EILPAALRCR LPPDCPLLR
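
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: the `translate` helper and the `CODON_TABLE` name are hypothetical, not part of any tool referenced by this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits), and it assumes the sequence is given in the coding orientation, starting in frame.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGCAGAAT TTCCCGCCAG CTTACTGATT CTTAATGGCA AAAGTACTGA CAATCTACCC"
last_row = "GAAATACTTC CGGCAGCGTT GCGTTGTCGA TTACCACCAG ATTGTCCACT GTTGCGTTAA"

print(translate(first_row))  # MAEFPASLLILNGKSTDNLP
print(translate(last_row))   # EILPAALRCRLPPDCPLLR
```

Both outputs match the start and end of the protein sequence listed above; passing the full 900 bp gene sequence to the same function would translate the whole protein in one call.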