Gene B21_01980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01980 
SymbolyegS 
ID8114662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2066316 
End bp2067215 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content50% 
IMG OID644848194 
Producthypothetical protein 
Protein accessionYP_002999767 
Protein GI251785463 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID[TIGR00147] lipid kinase, YegS/Rv2252/BmrU family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAAT TTCCCGCCAG CTTACTGATT CTTAATGGCA AAAGTACTGA CAATTTACCC 
TTGCGCGAAG CAATTATGCT GTTGCGTGAG GAAGGAATGA CGATCCATGT GCGGGTCACC
TGGGAGAAAG GCGATGCCGC ACGATATGTA GAGGAGGCCC GGAAGTTGGG CGTCGCAACG
GTGATTGCCG GTGGTGGTGA TGGCACCATT AATGAAGTTT CTACGGCGTT GATTCAGTGT
GAGGGGGATG ACATACCCGC GCTGGGAATT TTGCCATTAG GAACCGCCAA TGATTTTGCC
ACCAGTGTAG GGATTCCTGA GGCACTGGAT AAGGCGCTGA AACTGGCAAT TGCCGGTAAC
GCCATTGCGA TAGATATGGC GCAGGTCAAC AAACAAACCT GTTTTATTAA TATGGCGACA
GGCGGATTTG GGACGCGTAT TACCACAGAA ACGCCGGAAA AATTAAAAGC CGCGCTGGGT
GGCGTCTCTT ACATCATTCA TGGCTTAATG CGCATGGATA CTCTGCAACC GGACCGTTGT
GAAATCCGCG GTGAAAACTT TCACTGGCAA GGTGACGCCC TGGTCATTGG TATTGGTAAC
GGGCGTCAGG CCGGTGGCGG TCAGCAATTG TGTCCGAACG CGTTAATTAA CGATGGCTTG
CTGCAACTGC GCATTTTTAC CGGCGATGAA ATACTTCCGG CTCTCGTATC AACCTTAAAA
TCTGACGAAG ATAACCCGAA TATTATCGAA GGCGCTTCGT CGTGGTTTGA TATTCAGGCA
CCACACGACA TCACCTTTAA TCTTGATGGC GAACCGTTGA GTGGGCAAAA TTTTCATATT
GAAATACTTC CGGCAGCGTT GCGTTGTCGA TTACCACCAG ATTGTCCACT GTTGCGTTAA
 
Protein sequence
MAEFPASLLI LNGKSTDNLP LREAIMLLRE EGMTIHVRVT WEKGDAARYV EEARKLGVAT 
VIAGGGDGTI NEVSTALIQC EGDDIPALGI LPLGTANDFA TSVGIPEALD KALKLAIAGN
AIAIDMAQVN KQTCFINMAT GGFGTRITTE TPEKLKAALG GVSYIIHGLM RMDTLQPDRC
EIRGENFHWQ GDALVIGIGN GRQAGGGQQL CPNALINDGL LQLRIFTGDE ILPALVSTLK
SDEDNPNIIE GASSWFDIQA PHDITFNLDG EPLSGQNFHI EILPAALRCR LPPDCPLLR