Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01980 |
Symbol | yegS |
ID | 8114662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2066316 |
End bp | 2067215 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644848194 |
Product | hypothetical protein |
Protein accession | YP_002999767 |
Protein GI | 251785463 |
COG category | [I] Lipid transport and metabolism [R] General function prediction only |
COG ID | [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase |
TIGRFAM ID | [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAT TTCCCGCCAG CTTACTGATT CTTAATGGCA AAAGTACTGA CAATTTACCC TTGCGCGAAG CAATTATGCT GTTGCGTGAG GAAGGAATGA CGATCCATGT GCGGGTCACC TGGGAGAAAG GCGATGCCGC ACGATATGTA GAGGAGGCCC GGAAGTTGGG CGTCGCAACG GTGATTGCCG GTGGTGGTGA TGGCACCATT AATGAAGTTT CTACGGCGTT GATTCAGTGT GAGGGGGATG ACATACCCGC GCTGGGAATT TTGCCATTAG GAACCGCCAA TGATTTTGCC ACCAGTGTAG GGATTCCTGA GGCACTGGAT AAGGCGCTGA AACTGGCAAT TGCCGGTAAC GCCATTGCGA TAGATATGGC GCAGGTCAAC AAACAAACCT GTTTTATTAA TATGGCGACA GGCGGATTTG GGACGCGTAT TACCACAGAA ACGCCGGAAA AATTAAAAGC CGCGCTGGGT GGCGTCTCTT ACATCATTCA TGGCTTAATG CGCATGGATA CTCTGCAACC GGACCGTTGT GAAATCCGCG GTGAAAACTT TCACTGGCAA GGTGACGCCC TGGTCATTGG TATTGGTAAC GGGCGTCAGG CCGGTGGCGG TCAGCAATTG TGTCCGAACG CGTTAATTAA CGATGGCTTG CTGCAACTGC GCATTTTTAC CGGCGATGAA ATACTTCCGG CTCTCGTATC AACCTTAAAA TCTGACGAAG ATAACCCGAA TATTATCGAA GGCGCTTCGT CGTGGTTTGA TATTCAGGCA CCACACGACA TCACCTTTAA TCTTGATGGC GAACCGTTGA GTGGGCAAAA TTTTCATATT GAAATACTTC CGGCAGCGTT GCGTTGTCGA TTACCACCAG ATTGTCCACT GTTGCGTTAA
|
Protein sequence | MAEFPASLLI LNGKSTDNLP LREAIMLLRE EGMTIHVRVT WEKGDAARYV EEARKLGVAT VIAGGGDGTI NEVSTALIQC EGDDIPALGI LPLGTANDFA TSVGIPEALD KALKLAIAGN AIAIDMAQVN KQTCFINMAT GGFGTRITTE TPEKLKAALG GVSYIIHGLM RMDTLQPDRC EIRGENFHWQ GDALVIGIGN GRQAGGGQQL CPNALINDGL LQLRIFTGDE ILPALVSTLK SDEDNPNIIE GASSWFDIQA PHDITFNLDG EPLSGQNFHI EILPAALRCR LPPDCPLLR
|
| |