Gene Information |
Locus tag | ECH74115_0503 |
Symbol | dxs |
ID | 6970204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 507954 |
End bp | 509816 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384551 |
Product | 1-deoxy-D-xylulose-5-phosphate synthase |
Protein accession | YP_002269065 |
Protein GI | 209398295 |
COG category | [H] Coenzyme transport and metabolism; [I] Lipid transport and metabolism |
COG ID | [COG1154] Deoxyxylulose-5-phosphate synthase |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
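
The coordinate and length fields in this record can be cross-checked with a short calculation. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(507954, 509816)
print(length_bp)                  # 1863
print(protein_length(length_bp))  # 620
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.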
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGCTA CGACTGTTGC CAAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC GTGGCGCTGC ACTATGTCTA TAACACCCCG TTTGACCAAT TGATTTGGGA TGTGGGGCAT CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG AAAGGCGGTC TACACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAGAAAGAA GGCAAAAATC GCCGCACCGT CTGTGTGATT GGCGACGGCG CGATTACCGC AGGCATGGCG TTTGAAGCGA TGAATCACGC GGGCGATATT CGTCCTGATA TGCTGGTGGT CCTCAACGAC AATGAAATGT CGATTTCCGA AAATGTTGGC GCGCTTAACA ACCATCTGGC GCAGCTGCTT TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTTCCG CCAATTAAAG AGCTGCTCAA ACGTACCGAA GAACATATTA AAGGCATGGT GGTGCCTGGC ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TTGACGGTCA CGATGTGCTG GGGCTTATCA CCACGCTGAA GAACATGCGC GACCTGAAAG GCCCGCAGTT CCTGCATATC ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCAATCAC CTTCCACGCC GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGTGG TTTACCAAGC TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGATAA CAAACTGATG GCGATTACTC CGGCGATGCG TGAAGGTTCC GGCATGGTCG AGTTTTCACG TAAATTCCCG GATCGCTATT TTGATGTAGC GATTGCCGAG CAACACGCGG TGACCTTTGC CGCCGGTCTG GCGATTGGTG GCTACAAACC GATCGTGGCA ATTTACTCCA CCTTCCTGCA ACGCGCCTAT GATCAGGTGC TGCATGACGT GGCAATTCAA AAACTCCCGG TCCTGTTCGC CATCGACCGC GCGGGCATTG TTGGTGCTGA CGGTCAAACC CACCAGGGCG CTTTTGATCT CTCTTACCTG CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG CTCTATACCG GCTATCACTA TAACGATGGC CCGTCAGCGG TGCGCTACCC GCGTGGCAAC GCGGTCGGCG TGGAACTGAC GCCGCTGGAA AAACTACCAA TTGGCAAAGG CATTGTGAAG CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA AGCGGCGAAA GTCGCCGAAT CGCTGAACGC CACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA ACGTTAATTC TGGAAATGGC CGCCAGTCAT GAAGCACTGG TTACCGTAGA AGAAAACGCC ATTATGGGCG GTGCAGGCAG CGGAGTGAAC GAAGTGCTGA TGGCCCATCG TAAACCAGTA CCCGTGCTGA ACATTGGCCT GCCGGACTTC TTTATTCCGC AAGGAACTCA GGAAGAAATG 
CGCGCCGAAC TCGGCCTCGA TGCCGCCGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA TAA
Protein sequence | MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVVLND NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPEAAK VAESLNATLV DMRFVKPLDE TLILEMAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM RAELGLDAAG MEAKIKAWLA
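
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, assuming the sequences are given in blocks of ten characters separated by whitespace, as they are displayed here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acid to each codon as the standard code and differs in which codons may act as start codons. This sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGAGTTTTG ATATTGCCAA ATACCCGACC"))  # MSFDIAKYPT
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.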