Gene Information |
Locus tag | EcSMS35_0456 |
Symbol | dxs |
ID | 6146939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 464346 |
End bp | 466208 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615350 |
Product | 1-deoxy-D-xylulose-5-phosphate synthase |
Protein accession | YP_001742557 |
Protein GI | 170680644 |
COG category | [H] Coenzyme transport and metabolism; [I] Lipid transport and metabolism |
COG ID | [COG1154] Deoxyxylulose-5-phosphate synthase |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
|
|
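The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 464346
END_BP = 466208
GENE_LENGTH_BP = 1863
PROTEIN_LENGTH_AA = 620


def cds_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


span = cds_span(START_BP, END_BP)
print(span, expected_protein_length(span))
```

Under these assumptions the span and residue count agree with the Gene Length and Protein Length fields.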
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.927308 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGTTA CGACTGTTGC CGAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC GTGGCGCTGC ACTATGTCTA CAATACCCCG TTTGACCAGT TAATCTGGGA TGTGGGGCAT CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG AAAGGCGGCC TGCACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAGAAAGAA GGCAAAAATC GCCGTACCGT CTGTGTCATT GGCGATGGCG CGATTACCGC TGGCATGGCG TTTGAAGCGA TGAATCACGC GGGCGATATC CGTCCTGATA TGCTGGTGGT TCTCAACGAC AATGAAATGT CGATTTCCGA AAATGTTGGC GCGCTCAACA ACCATCTGGC GCAACTGCTT TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTTCCG CCCATTAAAG AGCTGCTCAA ACGTACCGAA GAACATATTA AAGGCATGGT AGTGCCTGGC ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TGGACGGTCA CGATGTGCTG GGGCTTATCA CCACGCTGAA GAACATGCGC GACCTGAAAG GCCCGCAGTT CCTGCATATC ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCAATCAC TTTCCACGCC GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGCGG TTTGCCGAGC TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGACAA CAAGCTGATG GCGATTACTC CGGCGATGCG CGAAGGTTCT GGCATGGTCG AGTTTTCACG TAAATTCCCG GATCGCTACT TCGACGTGGC AATCGCCGAG CAACACGCAG TGACCTTTGC TGCGGGTCTG GCGATTGGTG GCTACAAACC CATTGTCGCG ATCTACTCCA CTTTCCTGCA ACGCGCCTAT GATCAGGTGC TGCATGACGT GGCGATACAA AAACTCCCGG TTCTGTTCGC CATTGACCGC GCGGGCATTG TTGGTGCTGA CGGTCAAACC CACCAGGGCG CTTTTGATCT CTCTTACCTG CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG CTCTATACCG GCTATCACTA TAACGATGGC CCGTCAGCGG TACGCTACCC GCGTGGCAAC GCGGTTGGCG TGGAACTGAC GCCGCTGGAA AAACTGCCAA TTGGCAAAGG CATTGTGAAG CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA AGCGGCGAAA GTCGCTGAAT CGCTGAACGC TACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA GCGTTAATTC TGGAAATAGC CGCCAGCCAT GAAGCGCTGG TCACCGTAGA AGAAAACGCC ATTATGGGCG GCGCAGGCAG CGGAGTGAAC GAAGTGCTGA TGGCCCATCG TAAACCAGTA CCCGTGCTGA ACATTGGCCT GCCTGACTTC TTTATTCCAC AAGGAACACA GGAAGAAATG 
CGCGCCGAAC TCGGCCTCGA TGCCGCCGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA TAA
|
Protein sequence | MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVVLND NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPEAAK VAESLNATLV DMRFVKPLDE ALILEIAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM RAELGLDAAG MEAKIKAWLA
|
| |
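The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own method; it assumes that the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-") and that the amino acid assignments of translation table 11 match the standard genetic code, with the two tables differing only in permitted start codons. The names `CODON_TABLE` and `translate` are illustrative. Only short excerpts of the sequence are used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
head = "ATGAGTTTTG ATATTGCCAA ATACCCGACC"
# Last 12 bases of the gene sequence, including the terminal TAA.
tail = "TGGCTGGCA TAA"

print(translate(head))
print(translate(tail))
```

The first call should reproduce the opening ten residues of the protein sequence, and the second its final three residues, with the TAA stop codon left untranslated.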