Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3213 |
Symbol | |
ID | 6066691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3519470 |
End bp | 3521332 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641602628 |
Product | 1-deoxy-D-xylulose-5-phosphate synthase |
Protein accession | YP_001726162 |
Protein GI | 170021208 |
COG category | [H] Coenzyme transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1154] Deoxyxylulose-5-phosphate synthase |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000252771 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGTTA CGACTGTTGC CGAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC GTGGCGCTGC ACTATGTCTA CAACACCCCG TTTGACCAAT TGATTTGGGA TGTGGGGCAT CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG AAAGGCGGCC TGCACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAGAAAGAA GGCAAAAATC GCCGTACCGT CTGTGTAATC GGCGATGGCG CGATTACCGC AGGCATGGCG TTTGAAGCGA TGAATCACGC GGGCGATATC CGTCCTGATA TGCTGGTGGT CCTCAACGAC AACGAAATGT CGATTTCCGA AAATGTCGGC GCGCTCAACA ACCATCTGGC GCAGCTGCTT TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTGCCA CCCATTAAAG AGCTACTCAA ACGTACCGAA GAACATATTA AAGGCATGGT AGTCCCTGGC ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TTGACGGTCA CGATGTGCTG GGGCTTATCA CCACGCTGAA GAACATGCGT GACCTGAAAG GCCCGCAGTT CCTGCATATC ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCAATCAC TTTCCACGCC GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGCGG TTTGCCGAGC TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGACAA CAAGCTGATG GCGATTACTC CGGCGATGCG TGAAGGTTCC GGTATGGTCG AGTTTTCTCG TAAATTCCCG GATCGCTATT TTGATGTAGC GATTGCCGAG CAACACGCAG TGACCTTTGC TGCGGGACTG GCCATTGGTG GATACAAACC CATTGTCGCG ATCTACTCCA CCTTCCTGCA ACGCGCCTAT GATCAGGTGC TGCATGACGT GGCAATTCAA AAACTCCCGG TCCTGTTCGC CATCGACCGC GCGGGCATTG TTGGTGCTGA CGGTCAAACC CATCAGGGTG CTTTTGATCT CTCTTACCTG CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG CTCTATACCG GCTATCACTA TAACGATGGC CCGTCAGCGG TGCGCTACCC GCGTGGCAAC GCGGTAGGCG TGGAGCTGAC GCCGCTGGAA AAACTGCCAA TTGGCAAAGG CATTGTGAAG CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA CGCGGCGAAA GTCGCTGAAT CGCTGAACGC CACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA GCGTTAATTC TGGAAATGGC CGCCAGCCAT GAAGCGCTGG TCACCGTAGA AGAAAACGCC ATTATGGGCG GCGCAGGTAG CGGAGTGAAC GAAGTGCTGA TGGCCCATCG TAAACCAGTA CCCGTGCTGA ACATTGGTCT GCCGGACTTC TTTATTCCGC AAGGAACTCA GGAAGAAATG CGCGCCGAAC TCGGCCTCGA TGCCGCCGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA TAA
|
Protein sequence | MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVVLND NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPDAAK VAESLNATLV DMRFVKPLDE ALILEMAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM RAELGLDAAG MEAKIKAWLA
|
| |