Gene ECH74115_0503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0503 
Symboldxs 
ID6970204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp507954 
End bp509816 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content54% 
IMG OID643384551 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_002269065 
Protein GI209398295 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGCTA 
CGACTGTTGC CAAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC
GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC
GTGGCGCTGC ACTATGTCTA TAACACCCCG TTTGACCAAT TGATTTGGGA TGTGGGGCAT
CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG
AAAGGCGGTC TACACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC
GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAGAAAGAA
GGCAAAAATC GCCGCACCGT CTGTGTGATT GGCGACGGCG CGATTACCGC AGGCATGGCG
TTTGAAGCGA TGAATCACGC GGGCGATATT CGTCCTGATA TGCTGGTGGT CCTCAACGAC
AATGAAATGT CGATTTCCGA AAATGTTGGC GCGCTTAACA ACCATCTGGC GCAGCTGCTT
TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTTCCG
CCAATTAAAG AGCTGCTCAA ACGTACCGAA GAACATATTA AAGGCATGGT GGTGCCTGGC
ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TTGACGGTCA CGATGTGCTG
GGGCTTATCA CCACGCTGAA GAACATGCGC GACCTGAAAG GCCCGCAGTT CCTGCATATC
ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCAATCAC CTTCCACGCC
GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGTGG TTTACCAAGC
TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGATAA CAAACTGATG
GCGATTACTC CGGCGATGCG TGAAGGTTCC GGCATGGTCG AGTTTTCACG TAAATTCCCG
GATCGCTATT TTGATGTAGC GATTGCCGAG CAACACGCGG TGACCTTTGC CGCCGGTCTG
GCGATTGGTG GCTACAAACC GATCGTGGCA ATTTACTCCA CCTTCCTGCA ACGCGCCTAT
GATCAGGTGC TGCATGACGT GGCAATTCAA AAACTCCCGG TCCTGTTCGC CATCGACCGC
GCGGGCATTG TTGGTGCTGA CGGTCAAACC CACCAGGGCG CTTTTGATCT CTCTTACCTG
CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG
CTCTATACCG GCTATCACTA TAACGATGGC CCGTCAGCGG TGCGCTACCC GCGTGGCAAC
GCGGTCGGCG TGGAACTGAC GCCGCTGGAA AAACTACCAA TTGGCAAAGG CATTGTGAAG
CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA AGCGGCGAAA
GTCGCCGAAT CGCTGAACGC CACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA
ACGTTAATTC TGGAAATGGC CGCCAGTCAT GAAGCACTGG TTACCGTAGA AGAAAACGCC
ATTATGGGCG GTGCAGGCAG CGGAGTGAAC GAAGTGCTGA TGGCCCATCG TAAACCAGTA
CCCGTGCTGA ACATTGGCCT GCCGGACTTC TTTATTCCGC AAGGAACTCA GGAAGAAATG
CGCGCCGAAC TCGGCCTCGA TGCCGCCGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA
TAA
 
Protein sequence
MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT 
VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV
GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVVLND
NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG
TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA
VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP
DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR
AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN
AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPEAAK VAESLNATLV DMRFVKPLDE
TLILEMAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM
RAELGLDAAG MEAKIKAWLA