Gene EcSMS35_0456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0456 
Symboldxs 
ID6146939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp464346 
End bp466208 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content54% 
IMG OID641615350 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_001742557 
Protein GI170680644 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.927308 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGTTA 
CGACTGTTGC CGAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC
GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC
GTGGCGCTGC ACTATGTCTA CAATACCCCG TTTGACCAGT TAATCTGGGA TGTGGGGCAT
CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG
AAAGGCGGCC TGCACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC
GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAGAAAGAA
GGCAAAAATC GCCGTACCGT CTGTGTCATT GGCGATGGCG CGATTACCGC TGGCATGGCG
TTTGAAGCGA TGAATCACGC GGGCGATATC CGTCCTGATA TGCTGGTGGT TCTCAACGAC
AATGAAATGT CGATTTCCGA AAATGTTGGC GCGCTCAACA ACCATCTGGC GCAACTGCTT
TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTTCCG
CCCATTAAAG AGCTGCTCAA ACGTACCGAA GAACATATTA AAGGCATGGT AGTGCCTGGC
ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TGGACGGTCA CGATGTGCTG
GGGCTTATCA CCACGCTGAA GAACATGCGC GACCTGAAAG GCCCGCAGTT CCTGCATATC
ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCAATCAC TTTCCACGCC
GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGCGG TTTGCCGAGC
TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGACAA CAAGCTGATG
GCGATTACTC CGGCGATGCG CGAAGGTTCT GGCATGGTCG AGTTTTCACG TAAATTCCCG
GATCGCTACT TCGACGTGGC AATCGCCGAG CAACACGCAG TGACCTTTGC TGCGGGTCTG
GCGATTGGTG GCTACAAACC CATTGTCGCG ATCTACTCCA CTTTCCTGCA ACGCGCCTAT
GATCAGGTGC TGCATGACGT GGCGATACAA AAACTCCCGG TTCTGTTCGC CATTGACCGC
GCGGGCATTG TTGGTGCTGA CGGTCAAACC CACCAGGGCG CTTTTGATCT CTCTTACCTG
CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG
CTCTATACCG GCTATCACTA TAACGATGGC CCGTCAGCGG TACGCTACCC GCGTGGCAAC
GCGGTTGGCG TGGAACTGAC GCCGCTGGAA AAACTGCCAA TTGGCAAAGG CATTGTGAAG
CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA AGCGGCGAAA
GTCGCTGAAT CGCTGAACGC TACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA
GCGTTAATTC TGGAAATAGC CGCCAGCCAT GAAGCGCTGG TCACCGTAGA AGAAAACGCC
ATTATGGGCG GCGCAGGCAG CGGAGTGAAC GAAGTGCTGA TGGCCCATCG TAAACCAGTA
CCCGTGCTGA ACATTGGCCT GCCTGACTTC TTTATTCCAC AAGGAACACA GGAAGAAATG
CGCGCCGAAC TCGGCCTCGA TGCCGCCGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA
TAA
 
Protein sequence
MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT 
VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV
GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVVLND
NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG
TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA
VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP
DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR
AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN
AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPEAAK VAESLNATLV DMRFVKPLDE
ALILEIAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM
RAELGLDAAG MEAKIKAWLA