Gene EcolC_3213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3213 
Symbol 
ID6066691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3519470 
End bp3521332 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content54% 
IMG OID641602628 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_001726162 
Protein GI170021208 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000252771 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGTTA 
CGACTGTTGC CGAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC
GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC
GTGGCGCTGC ACTATGTCTA CAACACCCCG TTTGACCAAT TGATTTGGGA TGTGGGGCAT
CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG
AAAGGCGGCC TGCACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC
GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAGAAAGAA
GGCAAAAATC GCCGTACCGT CTGTGTAATC GGCGATGGCG CGATTACCGC AGGCATGGCG
TTTGAAGCGA TGAATCACGC GGGCGATATC CGTCCTGATA TGCTGGTGGT CCTCAACGAC
AACGAAATGT CGATTTCCGA AAATGTCGGC GCGCTCAACA ACCATCTGGC GCAGCTGCTT
TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTGCCA
CCCATTAAAG AGCTACTCAA ACGTACCGAA GAACATATTA AAGGCATGGT AGTCCCTGGC
ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TTGACGGTCA CGATGTGCTG
GGGCTTATCA CCACGCTGAA GAACATGCGT GACCTGAAAG GCCCGCAGTT CCTGCATATC
ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCAATCAC TTTCCACGCC
GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGCGG TTTGCCGAGC
TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGACAA CAAGCTGATG
GCGATTACTC CGGCGATGCG TGAAGGTTCC GGTATGGTCG AGTTTTCTCG TAAATTCCCG
GATCGCTATT TTGATGTAGC GATTGCCGAG CAACACGCAG TGACCTTTGC TGCGGGACTG
GCCATTGGTG GATACAAACC CATTGTCGCG ATCTACTCCA CCTTCCTGCA ACGCGCCTAT
GATCAGGTGC TGCATGACGT GGCAATTCAA AAACTCCCGG TCCTGTTCGC CATCGACCGC
GCGGGCATTG TTGGTGCTGA CGGTCAAACC CATCAGGGTG CTTTTGATCT CTCTTACCTG
CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG
CTCTATACCG GCTATCACTA TAACGATGGC CCGTCAGCGG TGCGCTACCC GCGTGGCAAC
GCGGTAGGCG TGGAGCTGAC GCCGCTGGAA AAACTGCCAA TTGGCAAAGG CATTGTGAAG
CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA CGCGGCGAAA
GTCGCTGAAT CGCTGAACGC CACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA
GCGTTAATTC TGGAAATGGC CGCCAGCCAT GAAGCGCTGG TCACCGTAGA AGAAAACGCC
ATTATGGGCG GCGCAGGTAG CGGAGTGAAC GAAGTGCTGA TGGCCCATCG TAAACCAGTA
CCCGTGCTGA ACATTGGTCT GCCGGACTTC TTTATTCCGC AAGGAACTCA GGAAGAAATG
CGCGCCGAAC TCGGCCTCGA TGCCGCCGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA
TAA
 
Protein sequence
MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT 
VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV
GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVVLND
NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG
TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA
VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP
DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR
AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN
AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPDAAK VAESLNATLV DMRFVKPLDE
ALILEMAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM
RAELGLDAAG MEAKIKAWLA