Gene EcDH1_3189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3189 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3430778 
End bp3432640 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content54% 
IMG OID 
Productdeoxyxylulose-5-phosphate synthase 
Protein accessionACX40815 
Protein GI260450393 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGTTA 
CGACTGTTGC CGAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC
GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC
GTGGCGCTGC ACTATGTCTA CAACACCCCG TTTGACCAAT TGATTTGGGA TGTGGGGCAT
CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG
AAAGGCGGTC TGCACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC
GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAAAAAGAA
GGCAAAAATC GCCGCACCGT CTGTGTCATT GGCGATGGCG CGATTACCGC AGGCATGGCG
TTTGAAGCGA TGAATCACGC GGGCGATATC CGTCCTGATA TGCTGGTGAT TCTCAACGAC
AATGAAATGT CGATTTCCGA AAATGTCGGC GCGCTCAACA ACCATCTGGC ACAGCTGCTT
TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTGCCG
CCAATTAAAG AGCTGCTCAA ACGCACCGAA GAACATATTA AAGGCATGGT AGTGCCTGGC
ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TGGACGGTCA CGATGTGCTG
GGGCTTATCA CCACGCTAAA GAACATGCGC GACCTGAAAG GCCCGCAGTT CCTGCATATC
ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCGATCAC TTTCCACGCC
GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGCGG TTTGCCGAGC
TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGACAA CAAGCTGATG
GCGATTACTC CGGCGATGCG TGAAGGTTCC GGCATGGTCG AGTTTTCACG TAAATTCCCG
GATCGCTACT TCGACGTGGC AATTGCCGAG CAACACGCGG TGACCTTTGC TGCGGGTCTG
GCGATTGGTG GGTACAAACC CATTGTCGCG ATTTACTCCA CTTTCCTGCA ACGCGCCTAT
GATCAGGTGC TGCATGACGT GGCGATTCAA AAGCTTCCGG TCCTGTTCGC CATCGACCGC
GCGGGCATTG TTGGTGCTGA CGGTCAAACC CATCAGGGTG CTTTTGATCT CTCTTACCTG
CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG
CTCTATACCG GCTATCACTA TAACGATGGC CCGTCAGCGG TGCGCTACCC GCGTGGCAAC
GCGGTCGGCG TGGAACTGAC GCCGCTGGAA AAACTACCAA TTGGCAAAGG CATTGTGAAG
CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA AGCGGCGAAA
GTCGCCGAAT CGCTGAACGC CACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA
GCGTTAATTC TGGAAATGGC CGCCAGCCAT GAAGCGCTGG TCACCGTAGA AGAAAACGCC
ATTATGGGCG GCGCAGGCAG CGGCGTGAAC GAAGTGCTGA TGGCCCATCG TAAACCAGTA
CCCGTGCTGA ACATTGGCCT GCCGGACTTC TTTATTCCGC AAGGAACTCA GGAAGAAATG
CGCGCCGAAC TCGGCCTCGA TGCCGCTGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA
TAA
 
Protein sequence
MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT 
VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV
GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVILND
NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG
TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA
VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP
DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR
AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN
AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPEAAK VAESLNATLV DMRFVKPLDE
ALILEMAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM
RAELGLDAAG MEAKIKAWLA