Gene EcHS_A0491 details

Gene Information

Locus tag: EcHS_A0491
Symbol: dxs
ID: 5593091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 502107
End bp: 503969
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 55%
IMG OID: 640919674
Product: 1-deoxy-D-xylulose-5-phosphate synthase
Protein accession: YP_001457259
Protein GI: 157159941
COG category: [H] Coenzyme transport and metabolism
              [I] Lipid transport and metabolism
COG ID: [COG1154] Deoxyxylulose-5-phosphate synthase
TIGRFAM ID: [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase
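
The coordinate and length fields in this record are mutually consistent, which a short calculation can confirm. The following is an illustrative Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(502107, 503969)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1863 620
```

The result matches the Gene Length (1863 bp) and Protein Length (620 aa) fields.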


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTG ATATTGCCAA ATACCCGACC CTGGCACTGG TCGACTCCAC CCAGGAGTTA 
CGACTGTTGC CGAAAGAGAG TTTACCGAAA CTCTGCGACG AACTGCGCCG CTATTTACTC
GACAGCGTGA GCCGTTCCAG CGGGCACTTC GCCTCCGGGC TGGGCACGGT CGAACTGACC
GTGGCGCTGC ACTATGTCTA CAACACCCCG TTTGACCAAT TGATTTGGGA TGTGGGGCAT
CAGGCTTATC CGCATAAAAT TTTGACCGGA CGCCGCGACA AAATCGGCAC CATCCGTCAG
AAAGGCGGCC TGCACCCGTT CCCGTGGCGC GGCGAAAGCG AATATGACGT ATTAAGCGTC
GGGCATTCAT CAACCTCCAT CAGTGCCGGA ATTGGTATTG CGGTTGCTGC CGAGAAAGAA
GGCAAAAATC GCCGTACCGT CTGTGTAATC GGCGATGGCG CGATTACCGC AGGCATGGCG
TTTGAAGCGA TGAATCACGC GGGCGATATC CGTCCTGATA TGCTGGTGGT CCTCAACGAC
AACGAAATGT CGATTTCCGA AAATGTCGGC GCGCTCAACA ACCATCTGGC GCAGCTGCTT
TCCGGTAAGC TTTACTCTTC ACTGCGCGAA GGCGGGAAAA AAGTTTTCTC TGGCGTTCCG
CCAATTAAAG AGCTGCTCAA ACGTACCGAA GAACATATTA AAGGCATGGT AGTGCCTGGC
ACGTTGTTTG AAGAGCTGGG CTTTAACTAC ATCGGCCCGG TTGACGGTCA CGATGTGCTG
GGGCTTATCA CCACGCTGAA GAACATGCGC GACCTGAAAG GCCCGCAGTT CCTGCATATC
ATGACCAAAA AAGGTCGTGG TTATGAACCG GCAGAAAAAG ACCCCATCAC TTTCCACGCC
GTGCCTAAAT TTGATCCCTC CAGCGGTTGT TTGCCGAAAA GTAGCGGCGG TTTGCCGAGC
TATTCAAAAA TCTTTGGCGA CTGGTTGTGC GAAACGGCAG CGAAAGACAA CAAGCTGATG
GCGATTACTC CGGCGATGCG TGAAGGTTCC GGCATGGTCG AGTTTTCACG TAAATTCCCG
GATCGTTACT TCGACGTGGC AATCGCCGAG CAACACGCGG TGACCTTTGC CGCCGGTCTG
GCGATTGGTG GGTACAAACC CATTGTCGCG ATTTACTCCA CTTTCCTGCA ACGCGCCTAT
GATCAGGTGC TGCATGACGT GGCGATTCAA AAGCTCCCGG TCCTGTTCGC CATCGACCGC
GCGGGCATTG TTGGTGCTGA CGGTCAAACC CATCAGGGCG CTTTTGACCT CTCTTACCTG
CGCTGCATAC CGGAAATGGT CATTATGACC CCGAGCGATG AAAACGAATG TCGCCAGATG
CTCTATACCG GCTATCACTA TAACGACGGC CCGTCCGCGG TGCGCTACCC GCGCGGTAAC
GCGGTTGGCG TGGAACTGAC GCCGCTGGAA AAACTGCCAA TTGGCAAAGG CATTGTGAAG
CGTCGTGGCG AGAAACTGGC GATCCTTAAC TTTGGTACGC TGATGCCAGA CGCGGCGAAA
GTCGCTGAAT CGCTGAACGC TACGCTGGTC GATATGCGTT TTGTGAAACC GCTTGATGAA
GCGTTAATTC TGGAAATGGC CGCCAGCCAT GAAGCGCTGG TCACCGTAGA AGAAAACGCC
ATTATGGGCG GCGCAGGCAG CGGCGTGAAC GAAGTGCTAA TGGCCCATCG TAAACCAGTA
CCCGTGCTGA ACATTGGCCT GCCTGACTTC TTTATTCCGC AAGGAACTCA GGAAGAAATG
CGCGCCGAAC TCGGCCTCGA TGCCGCCGGT ATGGAAGCCA AAATCAAGGC CTGGCTGGCA
TAA
 
Protein sequence
MSFDIAKYPT LALVDSTQEL RLLPKESLPK LCDELRRYLL DSVSRSSGHF ASGLGTVELT 
VALHYVYNTP FDQLIWDVGH QAYPHKILTG RRDKIGTIRQ KGGLHPFPWR GESEYDVLSV
GHSSTSISAG IGIAVAAEKE GKNRRTVCVI GDGAITAGMA FEAMNHAGDI RPDMLVVLND
NEMSISENVG ALNNHLAQLL SGKLYSSLRE GGKKVFSGVP PIKELLKRTE EHIKGMVVPG
TLFEELGFNY IGPVDGHDVL GLITTLKNMR DLKGPQFLHI MTKKGRGYEP AEKDPITFHA
VPKFDPSSGC LPKSSGGLPS YSKIFGDWLC ETAAKDNKLM AITPAMREGS GMVEFSRKFP
DRYFDVAIAE QHAVTFAAGL AIGGYKPIVA IYSTFLQRAY DQVLHDVAIQ KLPVLFAIDR
AGIVGADGQT HQGAFDLSYL RCIPEMVIMT PSDENECRQM LYTGYHYNDG PSAVRYPRGN
AVGVELTPLE KLPIGKGIVK RRGEKLAILN FGTLMPDAAK VAESLNATLV DMRFVKPLDE
ALILEMAASH EALVTVEENA IMGGAGSGVN EVLMAHRKPV PVLNIGLPDF FIPQGTQEEM
RAELGLDAAG MEAKIKAWLA
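
The protein sequence follows from the gene sequence through the codon assignments of translation table 11. The following is a minimal Python sketch of that relationship, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand, ignores the whitespace used to group bases in blocks of ten, and stops at the first stop codon. Table 11 also permits alternative start codons that are read as methionine; that case is not handled here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAGTTTTG ATATTGCCAA ATACCCGACC"))  # prints: MSFDIAKYPT
```

The output matches the first ten residues of the protein sequence listed in this record.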