Gene EcSMS35_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0184 
Symboldxr 
ID6142607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp202703 
End bp203899 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content53% 
IMG OID641615085 
Product1-deoxy-D-xylulose 5-phosphate reductoisomerase 
Protein accessionYP_001742301 
Protein GI170683853 
COG category[I] Lipid transport and metabolism 
COG ID[COG0743] 1-deoxy-D-xylulose 5-phosphate reductoisomerase 
TIGRFAM ID[TIGR00243] 1-deoxy-D-xylulose 5-phosphate reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.103209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAC TCACCATTCT GGGCTCGACC GGCTCGATTG GTTGCAGCAC GCTGGACGTG 
GTGCGCCATA ATCCCGAACA CTTCCGCGTA GTTGCGCTGG TGGCAGGCAA AAATGTCACT
CGCATGGTAG AACAGTGCCT GGAATTCTCT CCCCGCTATG CCGTAATGGA CGATGAAGCG
AGTGCGAAAC TTCTTAAAAC GATGCTACAG CAACAGGGTA GCCGCACCGA AGTCTTAAGT
GGGCAACAAG CCGCTTGCGA TATGGCAGCG CTTGAGGATG TTGATCAGGT GATGGCAGCC
ATTGTTGGCG CTGCTGGGCT GTTACCTACG CTTGCTGCGA TCCGCGCGGG TAAAACCATT
TTGCTGGCCA ATAAAGAATC ACTGGTTACC TGCGGACGTC TGTTTATGGA CGCCGTAAAG
CAGAGCAAAG CGCAATTGTT ACCGGTCGAT AGCGAACATA ACGCCATTTT TCAGAGTTTA
CCGCAACCTA TCCAGCATAA TCTGGGATAC GCTGACCTTG AGCAAAATGG CGTGGTGTCC
ATTTTACTTA CCGGGTCTGG TGGCCCTTTC CGTGAGACGC CATTGCGCGA TTTGGCAACA
ATGACGCCGG ATCAAGCCTG CCGTCATCCG AACTGGTCGA TGGGGCGTAA AATTTCTGTC
GATTCGGCTA CCATGATGAA TAAAGGTCTG GAATACATTG AAGCGCGTTG GCTGTTTAAC
GCCAGCGCCA GCCAGATGGA AGTGCTGATT CACCCGCAGT CAGTGATTCA CTCAATGGTG
CGCTATCAGG ACGGCAGTGT TCTGGCGCAG CTGGGGGAAC CGGATATGCG TACGCCAATT
GCCCACACCA TGGCATGGCC GAATCGCGTG AACTCTGGCG TGAAGCCGCT CGATTTTTGC
AAACTAAGTG CGTTGACATT TGCCGCACCG GATTATGATC GTTATCCATG CCTGAAACTG
GCGATGGAGG CGTTCGAACA AGGCCAGGCA GCGACGACAG CATTGAATGC CGCAAACGAA
ATCACCGTTG CTGCTTTTCT TGCGCAACAA ATCCGCTTTA CGGATATCGC CGCGTTGAAT
TTATCCGTAC TGGAAAAAAT GGATATGCGC GAACCACAAT GTGTGGACGA TGTGTTATCT
GTTGATGCGA ACGCGCGTGA AGTCGCCAGA AAAGAGGTGA TGCGTCTCGC AAGCTGA
 
Protein sequence
MKQLTILGST GSIGCSTLDV VRHNPEHFRV VALVAGKNVT RMVEQCLEFS PRYAVMDDEA 
SAKLLKTMLQ QQGSRTEVLS GQQAACDMAA LEDVDQVMAA IVGAAGLLPT LAAIRAGKTI
LLANKESLVT CGRLFMDAVK QSKAQLLPVD SEHNAIFQSL PQPIQHNLGY ADLEQNGVVS
ILLTGSGGPF RETPLRDLAT MTPDQACRHP NWSMGRKISV DSATMMNKGL EYIEARWLFN
ASASQMEVLI HPQSVIHSMV RYQDGSVLAQ LGEPDMRTPI AHTMAWPNRV NSGVKPLDFC
KLSALTFAAP DYDRYPCLKL AMEAFEQGQA ATTALNAANE ITVAAFLAQQ IRFTDIAALN
LSVLEKMDMR EPQCVDDVLS VDANAREVAR KEVMRLAS