Gene EcSMS35_3064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3064 
Symbolepd 
ID6145352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3154515 
End bp3155534 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content51% 
IMG OID641617933 
Producterythrose 4-phosphate dehydrogenase 
Protein accessionYP_001745084 
Protein GI170679808 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01532] D-erythrose-4-phosphate dehydrogenase
[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.835732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0736192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTAC GCGTAGCGAT AAATGGCTTC GGTCGCATCG GGCGTAATGT GGTTCGTGCT 
TTGTATGAAT CCGGACGCCG GGCGGAAATT ACCGTGGTGG CAATCAACGA ACTGGCGGAT
GCTGCGGGCA TGGCGCATTT GTTGAAATAT GACACCAGCC ATGGCCGTTT TGCATGGGAA
GTACGACAGG AACGCGATCA ACTTTTTGTT GGTGATGACG CCATCCGCGT ATTGCATGAA
CGTTCACTGC AATCGCTCCC CTGGCGTGAA CTTGGCGTTG ATGTAGTCCT CGACTGCACC
GGCGTATATG GCTCCCGCGA GCATGGCGAA GCACATATTG CCGCCGGGGC TAAAAAAGTG
CTCTTTTCAC ATCCTGGCAG TAACGATCTC GATGCGACCG TTGTTTACGG CGTCAATCAG
GATCAACTTC GTGCGGAACA CCGCATCGTT TCTAACGCTT CCTGTACCAC GAATTGCATA
ATTCCCGTCA TCAAATTGTT AGATGATGCG TACGGTATTG AGTCCGGCAC TGTGACCACA
ATTCACTCCG CCATGCACGA TCAACAGGTT ATTGATGCAT ACCATCCTGA CCTGCGTCGC
ACCCGGGCAG CCAGCCAGTC GATCATTCCG GTCGATACTA AACTGGCCGC CGGTATCACA
CGATTTTTTC CGCAATTTAA CGATCGCTTT GAAGCGATTG CGGTACGTGT GCCAACCATA
AATGTGACGG CAATCGATTT AAGCGTGACG GTGAAGAAAC CTGTAAAAGC CAATGAAGTC
AACCTGTTGC TGCAAAAAGC AGCACAAGGT GCATTTCATG GTATAGTTGA CTATACGGAA
TTGCCGTTGG TCTCTGTAGA TTTTAACCAC GATCCGCACA GTGCCATTGT CGATGGCACC
CAAACCCGGG TCAGTGGCGC ACACCTGATC AAAACGTTGG TCTGGTGCGA TAACGAATGG
GGCTTTGCTA ACCGAATGCT CGACACGACG TTAGCTATGG CTACTGTTGC TTTCAGGTAA
 
Protein sequence
MTVRVAINGF GRIGRNVVRA LYESGRRAEI TVVAINELAD AAGMAHLLKY DTSHGRFAWE 
VRQERDQLFV GDDAIRVLHE RSLQSLPWRE LGVDVVLDCT GVYGSREHGE AHIAAGAKKV
LFSHPGSNDL DATVVYGVNQ DQLRAEHRIV SNASCTTNCI IPVIKLLDDA YGIESGTVTT
IHSAMHDQQV IDAYHPDLRR TRAASQSIIP VDTKLAAGIT RFFPQFNDRF EAIAVRVPTI
NVTAIDLSVT VKKPVKANEV NLLLQKAAQG AFHGIVDYTE LPLVSVDFNH DPHSAIVDGT
QTRVSGAHLI KTLVWCDNEW GFANRMLDTT LAMATVAFR