Gene EcHS_A3085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3085 
Symbolepd 
ID5593969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3100183 
End bp3101202 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content51% 
IMG OID640922204 
Producterythrose 4-phosphate dehydrogenase 
Protein accessionYP_001459704 
Protein GI157162386 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01532] D-erythrose-4-phosphate dehydrogenase
[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value0.986701 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTAC GCGTAGCGAT AAATGGCTTC GGTCGCATCG GGCGTAATGT GGTTCGTGCT 
TTGTATGAAT CCGGACGCCG GGCGGAAATT ACCGTGGTGG CAATCAACGA ACTGGCGGAT
GCTGCGGGCA TGGCGCATTT GTTGAAATAT GACACCAGCC ATGGCCGTTT TGCATGGGAA
GTACGACAGG AACGCGATCA ACTTTTTGTT GGTGATGACG CCATCCGCGT ATTGCATGAA
CGTTCACTGC AATCGCTCCC CTGGCGTGAA CTTGGCGTTG ATGTAGTCCT CGACTGCACC
GGCGTATATG GCTCCCGCGA GCATGGCGAA GCGCATATTG CCGCCGGGGC CAAAAAAGTG
CTCTTTTCAC ATCCTGGCAG TAACGATCTC GACGCGACCG TTGTTTACGG CGTCAATCAG
GATCAACTTC GTGCGGAACA CCGCATCGTT TCTAACGCTT CCTGTACCAC GAATTGCATA
ATTCCCGTCA TCAAATTGTT AGATGATGCG TACGGTATTG AGTCCGGCAC TGTGACCACA
ATTCACTCCG CCATGCACGA TCAACAGGTT ATTGATGCAT ACCATCCTGA CCTGCGTCGC
ACCCGGGCAG CCAGCCAGTC GATCATTCCG GTCGATACTA AACTGGCCGC CGGTATCACA
CGATTTTTTC CGCAATTTAA CGATCGCTTT GAAGCGATTG CGGTACGTGT GCCAACCATA
AATGTGACGG CAATCGATTT AAGCGTGACG GTGAAGAAAC CTGTAAAAGC CAATGAAGTC
AACCTGTTGC TGCAAAAAGC AGCACAAGGT GCATTTCATG GTATAGTTGA CTATACGGAA
TTGCCGTTGG TCTCTGTAGA TTTTAACCAC GATCCGCACA GTGCCATTGT CGATGGCACC
CAAACCCGGG TCAGTGGCGC ACACCTGATC AAAACGTTGG TCTGGTGCGA TAACGAATGG
GGCTTTGCTA ACCGAATGCT CGACACGACG TTAGCTATGG CTACTGTTGC TTTCAGGTAA
 
Protein sequence
MTVRVAINGF GRIGRNVVRA LYESGRRAEI TVVAINELAD AAGMAHLLKY DTSHGRFAWE 
VRQERDQLFV GDDAIRVLHE RSLQSLPWRE LGVDVVLDCT GVYGSREHGE AHIAAGAKKV
LFSHPGSNDL DATVVYGVNQ DQLRAEHRIV SNASCTTNCI IPVIKLLDDA YGIESGTVTT
IHSAMHDQQV IDAYHPDLRR TRAASQSIIP VDTKLAAGIT RFFPQFNDRF EAIAVRVPTI
NVTAIDLSVT VKKPVKANEV NLLLQKAAQG AFHGIVDYTE LPLVSVDFNH DPHSAIVDGT
QTRVSGAHLI KTLVWCDNEW GFANRMLDTT LAMATVAFR