Gene EcolC_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0783 
Symbol 
ID6066654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp839053 
End bp840072 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content51% 
IMG OID641600187 
Producterythrose 4-phosphate dehydrogenase 
Protein accessionYP_001723782 
Protein GI170018828 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01532] D-erythrose-4-phosphate dehydrogenase
[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.176968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.225312 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTAC GCGTAGCGAT AAATGGCTTC GGTCGCATCG GGCGTAATGT GGTTCGTGCT 
TTGTATGAAT CCGGACGCCG GGCGGAAATT ACCGTGGTGG CAATCAACGA ACTGGCGGAT
GCTGCGGGCA TGGCGCATTT GTTGAAATAT GACACCAGCC ATGGCCGTTT TGCATGGGAA
GTACGACAGG AACGCGATCA ACTTTTTGTT GGTGATGACG CCATCCGCGT ATTGCATGAA
CGTTCACTGC AATCGCTCCC CTGGCGTGAA CTTGGCGTTG ATGTAGTCCT CGACTGCACC
GGCGTATATG GCTCCCGCGA GCATGGCGAA GCGCATATTG CCGCCGGGGC CAAAAAAGTG
CTCTTTTCAC ATCCTGGCAG TAACGATCTC GACGCGACCG TTGTTTACGG CGTCAATCAG
GATCAACTTC GTGCGGAACA CCGCATCGTT TCTAACGCTT CCTGTACCAC GAATTGCATA
ATTCCCGTCA TCAAATTGTT AGATGATGCG TACGGTATTG AGTCCGGCAC TGTGACCACA
ATTCACTCCG CCATGCACGA TCAACAGGTT ATTGATGCAT ACCATCCTGA CCTGCGTCGC
ACCCGGGCAG CCAGCCAGTC GATCATTCCG GTCGATACTA AACTGGCCGC CGGTATCACA
CGATTTTTTC CGCAATTTAA CGATCGCTTT GAAGCGATTG CGGTACGTGT GCCAACCATA
AATGTGACGG CAATCGATTT AAGCGTGACG GTGAAAAAAC CTGTAAAAGC CAATGAAGTC
AACCTGTTGC TGCAAAAAGC AGCACAAGGT GCATTTCATG GTATAGTTGA CTATACGGAA
TTGCCGTTGG TCTCTGTAGA TTTTAACCAC GATCCGCACA GTGCCATTGT CGATGGCACC
CAAACCCGGG TCAGTGGCGC ACACCTGATC AAAACGTTGG TCTGGTGCGA TAACGAATGG
GGCTTTGCTA ACCGAATGCT CGACACGACG TTAGCTATGG CTACTGTTGC TTTCAGGTAA
 
Protein sequence
MTVRVAINGF GRIGRNVVRA LYESGRRAEI TVVAINELAD AAGMAHLLKY DTSHGRFAWE 
VRQERDQLFV GDDAIRVLHE RSLQSLPWRE LGVDVVLDCT GVYGSREHGE AHIAAGAKKV
LFSHPGSNDL DATVVYGVNQ DQLRAEHRIV SNASCTTNCI IPVIKLLDDA YGIESGTVTT
IHSAMHDQQV IDAYHPDLRR TRAASQSIIP VDTKLAAGIT RFFPQFNDRF EAIAVRVPTI
NVTAIDLSVT VKKPVKANEV NLLLQKAAQG AFHGIVDYTE LPLVSVDFNH DPHSAIVDGT
QTRVSGAHLI KTLVWCDNEW GFANRMLDTT LAMATVAFR