Gene ECH74115_4111 details


Gene Information

Locus tag: ECH74115_4111
Symbol: kduD
ID: 6971538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3809295
End bp: 3810056
Gene Length: 762 bp
Protein Length: 253 aa
Translation table: 11
GC content: 52%
IMG OID: 643387866
Product: 2-deoxy-D-gluconate 3-dehydrogenase
Protein accession: YP_002272306
Protein GI: 209400700
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID: [TIGR01832] 2-deoxy-D-gluconate 3-dehydrogenase
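
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly.

```python
# Values copied from the record above.
start_bp = 3809295
end_bp = 3810056
gene_length_bp = 762
protein_length_aa = 253

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop
# codon and does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
encoded_residues = codons - 1

print(span_bp == gene_length_bp)              # True
print(remainder == 0)                         # True
print(encoded_residues == protein_length_aa)  # True
```

Under that assumption the three fields agree: 762 bases give 254 codons, one of which is the stop codon, leaving 253 residues.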


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.00000566262
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATTTTAA GTGCATTTTC TCTCGAAGGT AAAGTTGCGG TCGTCACTGG TTGTGATACT 
GGGCTGGGCC AGGGGATGGC GTTGGGGCTG GCGCAAGCGG GCTGTGACAT TGTTGGCATT
AACATCGTTG AACCGACTGA AACCATCAAG CAGGTCACGG CGCTGGGGCG TCGTTTTTTA
AGCCTGACCG CCGATCTGCG AAAGATTGAT GGTATTCCTG GACTGCTGGA TCGCGCGGTA
GCTGAGTTTG GTCATATTGA TATCCTGGTG AATAACGCCG GATTGATTCG CCGCGAAGAT
GCTCTCGAGT TCAGCGAAAA GGACTGGGAC GATGTCATGA ACCTGAATAT CAAGAGCGTA
TTCTTCATGT CTCAGGCAGC GGCGAAACAC TTCATCGCGC AAGGCAATGG CGGCAAGATT
ATCAATATCG CGTCAATGCT CTCCTTCCAG GGCGGGATCC GTGTGCCTTC TTATACCGCA
TCAAAAAGCG GCGTGATGGG CGTGACGCGA TTGATGGCGA ATGAATGGGC TAAACACAAC
ATTAATGTTA ATGCGATAGC TCCGGGTTAC ATGGCGACCA ACAATACCCA ACAACTGCGG
GCAGATGAAC AACGTAGCGC GGAAATTCTC GACCGCATTC CAGCTGGCCG TTGGGGACTG
CCGAGTGACC TGATGGGGCC GGTAGTGTTT CTTGCCTCCA GCGCTTCAGA TTATGTAAAT
GGTTATACCA TTGCTGTGGA TGGCGGTTGG CTGGCGCGTT AA
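
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs cleaning before any calculation. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_percent` are not part of this database) strip the layout whitespace and compute a GC percentage of the kind reported in the record's GC content field. How the database itself rounds that figure is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence above, kept in its 10-base block layout.
first_line = "ATGATTTTAA GTGCATTTTC TCTCGAAGGT AAAGTTGCGG TCGTCACTGG TTGTGATACT"
bases = clean_sequence(first_line)

print(len(bases))         # 60 bases once the layout whitespace is removed
print(gc_percent(bases))  # GC percentage of this first line only
```

Passing the full 762-base sequence through the same two functions gives a value to compare with the GC content field.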
 
Protein sequence
MILSAFSLEG KVAVVTGCDT GLGQGMALGL AQAGCDIVGI NIVEPTETIK QVTALGRRFL 
SLTADLRKID GIPGLLDRAV AEFGHIDILV NNAGLIRRED ALEFSEKDWD DVMNLNIKSV
FFMSQAAAKH FIAQGNGGKI INIASMLSFQ GGIRVPSYTA SKSGVMGVTR LMANEWAKHN
INVNAIAPGY MATNNTQQLR ADEQRSAEIL DRIPAGRWGL PSDLMGPVVF LASSASDYVN
GYTIAVDGGW LAR
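
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a hand-rolled illustration, not the database's own method; `CODON_TABLE` and `translate` are hypothetical names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. This gene starts with ATG, so no special start-codon handling is included.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the 10-base block spacing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 12 bases of the gene sequence above.
print(translate("ATGATTTTAA GTGCATTTTC TCTCGAAGGT"))  # MILSAFSLEG
print(translate("CTGGCGCGTT AA"))                     # LAR
```

The two fragments match the first ten residues and the last three residues of the protein sequence listed above.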