Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2990 |
Symbol | kduD |
ID | 6145401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3071130 |
End bp | 3071891 |
Gene Length | 762 bp |
Protein Length | 253 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617859 |
Product | 2-deoxy-D-gluconate 3-dehydrogenase |
Protein accession | YP_001745011 |
Protein GI | 170681878 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01832] 2-deoxy-D-gluconate 3-dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.916776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.048185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTAA GTGCATTTTC TCTCGAAGGT AAAGTTGCGG TCGTCACTGG TTGTGATACT GGACTGGGCC AGGGGATGGC GTTGGGGCTG GCGCAAGCGG GCTGTGACAT TGTTGGCATT AACATCGTTG AACCGACTGA AACCATCAAG CAGGTCACGG CGCTGGGGCG TCGTTTTTTA AGCCTGACCG CCGATCTGCG AAAGATTGAT GGCATTCCAG CACTGCTGGA TCGCGCGGTA GCGGAGTTTG GTCATATTGA TATCCTGGTG AATAACGCCG GATTGATTCG GCGCGAAGAT GCTCTCGAGT TCAGCGAAAA AGACTGGGAC GATGTCATGA ACCTGAATAT CAAGAGCGTA TTCTTCATGT CTCAGGCAGC GGCGAAACAC TTTATCGCGC AAGGCAATGG CGGCAAGATT ATCAATATCG CGTCAATGCT CTCCTTCCAG GGCGGGATCC GTGTGCCTTC TTATACCGCA TCAAAAAGCG GCGTGATGGG CGTGACACGA TTGATGGCGA ATGAATGGGC TAAACACAAC ATTAATGTTA ATGCGATAGC TCCGGGTTAC ATGGCAACCA ACAATACTCA ACAACTACGG GCAGATGAAC AACGTAGCGC GGAAATTCTC GACCGCATTC CAGCTGGCCG TTGGGGACTG CCGAGTGACC TGATGGGGCC GGTAGTGTTT CTTGCCTCCA GCGCTTCAGA TTATGTAAAT GGTTATACCA TTGCTGTGGA TGGTGGTTGG CTGGCGCGTT AA
|
Protein sequence | MILSAFSLEG KVAVVTGCDT GLGQGMALGL AQAGCDIVGI NIVEPTETIK QVTALGRRFL SLTADLRKID GIPALLDRAV AEFGHIDILV NNAGLIRRED ALEFSEKDWD DVMNLNIKSV FFMSQAAAKH FIAQGNGGKI INIASMLSFQ GGIRVPSYTA SKSGVMGVTR LMANEWAKHN INVNAIAPGY MATNNTQQLR ADEQRSAEIL DRIPAGRWGL PSDLMGPVVF LASSASDYVN GYTIAVDGGW LAR
|
| |