Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0873 |
Symbol | |
ID | 6064842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 937445 |
End bp | 938206 |
Gene Length | 762 bp |
Protein Length | 253 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641600276 |
Product | 2-deoxy-D-gluconate 3-dehydrogenase |
Protein accession | YP_001723869 |
Protein GI | 170018915 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01832] 2-deoxy-D-gluconate 3-dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTAA GTGCATTTTC TCTCGAAGGT AAAGTTGCGG TCGTCACTGG TTGTGATACT GGACTGGGTC AGGGGATGGC GTTGGGGCTG GCGCAAGCGG GCTGTGACAT TGTTGGCATT AACATCGTTG AACCGACTGA AACCATCGAG CAGGTCACAG CGCTGGGGCG TCGTTTTTTA AGCCTGACCG CCGATCTGCG AAAGATTGAT GGTATTCCAG CACTGCTGGA TCGCGCGGTA GCGGAGTTTG GTCATATTGA TATCCTGGTG AATAACGCCG GATTGATTCG CCGCGAAGAT GCTCTCGAGT TCAGCGAAAA GGACTGGGAC GATGTCATGA ACCTGAATAT CAAGAGCGTA TTCTTCATGT CTCAGGCAGC GGCGAAACAC TTTATCGCGC AAGGCAATGG CGGCAAGATT ATCAATATCG CGTCAATGCT CTCCTTCCAG GGCGGGATCC GTGTGCCTTC TTATACCGCA TCAAAAAGCG GCGTGATGGG TGTGACGCGA TTGATGGCGA ACGAATGGGC TAAACACAAC ATTAATGTTA ATGCGATAGC CCCGGGTTAC ATGGCGACCA ACAATACTCA ACAACTACGG GCAGATGAAC AACGTAGCGC GGAAATTCTC GACCGCATTC CAGCTGGTCG TTGGGGACTG CCGAGTGACC TGATGGGGCC GATAGTGTTC CTTGCCTCCA GCGCTTCAGA TTATGTGAAT GGTTATACCA TTGCCGTGGA TGGCGGTTGG CTGGCGCGTT AA
|
Protein sequence | MILSAFSLEG KVAVVTGCDT GLGQGMALGL AQAGCDIVGI NIVEPTETIE QVTALGRRFL SLTADLRKID GIPALLDRAV AEFGHIDILV NNAGLIRRED ALEFSEKDWD DVMNLNIKSV FFMSQAAAKH FIAQGNGGKI INIASMLSFQ GGIRVPSYTA SKSGVMGVTR LMANEWAKHN INVNAIAPGY MATNNTQQLR ADEQRSAEIL DRIPAGRWGL PSDLMGPIVF LASSASDYVN GYTIAVDGGW LAR
|
| |