Gene Information |
Locus tag | EcolC_0939 |
Symbol | |
ID | 6068435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1023126 |
End bp | 1023911 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641600347 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001723935 |
Protein GI | 170018981 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
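The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1023126, 1023911)
print(length, protein_length(length))  # 786 261
```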
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.919564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATCG AATCTCTCAA TGCGTTCTCA ATGGATTTTT TCTCACTGAA AGGTAAAACC GCAATTGTTA CCGGTGGGAA TAGCGGTTTA GGCCAGGCAT TTGCCATGGC GTTGGCCAAA GCTGGCGCAA ATATCTTTAT TCCTAGTTTC GTCAAAGATA ACGGCGAAAC AAAGGAAATG ATTGAAAAAC AGGGTGTTGA GGTGGACTTC ATGCAGGTGG ATATCACCGC AGAAGGCGCG CCGCAGAAGA TTATCGCTGC TTGCTGTGAG CGTTTCGGTA CAGTTGATAT TCTGGTAAAC AATGCCGGTA TTTGTAAGCT GAATAAGGTG CTGGACTTCG GTCGTGCCGA CTGGGATCCG ATGATTGATG TGAACCTGAC CGCCGCATTC GAGTTAAGCT ATGAAGCGGC AAAAATTATG ATCCCGCAGA AAAGCGGTAA AATTATTAAT ATCTGTTCAT TGTTCTCTTA CTTAGGTGGA CAATGGTCAC CCGCATATTC TGCCACTAAA CATGCTCTTG CCGGGTTCAC CAAAGCTTAT TGTGATGAAT TAGGTCAATA TAATATTCAG GTAAATGGTA TCGCCCCTGG CTATTATGCA ACAGATATTA CGCTGGCGAC ACGCAGTAAT CCAGAAACCA ATCAGCGCGT TCTTGATCAT ATTCCGGCAA ACCGTTGGGG CGATACTCAG GATTTAATGG GTGCAGCCGT GTTTCTCGCA AGTCAGGCAT CGAATTATGT CAATGGGCAT TTATTAGTGG TCGATGGCGG TTATTTAGTG CGCTAA
|
Protein sequence | MSIESLNAFS MDFFSLKGKT AIVTGGNSGL GQAFAMALAK AGANIFIPSF VKDNGETKEM IEKQGVEVDF MQVDITAEGA PQKIIAACCE RFGTVDILVN NAGICKLNKV LDFGRADWDP MIDVNLTAAF ELSYEAAKIM IPQKSGKIIN ICSLFSYLGG QWSPAYSATK HALAGFTKAY CDELGQYNIQ VNGIAPGYYA TDITLATRSN PETNQRVLDH IPANRWGDTQ DLMGAAVFLA SQASNYVNGH LLVVDGGYLV R
|
| |
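The protein sequence above can be reproduced from the gene sequence using translation table 11. The following is a minimal sketch in plain Python, not the method used by the source database. It assumes the sense codon assignments of table 11 match the standard code (the tables differ in their permitted start codons), and it ignores the spaces that separate the 10-base groups. The name `translate` is a hypothetical helper.

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid string is the standard
# assignment, which translation table 11 shares for all 64 codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGTCAATCG AATCTCTCAA TGCGTTCTCA"))  # MSIESLNAFS
```

Passing the full gene sequence from this record to `translate` should give the listed protein sequence once the grouping spaces are removed from both.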