Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2902 |
Symbol | |
ID | 6143955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2973387 |
End bp | 2974172 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641617771 |
Product | short chain dehydrogenase/reductase family oxidoreductase |
Protein accession | YP_001744926 |
Protein GI | 170680420 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.773254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATCG AATCTCTCAA TGCGTTCTCA ATGGATTTTT TCTCCCTGAA AGGTAAAACA GCAATTGTTA CCGGCGGGAA TAGCGGTTTA GGCCAGGCAT TTGCCATGGC GTTGGCCAAA GCTGGCGCAA ATGTCTTTAT CCCTAGCTTC GTCAAAGATA ACGGCGAAAC AAAGGAAATG ATTGAAAAAC AGGGTGTCGA AGTTGATTTT ATGCAGGTGG ATATCACCGC AGAAGGCGCG CCGCAGAAGA TTATCGCTGC CTGCTGTGAG CGCTTCGGTA CCGTAGATAT TCTGGTTAAC AATGCCGGTA TTTGTAAGCT GAATAAGGTG CTGGACTTTG GTCGTGCCGA CTGGGACCCG ATGATTGACG TGAACCTGAC CGCCGCATTC GAGTTAAGCT ATGAAGCTGC AAAAATTATG ATCCCGCAGA AAAGCGGTAA AATCATTAAT ATCTGTTCAT TATTCTCTTA CTTAGGTGGA CAATGGTCAC CTGCATATTC TGCCACTAAA CATGCTCTTG CCGGATTCAC CAAAGCTTAT TGTGATGAAC TAGGTCAATA TAATATTCAG GTAAATGGTA TTGCCCCTGG ATATTACGCT ACCGATATCA CCCTGGCGAC ACGCAGTAAT CCAGAAACCA ATCAGCGCGT TCTTGATCAT ATTCCAGCAA ACCGTTGGGG CGATACTCAG GATTTAATGG GCACAGCCAT ATTTCTCGCA AGTCCGGCAT CGAATTATGT CAACGGACAT TTATTAGTGG TTGATGGCGG TTATTTAGTG CGCTAA
|
Protein sequence | MSIESLNAFS MDFFSLKGKT AIVTGGNSGL GQAFAMALAK AGANVFIPSF VKDNGETKEM IEKQGVEVDF MQVDITAEGA PQKIIAACCE RFGTVDILVN NAGICKLNKV LDFGRADWDP MIDVNLTAAF ELSYEAAKIM IPQKSGKIIN ICSLFSYLGG QWSPAYSATK HALAGFTKAY CDELGQYNIQ VNGIAPGYYA TDITLATRSN PETNQRVLDH IPANRWGDTQ DLMGTAIFLA SPASNYVNGH LLVVDGGYLV R
|
| |