Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2188 |
Symbol | |
ID | 8416510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2568151 |
End bp | 2569029 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645025174 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_003182539 |
Protein GI | 257791933 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.907948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAAC TCGTAGCATT GGAAACCCAA GCCCCGAAAG CGGGCTATGC CATGGATGAC ACGCGCGCGA ACGTCGATCC CAAGCCGCAG TTGGAAAGCG ACCTCTACCG GGCAGCAGGC AAGCTGGACG GCAAGGCGGC CCTCGTCACC GGCGGCGACA GCGGCATCGG CGCGGCCGTG GCCATCGCGT TCGCCAAAGA GGGCGCCGAC GTGGCGATCG CCTACTACTC GTCCGACGAC GACGCGCAGG CGGTGGCCGA GCGCATCCGC GAACTGGGAC GGCGCGCGCT CGTGTTCAAG GGCGACGTAG GCGACGAGGC GTTCTGCCGC GATATGGTCG AGGGGATCGC GGCCGAATGG GGTCGCCTCG ACGTGCTCGT GAACAACGCC GGAGAGCAGA CACCGGCCGA GAGCATCCTC GACTTGACCC AAGAGCAGCT GGTCCGCACG TTCCAGACCA ACATCTTCAG CATGTTCTAC CTGGTGAAAG CCGCACTCCC CCACCTGCCC GAGGGAGGCG CCATCGTGAA CACCACGTCG GTGACCGCCT ATCAGGGCTC GCCGAACCTG CTGGACTACT CGGCCACGAA GGGGGCCATC ACCGCGTTCA CCCGCTCGCT CTCCGAGAAC GAAGACCTCG TGAGCCGCCG CATCCGCGTG AACGGCGTGG CGCCCGGGCC TATCTGGACG CCGCTGAACC CCGCCTCGTA CGGGCTCGAC AGCGACAAGG TGAAGCACTT CGGCGAGTCG ACGCCCATGG GGCGCCCCGG CCAGCCCTAC GAGCTGGCGC CCGCCTACGT CTTTCTGGCC AGCGACGATT CCAGCTACGT CTCCGGCCAG GTGATCCACG TGAACGGCGG CACCGTGGTG AACGGGTAG
|
Protein sequence | MSELVALETQ APKAGYAMDD TRANVDPKPQ LESDLYRAAG KLDGKAALVT GGDSGIGAAV AIAFAKEGAD VAIAYYSSDD DAQAVAERIR ELGRRALVFK GDVGDEAFCR DMVEGIAAEW GRLDVLVNNA GEQTPAESIL DLTQEQLVRT FQTNIFSMFY LVKAALPHLP EGGAIVNTTS VTAYQGSPNL LDYSATKGAI TAFTRSLSEN EDLVSRRIRV NGVAPGPIWT PLNPASYGLD SDKVKHFGES TPMGRPGQPY ELAPAYVFLA SDDSSYVSGQ VIHVNGGTVV NG
|
| |