Gene Information |
Locus tag | Elen_0666 |
Symbol | |
ID | 8414956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 842835 |
End bp | 843866 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023640 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_003181037 |
Protein GI | 257790431 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
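The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(842835, 843866)
    print(length, protein_length(length))
```

With the Start bp and End bp values from this record, the result matches the listed Gene Length (1032 bp) and Protein Length (343 aa).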
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.019294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.000464867 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCACAAA CCGCATTGGT CACCGGCGCG ACGAGCGGCA TCGGAGAGGC CCTCTGCCTG CTGTTCGCCA GCGACGACTT CGATCTGGTC ACCGTCGCAC GCGACAAGGA GGCTCTGGAA AAGCAAGCCG ACGAGCTGCG CAGGCTCGAC ATCGATGTGC TTCCCATCGC CTGCGACCTC TCCGACCCCG ACGCTGCGCG CACCATCTTC AAGCGCGTCC AGGAGGCGGG CAAGGAGATC GAGATCCTCG TCAACGACGC GGGCTACAGC CCCGCCGGCC AGTTCTCCGA CCTGCCCATC GCCGACATCC GCTCGATGAT CCAGGTCAGC GTCACCAGCC TCGCCGAGCT GACCAGCGTG TTCCTGCATC CCATGCTGGA GCGCGGCCAC GGCCGCATCC TCAACATGAG CTCGATGATG GCGAAAACGC CGTGCCCCTA CAACGCGCTG TACGGCGCGG CGAAGGTGTT CGTGCTGTCG TTCTCAACCG CGCTGGCGCG CGAGCTCAAG CACACCGGCG TGTCGGTGAC CACCGTCTGC CCCGGCGCCA CGCGCACGAA CTTCCCGAAG AACGCCGGCA TCGAGGACGC GCCCGCGTGG AAGTACTTCT CCATGGATAC CGACGAGACG GCCATCCGCG TGTACCGCGC GCTCATGCGC GGCGAGCGCT GCGCCGTGAC GGGCTGGTAC AACAAAGTGG GTTCGATCTC GGTGCGGCTC ATGCCCATGG GCATGCAGCT GCTGGCCGGC GAGTGGCTGA TGGGCGCGCG CAAGCATCCG CTCGACCACG AGGGAACCGA AGAGGGCTCG CACGAGGAGC ATCCGCACGG CCGGAAGCAG GGTCAAGCCG GGACGCAGGC CTGCGGATGC GGCCATGCGC ACGATCACGG GGCCGGTCAC GAAGACGGGC ATCCTCGCGG ACGCGAGGAC GATCAGGCTC GGAGGCACGA CCACCATCAG CACGGCCAGC GCGAACACGG CGGCATGGGC GGCACCGCGC CTGTCTGCCG CATCAGCTGG GATCGCTGGT AG
|
Protein sequence | MAQTALVTGA TSGIGEALCL LFASDDFDLV TVARDKEALE KQADELRRLD IDVLPIACDL SDPDAARTIF KRVQEAGKEI EILVNDAGYS PAGQFSDLPI ADIRSMIQVS VTSLAELTSV FLHPMLERGH GRILNMSSMM AKTPCPYNAL YGAAKVFVLS FSTALARELK HTGVSVTTVC PGATRTNFPK NAGIEDAPAW KYFSMDTDET AIRVYRALMR GERCAVTGWY NKVGSISVRL MPMGMQLLAG EWLMGARKHP LDHEGTEEGS HEEHPHGRKQ GQAGTQACGC GHAHDHGAGH EDGHPRGRED DQARRHDHHQ HGQREHGGMG GTAPVCRISW DRW
|
| |
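The gene sequence above is listed in coding orientation (it begins with ATG and ends with TAG), so it can be translated directly to check it against the protein sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it strips the spaces that separate the 10-character blocks. The helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in table 1 and table 11).
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGGCACAAA CCGCATTGGT CACCGGCGCG"))
```

Applied to the first three blocks of the gene sequence, this yields MAQTALVTGA, the first block of the listed protein sequence. Running it on the full sequence and comparing against the full protein (with spaces removed) would be the complete check.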