Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0135 |
Symbol | |
ID | 8414419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 187217 |
End bp | 188638 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023115 |
Product | dipeptidase |
Protein accession | YP_003180518 |
Protein GI | 257789912 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01887] dipeptidase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.502281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACATG CTGAACTGAC CAGGAAGATC GACGCGTATC TCGAGGAAAA TTGGGAGACG ATGGTCGAAG ACATCGAGAC GCTCGTCCGC ATCCCCAGCT TCGAGGAATC GGACAAGGCG ACCGAAGGCG CTCCGTTCGG CCCCGGCCCG AAGGAGGCGC TGGAAGCCGC GCTTGAAATG GCCTCCGACA TGGGATTCAA GACGCACGAT GCCGAGGGCT ACATCGGGTT CGCCGACTTC CCCGGCAAGA GCGACACGCA GCTGGGCATC ATCGGCCACA TGGACGTGGT TCCCGCCGGT CCCGGCTGGA ACTTCGAGCC GTACGCGGTC ACGCGCAAGG AAGGCTACCT CGTCGGTCGC GGCACGCTCG ACGACAAGGG CCCCAGCGTG GTGGCGCTGC ACGCCATGAA GTTCTGGAAG GACTTGCAGG ACGCCGGCGA GGCTCCTCAG TTCCCCTACA CCGTCCGCTT CCTGTTCGGC GCGAACGAAG AATCCGGCAT GGCCGACGTG GCCTACTACC ACAAGCATTA CGACGATCCG GCGTTCCTGT TCACGCCCGA TGCCGAGTTC CCCGTGTGCT ACGGCGAGAA GGGTGGCTAC GACGGCGAGC TGATCAGCAA GCCCATCGCC GACCGCATCG TGCTGGAGTT CACGGGCGGC GCGGCCACGA ACGCGGTGCC CGGCATCGCC GAGGCCGTGG TGAAGGCCGA CGCTGCCGAC CTGCCGAATA CTGATCGCAT TACCGTTGCG GCCGACGGCG AAGGCCGCGC GAAGATCACT GCGGCCGGCA AAGGAGCGCA CGCCTCCATG CCCGAAGGAG GCGTGAACGC CATCGGCCTC ATCGTGGACT ACCTGCTGGA GCACGACCTG TGCACCGCCG ACGAGCGCGC GTTCTTCGAG CTCGACCAGA AGCTGCTCAA TCATACCGAC GGCAGCGGCA TCGGCATCAA GAGCTCCGAC GAGTACTTCG GCCCGCTCAC CGTCATCGGC GGCACCATCA AAATCGAGGA CGACCGCTTT GTGCAGACGC TCGACAGTCG GTTCCCCACG TCCATCACGG CCGACGAGAT CACCGAGCGC CTGCGCCAGC TGGCGAGCGA GATCGGCGGC TCGTTTGAGA ACACGCTGCT TATGGAGCCG TTCCTCGTAA AGCCCGACAG CCCTGTGATC CAGGCGCTGC TGAACGCGTA CAACGAGGCC ACCGGCGAGG ACGCGAAGCC GTTCACCATG GGCGGCGGCA CGTACGCGCG CGAGTTCAAG AGCGGTGCCA GCTTCGGTCC CGAAAAGCCG TGGGTTGAGG ATCCCGAATG GGTTGGCATG ATGCACGGTC CGGACGAGGG CGTCAGCGAG GACCTGCTGA AGCAGTCCTT CAAGATCTAC GCGCTCACGC TGGACAAGCT CATGCAGCTC GACCTGCAAT AG
|
Protein sequence | MEHAELTRKI DAYLEENWET MVEDIETLVR IPSFEESDKA TEGAPFGPGP KEALEAALEM ASDMGFKTHD AEGYIGFADF PGKSDTQLGI IGHMDVVPAG PGWNFEPYAV TRKEGYLVGR GTLDDKGPSV VALHAMKFWK DLQDAGEAPQ FPYTVRFLFG ANEESGMADV AYYHKHYDDP AFLFTPDAEF PVCYGEKGGY DGELISKPIA DRIVLEFTGG AATNAVPGIA EAVVKADAAD LPNTDRITVA ADGEGRAKIT AAGKGAHASM PEGGVNAIGL IVDYLLEHDL CTADERAFFE LDQKLLNHTD GSGIGIKSSD EYFGPLTVIG GTIKIEDDRF VQTLDSRFPT SITADEITER LRQLASEIGG SFENTLLMEP FLVKPDSPVI QALLNAYNEA TGEDAKPFTM GGGTYAREFK SGASFGPEKP WVEDPEWVGM MHGPDEGVSE DLLKQSFKIY ALTLDKLMQL DLQ
|
| |