Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2854 |
Symbol | |
ID | 8417185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3314204 |
End bp | 3316879 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645025833 |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003183189 |
Protein GI | 257792583 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family [TIGR01575] ribosomal-protein-alanine acetyltransferase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.594224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.241498 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACA TGAATTTGCG TCCCGAAGGG GCGTCCGAGC GGGCACGGTA CGTGCTCGCC TTCGATACTG CCAACGAAAT CATCGCTATC GGGTTGGGCG TGCTGCATGC GTCGTCGCGT ATGATCGAGT TGACGGCTTC CGTGGAAGCC GAGGCTCGAC GCGCGTCGAA CACCCAGCTG CTGCCCCGCA TCGACGCAGC GTTGGCCGAA CACGGCGTGG CACGCGAGGA CATCGCGTGC GTGGCGGTGG GCCGCGGCCC GGGATCGTTC ACGGGCGTGC GCATCGCGAT GGCTACGGCG AAGGGCATCG CGTCAGCACT CGAGGTGCCG CTGGTTGGCG TATCGTCGCT TGATGCGGTG GCGTGGAACG CGTGGGCGGC GGGCGAGCGC GGGCCGCTGT CCGTGGTGGC CGACGCCATG CGCAAGGAAG TGTACCCGGT GCGCTATCTG CTGAACGATA CGGGCATCGA GCGGTTGGAG GCCGACCGCG TGGTGAAGGC CGAAGACGCC GCGCGGGAAC TTGCTGCTGA GGGCGACCTG TCCGGGTCGG CGTCTGCGAC GGTGCCCAGT CGCTCGGACA GTCCTGGCTC GCACGAAACG TCCGCAGGAC GTTTCGGCTC GACGGGCGCG GCGCCGCGGA GCGAAGCGGA GCAAGTCCCC GAAGGGGAGA CTCGCTTGGT GGGCGCCATC GCAGACGCCG ACCCTGCCGA GGGGGACGCG TGGCCGCAGG TTTCCGCACG TCTTCTTTGC GGCGATGCGC TGAAGAAATA TGGGGAGTTA TTTGAGGGCT GCGGGGCGGC GCTGCCTGCC GAGTTGTGGA TGCCGACGGG TCGCGGGCTG TTGCTGGCGC TGCAGGCGGC GTGGCGGGCG GGCGAGGCCG ACCCGCTCGA TGCGCGTCGC CATGACCCTG CGTTCGCGCT GCCGGTGTAC ACGCGCCTAT CGGATGCGGA GGAGAACGAG CGCATCCGCC TGGCGAAGAA CGATCCGAAG AACCTTGCGA CCGGCGTGCA GGACGTGGCG AAGCGGGCCG ACCAGCGCGC GACCATGCAC GATACCGCCA TCCTGAACGC ACAGCCCGAC GAGCATGGCA TCACGTACAA GCCGCTCGAT GCCGCCCATG CAGGCGCTGT CGCGACGTTG GAGTCCCTAG TCATGGGATC AGACGCTTGG AGCGAAGCGC TCGTGGCCGA CGAACTCCCC CGCGCCGACC GCGTGTGGTG GGCAGCTTAC GAAGGGGAAG CGCTTGCGGG TTACGCCGGC GGGTGGATCG TCGACGGGCA GGTGCAGATC CTCAAGGTGG GCGTCGACCC GGCCATGCGG CGACGCGGCA TCGCGCGCGA GCTGTTGGCG CACGTGGCGG CCGATGCGCG CGACCTGGGC GCGTCCCGTT GCTCGCTCGA GGTGCGGGCG GGGAATGTCG GTGCGCAGGA GTTGTACGCC GCGCTCGGCT TCCGTTCGCT GGGTGTGCGC CCGCGTTACT ACTCCGACGG CGAGGACGCC GTCATCATGG AGGGCCCTCT GCCGTTGGCC AGGCACGATG TGGCCGGCAT GGAGCTGGTG GTGGGCGCTG CCAGCGACGA TGCCCGCTCC CTCCGTGACG AAGTGCAAAC GGATGTTTCA CGCGAAACAT CCGAACGCCG CCCCCTCATC CTCGCCATCG AATCGTCCTG CGACGAGACG GCGGCCGCCA TCGTCGACGG CAACGGCACG CTTATCGCCG ACGTCGTGGC CTCGCAGATC GACTTCCATG CGCGCTTCGG CGGCGTGGTG CCCGAGATCG CTAGCCGCAA GCACATCGAG GCTATCTGCG GCGTGTGCGA CGAGTGCTTC GACGTGGCTG CTTCCGCGCT GGGCATCGAA CGTCTGACCT GGCGCGATCT GGATTCCATT GCCGTGACGT ACGCGCCGGG GCTCGTGGGT GCGCTCGTGG TGGGCGTTGC GTTCGCGAAG GGCGCCGCGT GGGCGGCCGG CAAGCCGTTC ATCGGCGTGA ACCATCTGGA AGGCCACCTG TACGCGAACA AGATCGGCGC GCCCGATTTC CAGCCGCCCG CGGTGGTGTC GCTCGTGTCG GGCGGCAACA CGCTGCTCGT GCATATGAAG GGCTGGGGCG ACTACGAAAC GCTGGGCGCT ACCATCGACG ACGCGGTGGG CGAGGCGTTC GACAAGGTGG CGAAGGCGCT GGGCCTGGGC TATCCGGGCG GTCCCGTCAT CTCGCGCGAG GCGGCCAAGG GCGACCCGAA CGCCATCCCG TTCCCGCGTG CCATGATGCA CTCGGGCGAC CTGCGCTTCT CGTTGTCGGG TCTGAAAACC GCGGTGGTCA CCTATATCAA CAACGAACGC GCCGCCGGCC GCGAGCTGAA CGTGCCGAAC ATCTGCGCCA GTTTCCAGCA GGCGGTGGTG GACGTGCAGG TGAAGAAGGC CGAAATGGCG CTCGAGCAGA CGGGCGCGCG CACGTTCTGC CTCGGCGGCG GCGTGGCTGC GAACCCCGCG CTACGCGACG CGTACGAGCA GCTGTGCGAG CGTCTGCACG TGCGCCTCAC CCTGCCGCCC TTGAGCGCTT GCGGCGACAA CGCCGGCATG ATCGCGCTGG TGGCCCTCGA CCGCCACAAC CAGGGCAAGT TCTTCACCCT GGAAGCCGAC GCCCAAGCCC ACGCCAACCT CGACGAGCCG TACTGA
|
Protein sequence | MSDMNLRPEG ASERARYVLA FDTANEIIAI GLGVLHASSR MIELTASVEA EARRASNTQL LPRIDAALAE HGVAREDIAC VAVGRGPGSF TGVRIAMATA KGIASALEVP LVGVSSLDAV AWNAWAAGER GPLSVVADAM RKEVYPVRYL LNDTGIERLE ADRVVKAEDA ARELAAEGDL SGSASATVPS RSDSPGSHET SAGRFGSTGA APRSEAEQVP EGETRLVGAI ADADPAEGDA WPQVSARLLC GDALKKYGEL FEGCGAALPA ELWMPTGRGL LLALQAAWRA GEADPLDARR HDPAFALPVY TRLSDAEENE RIRLAKNDPK NLATGVQDVA KRADQRATMH DTAILNAQPD EHGITYKPLD AAHAGAVATL ESLVMGSDAW SEALVADELP RADRVWWAAY EGEALAGYAG GWIVDGQVQI LKVGVDPAMR RRGIARELLA HVAADARDLG ASRCSLEVRA GNVGAQELYA ALGFRSLGVR PRYYSDGEDA VIMEGPLPLA RHDVAGMELV VGAASDDARS LRDEVQTDVS RETSERRPLI LAIESSCDET AAAIVDGNGT LIADVVASQI DFHARFGGVV PEIASRKHIE AICGVCDECF DVAASALGIE RLTWRDLDSI AVTYAPGLVG ALVVGVAFAK GAAWAAGKPF IGVNHLEGHL YANKIGAPDF QPPAVVSLVS GGNTLLVHMK GWGDYETLGA TIDDAVGEAF DKVAKALGLG YPGGPVISRE AAKGDPNAIP FPRAMMHSGD LRFSLSGLKT AVVTYINNER AAGRELNVPN ICASFQQAVV DVQVKKAEMA LEQTGARTFC LGGGVAANPA LRDAYEQLCE RLHVRLTLPP LSACGDNAGM IALVALDRHN QGKFFTLEAD AQAHANLDEP Y
|
| |