Gene Elen_2854 details


Gene Information

Locus tag: Elen_2854
Symbol:
ID: 8417185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3314204
End bp: 3316879
Gene Length: 2676 bp
Protein Length: 891 aa
Translation table: 11
GC content: 68%
IMG OID: 645025833
Product: metalloendopeptidase, glycoprotease family
Protein accession: YP_003183189
Protein GI: 257792583
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
            [TIGR01575] ribosomal-protein-alanine acetyltransferase
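
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino acid count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3314204, 3316879)
print(length)                  # 2676
print(protein_length(length))  # 891
```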


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.594224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.241498
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGACA TGAATTTGCG TCCCGAAGGG GCGTCCGAGC GGGCACGGTA CGTGCTCGCC 
TTCGATACTG CCAACGAAAT CATCGCTATC GGGTTGGGCG TGCTGCATGC GTCGTCGCGT
ATGATCGAGT TGACGGCTTC CGTGGAAGCC GAGGCTCGAC GCGCGTCGAA CACCCAGCTG
CTGCCCCGCA TCGACGCAGC GTTGGCCGAA CACGGCGTGG CACGCGAGGA CATCGCGTGC
GTGGCGGTGG GCCGCGGCCC GGGATCGTTC ACGGGCGTGC GCATCGCGAT GGCTACGGCG
AAGGGCATCG CGTCAGCACT CGAGGTGCCG CTGGTTGGCG TATCGTCGCT TGATGCGGTG
GCGTGGAACG CGTGGGCGGC GGGCGAGCGC GGGCCGCTGT CCGTGGTGGC CGACGCCATG
CGCAAGGAAG TGTACCCGGT GCGCTATCTG CTGAACGATA CGGGCATCGA GCGGTTGGAG
GCCGACCGCG TGGTGAAGGC CGAAGACGCC GCGCGGGAAC TTGCTGCTGA GGGCGACCTG
TCCGGGTCGG CGTCTGCGAC GGTGCCCAGT CGCTCGGACA GTCCTGGCTC GCACGAAACG
TCCGCAGGAC GTTTCGGCTC GACGGGCGCG GCGCCGCGGA GCGAAGCGGA GCAAGTCCCC
GAAGGGGAGA CTCGCTTGGT GGGCGCCATC GCAGACGCCG ACCCTGCCGA GGGGGACGCG
TGGCCGCAGG TTTCCGCACG TCTTCTTTGC GGCGATGCGC TGAAGAAATA TGGGGAGTTA
TTTGAGGGCT GCGGGGCGGC GCTGCCTGCC GAGTTGTGGA TGCCGACGGG TCGCGGGCTG
TTGCTGGCGC TGCAGGCGGC GTGGCGGGCG GGCGAGGCCG ACCCGCTCGA TGCGCGTCGC
CATGACCCTG CGTTCGCGCT GCCGGTGTAC ACGCGCCTAT CGGATGCGGA GGAGAACGAG
CGCATCCGCC TGGCGAAGAA CGATCCGAAG AACCTTGCGA CCGGCGTGCA GGACGTGGCG
AAGCGGGCCG ACCAGCGCGC GACCATGCAC GATACCGCCA TCCTGAACGC ACAGCCCGAC
GAGCATGGCA TCACGTACAA GCCGCTCGAT GCCGCCCATG CAGGCGCTGT CGCGACGTTG
GAGTCCCTAG TCATGGGATC AGACGCTTGG AGCGAAGCGC TCGTGGCCGA CGAACTCCCC
CGCGCCGACC GCGTGTGGTG GGCAGCTTAC GAAGGGGAAG CGCTTGCGGG TTACGCCGGC
GGGTGGATCG TCGACGGGCA GGTGCAGATC CTCAAGGTGG GCGTCGACCC GGCCATGCGG
CGACGCGGCA TCGCGCGCGA GCTGTTGGCG CACGTGGCGG CCGATGCGCG CGACCTGGGC
GCGTCCCGTT GCTCGCTCGA GGTGCGGGCG GGGAATGTCG GTGCGCAGGA GTTGTACGCC
GCGCTCGGCT TCCGTTCGCT GGGTGTGCGC CCGCGTTACT ACTCCGACGG CGAGGACGCC
GTCATCATGG AGGGCCCTCT GCCGTTGGCC AGGCACGATG TGGCCGGCAT GGAGCTGGTG
GTGGGCGCTG CCAGCGACGA TGCCCGCTCC CTCCGTGACG AAGTGCAAAC GGATGTTTCA
CGCGAAACAT CCGAACGCCG CCCCCTCATC CTCGCCATCG AATCGTCCTG CGACGAGACG
GCGGCCGCCA TCGTCGACGG CAACGGCACG CTTATCGCCG ACGTCGTGGC CTCGCAGATC
GACTTCCATG CGCGCTTCGG CGGCGTGGTG CCCGAGATCG CTAGCCGCAA GCACATCGAG
GCTATCTGCG GCGTGTGCGA CGAGTGCTTC GACGTGGCTG CTTCCGCGCT GGGCATCGAA
CGTCTGACCT GGCGCGATCT GGATTCCATT GCCGTGACGT ACGCGCCGGG GCTCGTGGGT
GCGCTCGTGG TGGGCGTTGC GTTCGCGAAG GGCGCCGCGT GGGCGGCCGG CAAGCCGTTC
ATCGGCGTGA ACCATCTGGA AGGCCACCTG TACGCGAACA AGATCGGCGC GCCCGATTTC
CAGCCGCCCG CGGTGGTGTC GCTCGTGTCG GGCGGCAACA CGCTGCTCGT GCATATGAAG
GGCTGGGGCG ACTACGAAAC GCTGGGCGCT ACCATCGACG ACGCGGTGGG CGAGGCGTTC
GACAAGGTGG CGAAGGCGCT GGGCCTGGGC TATCCGGGCG GTCCCGTCAT CTCGCGCGAG
GCGGCCAAGG GCGACCCGAA CGCCATCCCG TTCCCGCGTG CCATGATGCA CTCGGGCGAC
CTGCGCTTCT CGTTGTCGGG TCTGAAAACC GCGGTGGTCA CCTATATCAA CAACGAACGC
GCCGCCGGCC GCGAGCTGAA CGTGCCGAAC ATCTGCGCCA GTTTCCAGCA GGCGGTGGTG
GACGTGCAGG TGAAGAAGGC CGAAATGGCG CTCGAGCAGA CGGGCGCGCG CACGTTCTGC
CTCGGCGGCG GCGTGGCTGC GAACCCCGCG CTACGCGACG CGTACGAGCA GCTGTGCGAG
CGTCTGCACG TGCGCCTCAC CCTGCCGCCC TTGAGCGCTT GCGGCGACAA CGCCGGCATG
ATCGCGCTGG TGGCCCTCGA CCGCCACAAC CAGGGCAAGT TCTTCACCCT GGAAGCCGAC
GCCCAAGCCC ACGCCAACCT CGACGAGCCG TACTGA
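
A GC content figure like the one in the Gene Information section can be recomputed from a sequence laid out as above, in space-separated blocks of ten bases. The following is a minimal sketch, assuming GC content means the fraction of G and C among all bases. The helper name `gc_fraction` is hypothetical, and how the record's 68% was rounded is not known.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence.

    Whitespace (spaces between blocks, line breaks) is ignored.
    """
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# Usage on a short toy sequence; paste the full gene sequence in its place.
print(round(100 * gc_fraction("GGCC AT\nGC")))  # 75
```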
 
Protein sequence
MSDMNLRPEG ASERARYVLA FDTANEIIAI GLGVLHASSR MIELTASVEA EARRASNTQL 
LPRIDAALAE HGVAREDIAC VAVGRGPGSF TGVRIAMATA KGIASALEVP LVGVSSLDAV
AWNAWAAGER GPLSVVADAM RKEVYPVRYL LNDTGIERLE ADRVVKAEDA ARELAAEGDL
SGSASATVPS RSDSPGSHET SAGRFGSTGA APRSEAEQVP EGETRLVGAI ADADPAEGDA
WPQVSARLLC GDALKKYGEL FEGCGAALPA ELWMPTGRGL LLALQAAWRA GEADPLDARR
HDPAFALPVY TRLSDAEENE RIRLAKNDPK NLATGVQDVA KRADQRATMH DTAILNAQPD
EHGITYKPLD AAHAGAVATL ESLVMGSDAW SEALVADELP RADRVWWAAY EGEALAGYAG
GWIVDGQVQI LKVGVDPAMR RRGIARELLA HVAADARDLG ASRCSLEVRA GNVGAQELYA
ALGFRSLGVR PRYYSDGEDA VIMEGPLPLA RHDVAGMELV VGAASDDARS LRDEVQTDVS
RETSERRPLI LAIESSCDET AAAIVDGNGT LIADVVASQI DFHARFGGVV PEIASRKHIE
AICGVCDECF DVAASALGIE RLTWRDLDSI AVTYAPGLVG ALVVGVAFAK GAAWAAGKPF
IGVNHLEGHL YANKIGAPDF QPPAVVSLVS GGNTLLVHMK GWGDYETLGA TIDDAVGEAF
DKVAKALGLG YPGGPVISRE AAKGDPNAIP FPRAMMHSGD LRFSLSGLKT AVVTYINNER
AAGRELNVPN ICASFQQAVV DVQVKKAEMA LEQTGARTFC LGGGVAANPA LRDAYEQLCE
RLHVRLTLPP LSACGDNAGM IALVALDRHN QGKFFTLEAD AQAHANLDEP Y
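
The protein sequence begins with M even though the gene sequence begins with GTG, which normally encodes valine. Under NCBI translation table 11 (the table named in this record), GTG is one of several alternative start codons and is read as methionine when it initiates translation. The sketch below illustrates this with a hand-built codon table. The helper `translate` is hypothetical, and Biopython's `Seq.translate(table=11, cds=True)` would be a more robust choice in practice.

```python
# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Amino acids are listed in codon order TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate a CDS, reading an alternative start codon as M.

    Whitespace is ignored; translation stops at the first stop codon.
    Bases other than A, C, G, T raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGAGCGACA TGAATTTGCG TCCCGAAGGG"))  # MSDMNLRPEG
```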