Gene Elen_2941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2941 
Symbol 
ID8417272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3414189 
End bp3417836 
Gene Length3648 bp 
Protein Length1215 aa 
Translation table11 
GC content65% 
IMG OID645025918 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003183274 
Protein GI257792668 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.972596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAA CGATAAAGCA GGGGAAGGGC GCGCTTCGCT TGCTGCTGGC GACTATGTTG 
GCGGCAATGC TGATCCCCTG GGCTCCCCGA GCGGCTTGGG CCGATGAAAA CGAAGCAGCT
GCCCAGACGC AGACGAGCGA GATCGCCGAT CTGCTGGCTG CCGGAGACTA CGTCGAGGGC
GAGGCTCTCG TCGTCGTTGA CAACGCGGCA TCTCGAAACA GCCTCCGTTC GCGCAGCGTC
GACCCGCTCT CCTCGGCCGA ACCGCTCATG GACGCGTCGG CCGAAACGTA CGCCACCGCT
ACGGGCGTCG ACGTGCCGTT GGAACAATCT GTTTCCAACG GCCCGGTTCT CCTGCGCAGC
GCAAGATCCC TACCTGAAGA CGACGCGGTG AGCATGGTGC TGGTGCGGCA GGAGGGTACC
TCAACCGAGG ATCTCCTGCG CCAGCTGCAG GACGACCCGC GCATCCTGTC CGCCGAGCCG
AACTACGTGC AGCGGCTGGA CGACCCCGAT CAGACGGCGC TCGATGAGAG CACGCCGCAA
GTCGCTGCGG CATTGGGGCT GGATTCGTCG GACGCGGCGC CGTCGGCTCC GGCGAGCGCG
GATGCAAGCG GCACGGCCGC TGCCGACCTG TCCGGCTACC AGTGGGGCTG CCGCAACACC
GGAGAAGTCA TGGTGCAAGA AGAGGCGCCG CCCCTTGCCG GTTTCGACAT CGCGCCGCCG
AATTGGAACC AGGTCGGCAC AACGAACGCC GCCGACGTCG TTGCCGTGGT TGACACTGGC
GTGGACTACA ACCATCCCGA CCTCAACGGC ATCATGTGCG ACATGACGCA GTACACGGAC
AGGGGCGGAC GCTACGGGAT CAACGTCGTG CCCGGCATGG ATCCCACCGA TCCGATGGAC
GATCATTACC ACGGAACGCA CTGCGCGGGC ATCATCGCGT CGCAATGGGA TGGCGTGGGG
ACGAGCGGCG TCGCGTCCGG CGTGCAGCTG GTGGCCGTTA GGGTCGCAGG TAGGCAGAAC
ACGATCGAGA GTGCGGATAC CATCAGGGCA TACGAGTATC TTGCCGAAGC CGTTGATGCC
GGCTTGCCAC TGCGCGCCAT CAACAACTCG TGGGCCGGAA GGACGATGAG CAAAGCATTC
AGCTTGGCGG CGACGAAGCT GGGCGAGAAA GGCGCCGTGA CCGTCATCGG GTCGGGCAAC
GATGCCGCCG ACGTTGACAA GATACCGATG ACCGCATCCG TGCTCGCGCA TAACCCGTAT
GCGGTCGCTG TCGATGCGTC GGATTCTTCA GGGCAGCTCG CCTCGTTCAG CAACTACGGC
GTGGAAACGA CCGACGTGGT GGCTCCGGGC ATGAACATCA TGTCCACCGT GCCCCTCGGC
AGGGCGGCCT ACGCTCCCGA AGCCGACGGC TCCCCTTTGA GCTACGAGAC GTTCGAAAGC
GACGCCCCGT CGGTTACGGC GCGCGCGACG CCGACGTCGT CCGAGGATAT AGATGCCGTC
GTGCAGGGGG CGACCCATTT CGATGCGAGG GGCGGAGCTC TGAAGATCCC CTTTTCCGAA
ATGCTCACGG AAGAGACGGC GCTGGGCATC AGTTTTCGAA GCGTCGTGGT GTCGATGGAA
GTCGATCCGT CGAACCAAGA GGGCGTCCGG TACGTAGGAA TGAACGCGAC GGCCACGAAC
CAGAACAACG ACAACCTGCT TTACCAGGTT GCGATAATGA AAGACGGCGC GCTTGATTGG
ACGCCGATGG GACAGACGGC TGTCGCGGCC CTTGGGAAGG ACCGAGGTTG GACGACCATC
GCCTTCGACG CCCAGACGCT TGCCCAGAAT GCCGGCGGCA CCTTGGCGTT CGAAGGCGGC
AAGCTGCAGG TGCGGCTGAC GTTCGCACGC GCCGGCACGC CGATCGCCCC GACCGATGCC
CTGTACCTCG ACACGGTTGG AACGGGCAGG GCCGGCTCCG CGATTCCCTA TCAATCTTTG
AGCGGGACGT CCATGGCAAC TCCCATGGTC ACCGGCGCGG CCATGGTGCT CGCGCAGGAT
GTCGACGGCT CCACCCCGGC CGAGCGCGCT GCGAAGCTGG CGGCGCTCGT GAAGGCCTCC
GTGCGTCCGG TGGACGCGTT CTCCGGCAAG TGCTCGTCGG GCGGACACCT TGATTTGAGG
GTTGCGTCGG GCGACTTCGT CCCGGTGGTG TCGGGCGCTC GGATGGAATC CGACGGCGAC
GCCGCACTCG TCATGGTGGA GGGCAGCTAC TTCGGCGCAG CTCAGGGTGC GGGAAGCGTG
CACATCGGCG GCAAGGAGGC TGCTGTGCGC TCCTGGTCGG ACGGCTCCAT CGTAACGGAG
TGCCCACAGG GTCTCAAAAG CGGCGTGCTG GTGGTAGAGG TAACGGCCGG CAACGGCAAG
AGCGGATCGA AGGGCTTCCT GCTGCAGGTT CCCGAGCGAC CGGACGACCA ATCCACGCCC
CTGTTCGAGA AGACCATAGC GCTTCCCACG TCGGATGAGG GCCTGCCGGT TGGCGTGTCG
TACGGTCTTC TGACCGGGCT GGGCGGTTCG CTATACCTGC TGCCAAACGA CTTCAGCATA
TCGCATGCGG GTACTTATGC GCTATGGCGC TACGACCCCG ATCAGGACAG CTGGACGCAG
TGCGCCGACC TACCCGAGGC TCTCAGCAGC GCATCGATGA CCGTGTTCGA CGGCAAGCTG
TACGTATACG GCGAAGCGGG CGAGACGGAT GGGGCGTTGG CTCCTCGTCT TGTCAGCTAC
GACCCGGCGA CGAACGCCTG GACTTCCCAT GACGCTTCCC GCCTGCCGTT TCATGCGACG
ATAGTGAACT GCGATGGAGC GATGCTGCTG GTGGGCGGCG CCGCGTACGA CGGTGTCAAA
TGGGTCGAGC TGACCGAAGG CAACATCGCC GTGTACGATA CGCAGACGGG GGCGGTAACG
GCTGCGGGAT CCCTCGCGAC AGGCCGCTCG ACTCCCTTTG TCGTAGCGCG AGGGGCCGAG
CTGTTCGTCG CCTTGGGCGA TGTGCACGAC GTGACGGGCT CTTCGACGAC GAGCCCCGTA
TTCGAGCGCA TCGTGAGATC CGGCGACTCG TACGTTGGCA ACGACCTGTC CGCCGCCCTG
CCTGCGCTTG CGCCCGGCTA CGACGATTCG TTCGGCTTGG CAGCGGTCAA AGCCGGAGTG
GTGCTGTCGG GCCGTATCAG CGTCGAAGAA GCCGACGGCG GACGCGAGCT CGATCAGGAC
ACCTATCTGT TGGACTTGGC GAACGGAGAC AAAAGTTTCC AAGGCATCGA CAAGCGCGCC
TCCCGGGTGC CGCTGTACTT CCCGAGCTCG ACAGCGTACG ACGGATGGCT GTACACCATG
GGCTTCTCGG TCTACGAGAA CGGTGGCCGG GTGATACGGG CCACGCCGAT GGAGACGCTG
CCCCAACCCG GCGACCTACC TGAACCTGGA CCCGGCCCTA CGCCCGGACC TGGCCCTGCA
GCTGCGCCGG AACCGGTTGT CGGCAAGGGG GATGCAACGT CTCTGGCGAG AACCGGCGAC
CCATTAGGCG CTTTCGTGCC GGCTATCACG CTGGCCGCCG GTGTCATGCT CGCAAGCATA
GGCGTTGCCA CGGCTGCACG ACGAAAAGCG AAAAAGAACA GGCTATGA
 
Protein sequence
MDETIKQGKG ALRLLLATML AAMLIPWAPR AAWADENEAA AQTQTSEIAD LLAAGDYVEG 
EALVVVDNAA SRNSLRSRSV DPLSSAEPLM DASAETYATA TGVDVPLEQS VSNGPVLLRS
ARSLPEDDAV SMVLVRQEGT STEDLLRQLQ DDPRILSAEP NYVQRLDDPD QTALDESTPQ
VAAALGLDSS DAAPSAPASA DASGTAAADL SGYQWGCRNT GEVMVQEEAP PLAGFDIAPP
NWNQVGTTNA ADVVAVVDTG VDYNHPDLNG IMCDMTQYTD RGGRYGINVV PGMDPTDPMD
DHYHGTHCAG IIASQWDGVG TSGVASGVQL VAVRVAGRQN TIESADTIRA YEYLAEAVDA
GLPLRAINNS WAGRTMSKAF SLAATKLGEK GAVTVIGSGN DAADVDKIPM TASVLAHNPY
AVAVDASDSS GQLASFSNYG VETTDVVAPG MNIMSTVPLG RAAYAPEADG SPLSYETFES
DAPSVTARAT PTSSEDIDAV VQGATHFDAR GGALKIPFSE MLTEETALGI SFRSVVVSME
VDPSNQEGVR YVGMNATATN QNNDNLLYQV AIMKDGALDW TPMGQTAVAA LGKDRGWTTI
AFDAQTLAQN AGGTLAFEGG KLQVRLTFAR AGTPIAPTDA LYLDTVGTGR AGSAIPYQSL
SGTSMATPMV TGAAMVLAQD VDGSTPAERA AKLAALVKAS VRPVDAFSGK CSSGGHLDLR
VASGDFVPVV SGARMESDGD AALVMVEGSY FGAAQGAGSV HIGGKEAAVR SWSDGSIVTE
CPQGLKSGVL VVEVTAGNGK SGSKGFLLQV PERPDDQSTP LFEKTIALPT SDEGLPVGVS
YGLLTGLGGS LYLLPNDFSI SHAGTYALWR YDPDQDSWTQ CADLPEALSS ASMTVFDGKL
YVYGEAGETD GALAPRLVSY DPATNAWTSH DASRLPFHAT IVNCDGAMLL VGGAAYDGVK
WVELTEGNIA VYDTQTGAVT AAGSLATGRS TPFVVARGAE LFVALGDVHD VTGSSTTSPV
FERIVRSGDS YVGNDLSAAL PALAPGYDDS FGLAAVKAGV VLSGRISVEE ADGGRELDQD
TYLLDLANGD KSFQGIDKRA SRVPLYFPSS TAYDGWLYTM GFSVYENGGR VIRATPMETL
PQPGDLPEPG PGPTPGPGPA AAPEPVVGKG DATSLARTGD PLGAFVPAIT LAAGVMLASI
GVATAARRKA KKNRL