Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2941 |
Symbol | |
ID | 8417272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3414189 |
End bp | 3417836 |
Gene Length | 3648 bp |
Protein Length | 1215 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025918 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003183274 |
Protein GI | 257792668 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.972596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAA CGATAAAGCA GGGGAAGGGC GCGCTTCGCT TGCTGCTGGC GACTATGTTG GCGGCAATGC TGATCCCCTG GGCTCCCCGA GCGGCTTGGG CCGATGAAAA CGAAGCAGCT GCCCAGACGC AGACGAGCGA GATCGCCGAT CTGCTGGCTG CCGGAGACTA CGTCGAGGGC GAGGCTCTCG TCGTCGTTGA CAACGCGGCA TCTCGAAACA GCCTCCGTTC GCGCAGCGTC GACCCGCTCT CCTCGGCCGA ACCGCTCATG GACGCGTCGG CCGAAACGTA CGCCACCGCT ACGGGCGTCG ACGTGCCGTT GGAACAATCT GTTTCCAACG GCCCGGTTCT CCTGCGCAGC GCAAGATCCC TACCTGAAGA CGACGCGGTG AGCATGGTGC TGGTGCGGCA GGAGGGTACC TCAACCGAGG ATCTCCTGCG CCAGCTGCAG GACGACCCGC GCATCCTGTC CGCCGAGCCG AACTACGTGC AGCGGCTGGA CGACCCCGAT CAGACGGCGC TCGATGAGAG CACGCCGCAA GTCGCTGCGG CATTGGGGCT GGATTCGTCG GACGCGGCGC CGTCGGCTCC GGCGAGCGCG GATGCAAGCG GCACGGCCGC TGCCGACCTG TCCGGCTACC AGTGGGGCTG CCGCAACACC GGAGAAGTCA TGGTGCAAGA AGAGGCGCCG CCCCTTGCCG GTTTCGACAT CGCGCCGCCG AATTGGAACC AGGTCGGCAC AACGAACGCC GCCGACGTCG TTGCCGTGGT TGACACTGGC GTGGACTACA ACCATCCCGA CCTCAACGGC ATCATGTGCG ACATGACGCA GTACACGGAC AGGGGCGGAC GCTACGGGAT CAACGTCGTG CCCGGCATGG ATCCCACCGA TCCGATGGAC GATCATTACC ACGGAACGCA CTGCGCGGGC ATCATCGCGT CGCAATGGGA TGGCGTGGGG ACGAGCGGCG TCGCGTCCGG CGTGCAGCTG GTGGCCGTTA GGGTCGCAGG TAGGCAGAAC ACGATCGAGA GTGCGGATAC CATCAGGGCA TACGAGTATC TTGCCGAAGC CGTTGATGCC GGCTTGCCAC TGCGCGCCAT CAACAACTCG TGGGCCGGAA GGACGATGAG CAAAGCATTC AGCTTGGCGG CGACGAAGCT GGGCGAGAAA GGCGCCGTGA CCGTCATCGG GTCGGGCAAC GATGCCGCCG ACGTTGACAA GATACCGATG ACCGCATCCG TGCTCGCGCA TAACCCGTAT GCGGTCGCTG TCGATGCGTC GGATTCTTCA GGGCAGCTCG CCTCGTTCAG CAACTACGGC GTGGAAACGA CCGACGTGGT GGCTCCGGGC ATGAACATCA TGTCCACCGT GCCCCTCGGC AGGGCGGCCT ACGCTCCCGA AGCCGACGGC TCCCCTTTGA GCTACGAGAC GTTCGAAAGC GACGCCCCGT CGGTTACGGC GCGCGCGACG CCGACGTCGT CCGAGGATAT AGATGCCGTC GTGCAGGGGG CGACCCATTT CGATGCGAGG GGCGGAGCTC TGAAGATCCC CTTTTCCGAA ATGCTCACGG AAGAGACGGC GCTGGGCATC AGTTTTCGAA GCGTCGTGGT GTCGATGGAA GTCGATCCGT CGAACCAAGA GGGCGTCCGG TACGTAGGAA TGAACGCGAC GGCCACGAAC CAGAACAACG ACAACCTGCT TTACCAGGTT GCGATAATGA AAGACGGCGC GCTTGATTGG ACGCCGATGG GACAGACGGC TGTCGCGGCC CTTGGGAAGG ACCGAGGTTG GACGACCATC GCCTTCGACG CCCAGACGCT TGCCCAGAAT GCCGGCGGCA CCTTGGCGTT CGAAGGCGGC AAGCTGCAGG TGCGGCTGAC GTTCGCACGC GCCGGCACGC CGATCGCCCC GACCGATGCC CTGTACCTCG ACACGGTTGG AACGGGCAGG GCCGGCTCCG CGATTCCCTA TCAATCTTTG AGCGGGACGT CCATGGCAAC TCCCATGGTC ACCGGCGCGG CCATGGTGCT CGCGCAGGAT GTCGACGGCT CCACCCCGGC CGAGCGCGCT GCGAAGCTGG CGGCGCTCGT GAAGGCCTCC GTGCGTCCGG TGGACGCGTT CTCCGGCAAG TGCTCGTCGG GCGGACACCT TGATTTGAGG GTTGCGTCGG GCGACTTCGT CCCGGTGGTG TCGGGCGCTC GGATGGAATC CGACGGCGAC GCCGCACTCG TCATGGTGGA GGGCAGCTAC TTCGGCGCAG CTCAGGGTGC GGGAAGCGTG CACATCGGCG GCAAGGAGGC TGCTGTGCGC TCCTGGTCGG ACGGCTCCAT CGTAACGGAG TGCCCACAGG GTCTCAAAAG CGGCGTGCTG GTGGTAGAGG TAACGGCCGG CAACGGCAAG AGCGGATCGA AGGGCTTCCT GCTGCAGGTT CCCGAGCGAC CGGACGACCA ATCCACGCCC CTGTTCGAGA AGACCATAGC GCTTCCCACG TCGGATGAGG GCCTGCCGGT TGGCGTGTCG TACGGTCTTC TGACCGGGCT GGGCGGTTCG CTATACCTGC TGCCAAACGA CTTCAGCATA TCGCATGCGG GTACTTATGC GCTATGGCGC TACGACCCCG ATCAGGACAG CTGGACGCAG TGCGCCGACC TACCCGAGGC TCTCAGCAGC GCATCGATGA CCGTGTTCGA CGGCAAGCTG TACGTATACG GCGAAGCGGG CGAGACGGAT GGGGCGTTGG CTCCTCGTCT TGTCAGCTAC GACCCGGCGA CGAACGCCTG GACTTCCCAT GACGCTTCCC GCCTGCCGTT TCATGCGACG ATAGTGAACT GCGATGGAGC GATGCTGCTG GTGGGCGGCG CCGCGTACGA CGGTGTCAAA TGGGTCGAGC TGACCGAAGG CAACATCGCC GTGTACGATA CGCAGACGGG GGCGGTAACG GCTGCGGGAT CCCTCGCGAC AGGCCGCTCG ACTCCCTTTG TCGTAGCGCG AGGGGCCGAG CTGTTCGTCG CCTTGGGCGA TGTGCACGAC GTGACGGGCT CTTCGACGAC GAGCCCCGTA TTCGAGCGCA TCGTGAGATC CGGCGACTCG TACGTTGGCA ACGACCTGTC CGCCGCCCTG CCTGCGCTTG CGCCCGGCTA CGACGATTCG TTCGGCTTGG CAGCGGTCAA AGCCGGAGTG GTGCTGTCGG GCCGTATCAG CGTCGAAGAA GCCGACGGCG GACGCGAGCT CGATCAGGAC ACCTATCTGT TGGACTTGGC GAACGGAGAC AAAAGTTTCC AAGGCATCGA CAAGCGCGCC TCCCGGGTGC CGCTGTACTT CCCGAGCTCG ACAGCGTACG ACGGATGGCT GTACACCATG GGCTTCTCGG TCTACGAGAA CGGTGGCCGG GTGATACGGG CCACGCCGAT GGAGACGCTG CCCCAACCCG GCGACCTACC TGAACCTGGA CCCGGCCCTA CGCCCGGACC TGGCCCTGCA GCTGCGCCGG AACCGGTTGT CGGCAAGGGG GATGCAACGT CTCTGGCGAG AACCGGCGAC CCATTAGGCG CTTTCGTGCC GGCTATCACG CTGGCCGCCG GTGTCATGCT CGCAAGCATA GGCGTTGCCA CGGCTGCACG ACGAAAAGCG AAAAAGAACA GGCTATGA
|
Protein sequence | MDETIKQGKG ALRLLLATML AAMLIPWAPR AAWADENEAA AQTQTSEIAD LLAAGDYVEG EALVVVDNAA SRNSLRSRSV DPLSSAEPLM DASAETYATA TGVDVPLEQS VSNGPVLLRS ARSLPEDDAV SMVLVRQEGT STEDLLRQLQ DDPRILSAEP NYVQRLDDPD QTALDESTPQ VAAALGLDSS DAAPSAPASA DASGTAAADL SGYQWGCRNT GEVMVQEEAP PLAGFDIAPP NWNQVGTTNA ADVVAVVDTG VDYNHPDLNG IMCDMTQYTD RGGRYGINVV PGMDPTDPMD DHYHGTHCAG IIASQWDGVG TSGVASGVQL VAVRVAGRQN TIESADTIRA YEYLAEAVDA GLPLRAINNS WAGRTMSKAF SLAATKLGEK GAVTVIGSGN DAADVDKIPM TASVLAHNPY AVAVDASDSS GQLASFSNYG VETTDVVAPG MNIMSTVPLG RAAYAPEADG SPLSYETFES DAPSVTARAT PTSSEDIDAV VQGATHFDAR GGALKIPFSE MLTEETALGI SFRSVVVSME VDPSNQEGVR YVGMNATATN QNNDNLLYQV AIMKDGALDW TPMGQTAVAA LGKDRGWTTI AFDAQTLAQN AGGTLAFEGG KLQVRLTFAR AGTPIAPTDA LYLDTVGTGR AGSAIPYQSL SGTSMATPMV TGAAMVLAQD VDGSTPAERA AKLAALVKAS VRPVDAFSGK CSSGGHLDLR VASGDFVPVV SGARMESDGD AALVMVEGSY FGAAQGAGSV HIGGKEAAVR SWSDGSIVTE CPQGLKSGVL VVEVTAGNGK SGSKGFLLQV PERPDDQSTP LFEKTIALPT SDEGLPVGVS YGLLTGLGGS LYLLPNDFSI SHAGTYALWR YDPDQDSWTQ CADLPEALSS ASMTVFDGKL YVYGEAGETD GALAPRLVSY DPATNAWTSH DASRLPFHAT IVNCDGAMLL VGGAAYDGVK WVELTEGNIA VYDTQTGAVT AAGSLATGRS TPFVVARGAE LFVALGDVHD VTGSSTTSPV FERIVRSGDS YVGNDLSAAL PALAPGYDDS FGLAAVKAGV VLSGRISVEE ADGGRELDQD TYLLDLANGD KSFQGIDKRA SRVPLYFPSS TAYDGWLYTM GFSVYENGGR VIRATPMETL PQPGDLPEPG PGPTPGPGPA AAPEPVVGKG DATSLARTGD PLGAFVPAIT LAAGVMLASI GVATAARRKA KKNRL
|
| |