Gene Information |
Locus tag | Elen_1089 |
Symbol | |
ID | 8415379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1316321 |
End bp | 1317373 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645024052 |
Product | signal peptide peptidase SppA, 36K type |
Protein accession | YP_003181449 |
Protein GI | 257790843 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
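The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1316321, 1317373)
print(length)                     # 1053
print(protein_length_aa(length))  # 350
```

Both values agree with the Gene Length (1053 bp) and Protein Length (350 aa) fields of this record.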
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00447762 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000000000025278 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTCAAG ACAACAACTG GCAGCAGCAG GTGTGGCAGC AGCCTGCCGC GCCGCAGCAG CCTGTCGCGC CGGCCGCGCC GTACGGCTAC GCCGTGCAGC CCCCGAAGAA AAGCCGCGGC TGGATCGTCG CCCTCGTGGC CGTCGTGCTC GTGTTCGCGC TGCTGGCGCT GGGCATGTGG TCGTGCACGT CGGTCATGTC CTCGTCGTTC GGGTCCTTCG GCACGGGCTC CACGGTCGAC GACGTGGACT ACCTCACGGG CGACGCGGTC GGCGTCATCG ACATCGACGG CACCATCCAG TACGACAACA CCACCTCCAG CCCCGAAGGC CTGAAGGCCC AGCTCGATCG CGCCGAGAAG AACAGCCATA TCAAGGCCGT CGTACTGCGT GTGAACTCCG GCGGCGGCAC GGCTACGGCG GGCGAGGAGA TGGCCGACTA CGTGCGCGGG TTCTCCGAGC GCACCGGCAA GCCTGTCGTG GTGTCCAGCG CGTCCGTCAA TGCGAGCGCC GCCTATGAGA TATCCTCGCA GGCCGACTAT ATCTACACGG CCAAGACCAC GGCCATCGGC GCCATCGGCA CGGTCATGCA GGTTACCGAC CTGTCCGGCC TCATGGAGAA GCTGGGCATC TCGGTGGACA ACGTCACCAG CGCCGACAGC AAGGATTCCA GCTACGGCAC GCGCCCGCTC ACCGAGGAGG AGCGCGCCTA CTACCAGGAT CAGGTCGACC AGATCAACGA GACATTCATC CAGACCGTGG CCGAGGGTCG CGACATGCCC GTCGAAGACG TGCGCGCGCT GGCCACGGGT CTCACGTTCA CCGGCATGAC GGCAGTCGAG AACGGCCTTG CCGACGAGAT CGGCACCAAG GACGACGCCG TGGCGAAGGC AGCCGAGCTG GCGAACATCG CGCACTACAC CACCGTCACG CTCAAGAATC CCACGAGCAG CCTGTCGAGC CTGCTCGACC TCATGTCAAA GAGCAACGTT TCCACCGACG ATATCGCCCG AGCGCTGAAG GAGCTGGACA CCGATGGCAG CATCGCCCAA TAG
|
Protein sequence | MSQDNNWQQQ VWQQPAAPQQ PVAPAAPYGY AVQPPKKSRG WIVALVAVVL VFALLALGMW SCTSVMSSSF GSFGTGSTVD DVDYLTGDAV GVIDIDGTIQ YDNTTSSPEG LKAQLDRAEK NSHIKAVVLR VNSGGGTATA GEEMADYVRG FSERTGKPVV VSSASVNASA AYEISSQADY IYTAKTTAIG AIGTVMQVTD LSGLMEKLGI SVDNVTSADS KDSSYGTRPL TEEERAYYQD QVDQINETFI QTVAEGRDMP VEDVRALATG LTFTGMTAVE NGLADEIGTK DDAVAKAAEL ANIAHYTTVT LKNPTSSLSS LLDLMSKSNV STDDIARALK ELDTDGSIAQ
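The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own method: it uses the codon-to-amino-acid assignments of translation table 11 (which match the standard code for elongation), treats the reading frame as starting at the first base, and ignores the alternative start codons that table 11 permits, since this gene begins with ATG. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTCTCAAG ACAACAACTG GCAGCAGCAG"
print(translate(first_blocks))  # MSQDNNWQQQ
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.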