Gene Elen_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1089 
Symbol 
ID8415379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1316321 
End bp1317373 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content65% 
IMG OID645024052 
Productsignal peptide peptidase SppA, 36K type 
Protein accessionYP_003181449 
Protein GI257790843 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00447762 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000000025278 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTCAAG ACAACAACTG GCAGCAGCAG GTGTGGCAGC AGCCTGCCGC GCCGCAGCAG 
CCTGTCGCGC CGGCCGCGCC GTACGGCTAC GCCGTGCAGC CCCCGAAGAA AAGCCGCGGC
TGGATCGTCG CCCTCGTGGC CGTCGTGCTC GTGTTCGCGC TGCTGGCGCT GGGCATGTGG
TCGTGCACGT CGGTCATGTC CTCGTCGTTC GGGTCCTTCG GCACGGGCTC CACGGTCGAC
GACGTGGACT ACCTCACGGG CGACGCGGTC GGCGTCATCG ACATCGACGG CACCATCCAG
TACGACAACA CCACCTCCAG CCCCGAAGGC CTGAAGGCCC AGCTCGATCG CGCCGAGAAG
AACAGCCATA TCAAGGCCGT CGTACTGCGT GTGAACTCCG GCGGCGGCAC GGCTACGGCG
GGCGAGGAGA TGGCCGACTA CGTGCGCGGG TTCTCCGAGC GCACCGGCAA GCCTGTCGTG
GTGTCCAGCG CGTCCGTCAA TGCGAGCGCC GCCTATGAGA TATCCTCGCA GGCCGACTAT
ATCTACACGG CCAAGACCAC GGCCATCGGC GCCATCGGCA CGGTCATGCA GGTTACCGAC
CTGTCCGGCC TCATGGAGAA GCTGGGCATC TCGGTGGACA ACGTCACCAG CGCCGACAGC
AAGGATTCCA GCTACGGCAC GCGCCCGCTC ACCGAGGAGG AGCGCGCCTA CTACCAGGAT
CAGGTCGACC AGATCAACGA GACATTCATC CAGACCGTGG CCGAGGGTCG CGACATGCCC
GTCGAAGACG TGCGCGCGCT GGCCACGGGT CTCACGTTCA CCGGCATGAC GGCAGTCGAG
AACGGCCTTG CCGACGAGAT CGGCACCAAG GACGACGCCG TGGCGAAGGC AGCCGAGCTG
GCGAACATCG CGCACTACAC CACCGTCACG CTCAAGAATC CCACGAGCAG CCTGTCGAGC
CTGCTCGACC TCATGTCAAA GAGCAACGTT TCCACCGACG ATATCGCCCG AGCGCTGAAG
GAGCTGGACA CCGATGGCAG CATCGCCCAA TAG
 
Protein sequence
MSQDNNWQQQ VWQQPAAPQQ PVAPAAPYGY AVQPPKKSRG WIVALVAVVL VFALLALGMW 
SCTSVMSSSF GSFGTGSTVD DVDYLTGDAV GVIDIDGTIQ YDNTTSSPEG LKAQLDRAEK
NSHIKAVVLR VNSGGGTATA GEEMADYVRG FSERTGKPVV VSSASVNASA AYEISSQADY
IYTAKTTAIG AIGTVMQVTD LSGLMEKLGI SVDNVTSADS KDSSYGTRPL TEEERAYYQD
QVDQINETFI QTVAEGRDMP VEDVRALATG LTFTGMTAVE NGLADEIGTK DDAVAKAAEL
ANIAHYTTVT LKNPTSSLSS LLDLMSKSNV STDDIARALK ELDTDGSIAQ