Gene Elen_1994 details

Gene Information

Locus tag: Elen_1994
Symbol:
ID: 8416305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2338105
End bp: 2339319
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 64%
IMG OID: 645024971
Product: C-terminal processing peptidase
Protein accession: YP_003182347
Protein GI: 257791741
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
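
The start and end coordinates, the gene length, and the protein length in this record agree with one another, and the check is simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2338105, 2339319)
print(length, protein_length_aa(length))  # 1215 404
```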


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.72907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0163388
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTTA TCAAGGCCTG CATCGTCGTC ATCGTCGTGA GCGTCGCTTT CGTCGCGGGC 
TTTACTGCGC GAGGCGACGC TTCTCTTCTC GAGTCGCTCG GCTTCACGTC GCTTGTGGTG
GACGTCGACC GCAATCCGGG ATCCACCACG TCGGGCGACA CCTACGACTC CCTCGGGGCG
CGCGTGGAGG AGGTCGAGGG CATCATCGAC AACGACAGCT TGGACTCCTA CGATTTGAAC
ATGGCCACGA CCAACGTGCT GAACGCGCTT TCGGACACGA CGGAAGACGC GTATCTGCGC
TATTACGATC CTGCGCGCTA CGCCGCGCTC ATGCAGGACA GCGCCGAGCA GTCGGCGGGC
ATCGGCGTGC TGTTCTCCGA ATACAAAGGG CGCGCTTACG CGGCCGACGT GTTCGAGGGT
TCGGCCGCCC AGATGGCCGA CGTGCGCAGC GGCGATTTCG TGGTGGCCAT CGACGGCGAC
CGCGGACACG AGTGGACCAC GAACGAGGTG ACCTCCGCGC TCAAGCGCGA GGAGGGCGAG
AACGTCGTCA TCACGTGGCG GCGCGCCAGC TCGCTCGACG ACGAGGGCGG CGAGGAGTTC
ACTACGACGC TCGTGTGCTC GAACTACGCG GTGAAGAACG TGGAGACCGA GCTTTCCGAC
ACGGTGGGCT ACATCAAGCT CAAGCAGATC ACGCAGAACG CGGCAGACCT CGTGAAGAAC
GCGGTCGCCG ACCTCGAGTC GCAAGGCGCC ACATCGTTCG TGCTCGATAT CCGCGACAAC
CCCGGCGGTT TCCTCACGCA GTCGGTGGAT ATCGCCAGCC TGTTCGTGAA GAGCGGCACC
ATTGTGAAGA TCCAGACGAA GGCCGAGGAG ACGTCCAAGA CCACCTCGCG CCCCTACGTG
ACGGAGAAGC CGCTGGTCTT GCTCGTGAAC GGCAAGACCT CCGCTTCCGC CGAGGTGCTG
GCCGCGTCGC TCAAGGACAA CCAGCGCGCC ACGCTCGTGG GATCCACGAC GTTGGGCAAG
GGCTCGGTGC AGGTGACGCG CGATTTGAGT TTCGGCGGAG CGCTGCGCTA CACCGCCGCG
TTCTACAAGA GCCCGCTCGG ACACGACATC GAAGGCGTGG GCGTCACCCC CGACGTGATG
GTGGGTTTGG CGGAGGGCGA GGATAACCAG AAGGCTCTCG CGCTCGAAAC CGCCCAGTCC
CTCGTGAAGG GCTAG
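
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. A GC-content figure like the one in this record could be recomputed along the following lines. This is a sketch with hypothetical helper names, and it assumes GC content means the percentage of G and C bases over the full gene. The example runs only on the first line of the sequence; pasting in all lines would be needed to compare against the reported value.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display, then uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only, as a demonstration.
first_line = "ATGCGCCTTA TCAAGGCCTG CATCGTCGTC ATCGTCGTGA GCGTCGCTTT CGTCGCGGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```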
 
Protein sequence
MRLIKACIVV IVVSVAFVAG FTARGDASLL ESLGFTSLVV DVDRNPGSTT SGDTYDSLGA 
RVEEVEGIID NDSLDSYDLN MATTNVLNAL SDTTEDAYLR YYDPARYAAL MQDSAEQSAG
IGVLFSEYKG RAYAADVFEG SAAQMADVRS GDFVVAIDGD RGHEWTTNEV TSALKREEGE
NVVITWRRAS SLDDEGGEEF TTTLVCSNYA VKNVETELSD TVGYIKLKQI TQNAADLVKN
AVADLESQGA TSFVLDIRDN PGGFLTQSVD IASLFVKSGT IVKIQTKAEE TSKTTSRPYV
TEKPLVLLVN GKTSASAEVL AASLKDNQRA TLVGSTTLGK GSVQVTRDLS FGGALRYTAA
FYKSPLGHDI EGVGVTPDVM VGLAEGEDNQ KALALETAQS LVKG
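
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). Table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons. The following is a rough sketch of that translation in plain Python. The `translate` function is illustrative, it stops at the first stop codon, and it does not handle alternative start codons such as GTG or TTG, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional NCBI order (bases T, C, A, G; first base
# varies slowest). The amino acid assignments are those of the standard
# code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and the final two codons of the gene sequence above.
print(translate("ATGCGCCTTA TCAAGGCCTG C"))  # MRLIKAC
print(translate("GGCTAG"))  # G
```

The two results match the start (MRLIKAC) and the last residue (G) of the protein sequence listed in this record.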