Gene EcolC_2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2094 
Symbol 
ID6067300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2290665 
End bp2291984 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content61% 
IMG OID641601502 
Productpeptidase S49 
Protein accessionYP_001725061 
Protein GI170020107 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00430491 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGCAG AGCTGCGTAA TCTCCCGCAT ATTGCCAGCA TGGCCTTTAA TGAGCCGCTG 
ATGCTTGAAC CCGCCTATGC GCGGGTTTTC TTTTGTGCGC TTGCAGGCCA GCTTGGGATC
AGCCGCCTGA CGGATGCGGT GTCCGGCGAC AGCCTGACTG CCCAGGAGGC ACTCGCGACG
CTGGCATTAT CCGGTGATGA TGACGGACCA CGACAGGCCC GCAGTTATCA GGTCATGAAC
GGCATCGCCG TGCTGCCGGT GTCCGGCACG CTGGTCAGCC GGACGCGGGC GCTGCAGCCG
TACTCGGGGA TGACCGGTTA CAACGGCATT ATCGCCCGTC TGCAACAGGC TGCCAGCGAT
CCGATGGTGG ACGGCATTCT GCTCGATATG GACACGCCCG GCGGGATGGT GGCGGGGGCA
TTTGACTGCG CTGACATCAT CGCCCGTGTG CGTGACATAA AACCGGTATG GGCGCTTGCC
AACGACATGA ACTGCAGTGC AGGTCAGTTG CTTGCCAGTG CCGCCTCCCG GCGTCTGGTC
ACGCAGACCG CCCGGACAGG CTCCATCGGC GTCATGATGG CTCACAGTAA TTACGGTGCT
GCGCTGGAGA AACAGGGTGT GGAAATCACG CTGATTTACA GCGGCAGCCA TAAGGTGGAT
GGCAACCCCT ACAGCCATCT TCCGGATGAC GTCCGGGAGA CACTGCAGTC CCGGATGGAT
GCAACCCGCC AGATGTTTGC GCAGAAGGTG TCGGCATATA CCGGCCTGTC CGTGCAGGCT
GTGCTGGATA CCGAGGCTGC AGTGTACAGC GGTCAGGAGG TCATTGATGC CGGACTGGCT
GATGAACTTG TTAACAGCAC CGATGCGATC ACCGTCATGC GTGATGCACT GGATGCACGT
AAATCCCGTC TCTCAGGAGG GCGAATGACC AAAGAGACTC AATCAACAAC TGTTTCAGCC
ACTGCTTCGC AGGCTGACGT TACTGACGTG GTGCCAGCGA CGGAGGGCGA AAACGCCAGC
GCGGCGCAGC CGGACGTGAA CGCGCAGATC ACCGCAGCGG TTGCGGCAGA AAACAGCCGC
ATTATGGGGA TCCTCAACTG TGAGGAGGCT CACGGACGCG AAGAACAGGC ACGCGTGCTG
GCCGAAACCC CCGGTATGAC CGTGGAAACG GCCCGCCGCA TTCTGGCCGC AGCACCACAG
AGTGCACAGG CGCGCAGTGA CACTGCGCTG GATCGTCTGA TGCAGGGGGC ACCGGCACCG
CTGGCTTCAG GTAACCCGGC ATCTGATGCC GTTAACGATT TGCTGAACAC ACCAGTGTAA
 
Protein sequence
MTAELRNLPH IASMAFNEPL MLEPAYARVF FCALAGQLGI SRLTDAVSGD SLTAQEALAT 
LALSGDDDGP RQARSYQVMN GIAVLPVSGT LVSRTRALQP YSGMTGYNGI IARLQQAASD
PMVDGILLDM DTPGGMVAGA FDCADIIARV RDIKPVWALA NDMNCSAGQL LASAASRRLV
TQTARTGSIG VMMAHSNYGA ALEKQGVEIT LIYSGSHKVD GNPYSHLPDD VRETLQSRMD
ATRQMFAQKV SAYTGLSVQA VLDTEAAVYS GQEVIDAGLA DELVNSTDAI TVMRDALDAR
KSRLSGGRMT KETQSTTVSA TASQADVTDV VPATEGENAS AAQPDVNAQI TAAVAAENSR
IMGILNCEEA HGREEQARVL AETPGMTVET ARRILAAAPQ SAQARSDTAL DRLMQGAPAP
LASGNPASDA VNDLLNTPV