Gene ECH74115_1623 details

Gene Information

Locus tag: ECH74115_1623
Symbol:
ID: 6969781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1570981
End bp: 1572300
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 61%
IMG OID: 643385583
Product: minor capsid protein C
Protein accession: YP_002270077
Protein GI: 209398131
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
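
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1570981
end_bp = 1572300

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codon_count, leftover_bases = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover_bases, protein_length)
```

Under these assumptions the result matches the record: 1320 bp, which is 440 codons with no leftover bases, giving 439 amino acids.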


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0216288
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGCAG AGCTGCGTAA TCTCCCGCAT ATTGCCAGCA TGGCTTTTAA TGAGCCGCTG 
ATGCTTGAAC CCGCCTATGC GCGGGTTTTC TTTTGTGCGC TTGCAGGCCA GCTTGGGATC
AGCCGCCTGA CGGATGCAGT ATCCGGCGAC AGCCTGACTG CCGGAGAGGC ACCCGCGGCG
CTGGCGTTAT CCGGTGATGA TGACGGACCA CGACAGGCCC GGAGTTATCA GGTCATGAAC
GGCATCGCCG TGCTGCCGGT GTCCGGTACG CTGGTCAGCC GGACGCGGGC GCTGCAGCCG
TATTCGGGAA TGACCGGTTA CAACGGCATT ATCGCCCGTC TGCAACAGGC TGCCAGCGAT
CCGATGGTGG ACGGCATTCT GCTCGATATG GACACACCGG GCGGGATGGT GGCGGGAGCA
TTTGACTGTG CTGACATCAT CGCCCGTGTG CGAGACATAA AACCGGTATG GGCGCTGGCC
AACGACATGA ACTGCAGTGC AGGTCAGCTG CTTGCCAGCG CCGCCTCCCG GCGTCTGGTC
ACGCAGACCG CCCGGACAGG CTCCATCGGC GTCATGATGG CTCACAGTAA TTACGGTGCT
GCCCTGGAGA AACAGGGCGT GGAAATCACG CTGATTTACA GCGGCAGCCA TAAGGTGGAT
GGCAACCCCT ACAGCCATCT ACCGGGTGAT GTCCGGGAAA CACTGCAGTC CCGGATGGAT
GCAACCCGCC GGATGTTTGC GCAGAAGGTG TCGGCATATA CCGGCCTGTC CGTGCAGGCT
GTGCTGGATA CCGAGGCTGC AGTGTACAGC GGTCAGGAGG CCATTGATGC CGGACTGGCT
GATGAACTTG TCAACAGCAC CGATGCGATC ACCGTTATGC GTGATGCACT GGATGCACGT
AAATCCCGTC TCTCAGGAGG GCGAATGACC AAAGAGACTC AATCAACAAC TGTTTCAGCC
ACTGCTTCGC AGGCTGACGT TACTGGCGTG GTGCCAGCGA CGGAGGGCGA AAACGCCAGC
GCGGCGCAGC CGGACGTGAA CGCGCAGATC ACCGCTGCGG TTGCGGCAGA AAACAGCCGC
ATTATGGGGA TCCTCAACTG TGAGGAGGCT CACGGACGCG AAGAACAGGC CCGCGTGCTG
GCAGAAACCC CCGGTATGAC CGTGGAAACG GCCCGCCGCA TTCTGGCCGC AGCACCACAG
AGTGCACAGG CGCGCAGTGA TACTGCGCTG GATCGTCTGA TGCAGGGGGC ACCGGCACCG
CTGGCTGCAG GTAACCCGGC ATCTGATGCC GTTAACGATT TGCTGAACAC ACCAGTGTAA
 
Protein sequence
MTAELRNLPH IASMAFNEPL MLEPAYARVF FCALAGQLGI SRLTDAVSGD SLTAGEAPAA 
LALSGDDDGP RQARSYQVMN GIAVLPVSGT LVSRTRALQP YSGMTGYNGI IARLQQAASD
PMVDGILLDM DTPGGMVAGA FDCADIIARV RDIKPVWALA NDMNCSAGQL LASAASRRLV
TQTARTGSIG VMMAHSNYGA ALEKQGVEIT LIYSGSHKVD GNPYSHLPGD VRETLQSRMD
ATRRMFAQKV SAYTGLSVQA VLDTEAAVYS GQEAIDAGLA DELVNSTDAI TVMRDALDAR
KSRLSGGRMT KETQSTTVSA TASQADVTGV VPATEGENAS AAQPDVNAQI TAAVAAENSR
IMGILNCEEA HGREEQARVL AETPGMTVET ARRILAAAPQ SAQARSDTAL DRLMQGAPAP
LAAGNPASDA VNDLLNTPV
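
The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch, not a full translator: it cleans the first 30 bases and the last 10 bases of the gene sequence, translates the opening fragment, and reads the final codon. The codon dictionary is a deliberately partial subset covering only the codons in this fragment, and the start-codon set is a subset of the initiation codons permitted by translation table 11; the names `clean_sequence`, `translate_fragment`, `CODONS`, and `START_CODONS` are hypothetical helpers introduced for illustration.

```python
# Partial codon table: only the codons that occur in the fragment below.
CODONS = {
    "GTG": "V", "ACA": "T", "GCA": "A", "GAG": "E", "CTG": "L",
    "CGT": "R", "AAT": "N", "CTC": "L", "CCG": "P", "CAT": "H",
}

# Subset of table 11 initiation codons; at the first position they are read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(text.split()).upper()


def translate_fragment(dna: str) -> str:
    """Translate a fragment that begins at the start codon, ignoring trailing bases."""
    usable = len(dna) - len(dna) % 3
    residues = []
    for position in range(0, usable, 3):
        codon = dna[position:position + 3]
        if position == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is translated as methionine
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First three blocks and last block of the gene sequence, copied from the record.
gene_start = clean_sequence("GTGACAGCAG AGCTGCGTAA TCTCCCGCAT")
gene_end = clean_sequence("ACCAGTGTAA")

peptide = translate_fragment(gene_start)
stop_codon = gene_end[-3:]

print(peptide)     # first ten residues of the protein
print(stop_codon)  # final codon of the gene
```

The translated fragment, MTAELRNLPH, agrees with the first block of the protein sequence, and the gene ends in TAA, a stop codon. The GTG at the first position is read as methionine because it acts as the initiation codon.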