Gene ECH74115_0524 details

Gene Information

Locus tag: ECH74115_0524
Symbol: clpX
ID: 6970141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 527064
End bp: 528338
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 52%
IMG OID: 643384571
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_002269085
Protein GI: 209398902
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
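
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and assumes the gene length counts the terminal stop codon; neither convention is stated in the record, although both agree with its values. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(527064, 528338)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```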


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000133307
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.672267
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGATA AACGCAAAGA TGGCTCAGGC AAATTGCTGT ATTGCTCTTT TTGCGGCAAA 
AGCCAGCATG AAGTGCGCAA GCTGATTGCC GGTCCATCCG TGTATATCTG CGACGAATGT
GTTGATTTAT GTAACGACAT CATTCGCGAA GAGATTAAAG AAGTTGCACC GCATCGTGAA
CGCAGTGCGC TACCGACGCC GCATGAAATT CGTAACCACC TGGACGATTA CGTTATCGGC
CAGGAACAGG CGAAAAAAGT GCTGGCGGTC GCGGTATACA ACCACTACAA ACGTCTGCGC
AACGGCGATA CCAGCAATGG CGTCGAGTTG GGCAAAAGTA ACATTCTGCT GATCGGTCCG
ACCGGTTCCG GTAAAACGCT GCTGGCTGAA ACGCTGGCGC GCCTGCTGGA CGTCCCGTTC
ACCATGGCCG ACGCAACCAC GCTGACCGAA GCCGGTTATG TGGGCGAAGA CGTTGAAAAC
ATCATTCAGA AGCTGTTGCA AAAGTGCGAT TACGACGTAC AGAAAGCGCA GCGCGGGATT
GTCTACATCG ATGAAATCGA CAAGATTTCT CGTAAGTCAG ACAACCCGTC TATTACCCGT
GACGTTTCCG GTGAAGGCGT ACAGCAGGCA CTGTTGAAAC TGATCGAAGG TACGGTAGCT
GCTGTTCCAC CGCAAGGTGG ACGTAAACAT CCGCAGCAGG AATTCTTGCA GGTTGATACC
TCTAAGATCC TGTTTATTTG TGGCGGTGCG TTTGCCGGTC TGGATAAAGT GATTTCCCAT
CGTGTAGAAA CCGGCTCCGG CATTGGTTTT GGCGCGACGG TAAAAGCGAA GTCCGACAAA
GCAAGCGAAG GCGAGCTGCT GGCGCAGGTT GAACCGGAAG ATCTGATCAA GTTTGGTCTT
ATCCCTGAGT TTATTGGTCG TCTGCCGGTT GTCGCAACGT TGAATGAACT GAGCGAAGAA
GCTCTGATTC AGATCCTCAA AGAGCCGAAA AACGCCCTGA CCAAGCAGTA TCAGGCGCTG
TTTAATCTGG AAGGCGTGGA TCTGGAATTC CGTGACGAGG CGCTGGATGC TATCGCTAAG
AAAGCGATGG CGCGTAAAAC CGGTGCCCGT GGCCTGCGTT CCATCGTAGA AGCCGCACTG
CTCGATACCA TGTACGATCT GCCGTCCATG GAAGATGTCG AAAAAGTGGT TATCGACGAG
TCGGTAATTG ATGGTCAAAG CAAACCGTTG CTGATTTATG GCAAGCCGGA AGCGCAACAG
GCATCTGGTG AATAA
 
Protein sequence
MTDKRKDGSG KLLYCSFCGK SQHEVRKLIA GPSVYICDEC VDLCNDIIRE EIKEVAPHRE 
RSALPTPHEI RNHLDDYVIG QEQAKKVLAV AVYNHYKRLR NGDTSNGVEL GKSNILLIGP
TGSGKTLLAE TLARLLDVPF TMADATTLTE AGYVGEDVEN IIQKLLQKCD YDVQKAQRGI
VYIDEIDKIS RKSDNPSITR DVSGEGVQQA LLKLIEGTVA AVPPQGGRKH PQQEFLQVDT
SKILFICGGA FAGLDKVISH RVETGSGIGF GATVKAKSDK ASEGELLAQV EPEDLIKFGL
IPEFIGRLPV VATLNELSEE ALIQILKEPK NALTKQYQAL FNLEGVDLEF RDEALDAIAK
KAMARKTGAR GLRSIVEAAL LDTMYDLPSM EDVEKVVIDE SVIDGQSKPL LIYGKPEAQQ
ASGE
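
The relationship between the gene and protein sequences can be sketched as below. This is a minimal illustration, not the tool that produced this record. It assumes the standard codon assignments, which NCBI translation table 11 shares with table 1 for elongation (the two differ in which codons are permitted as starts; this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the NCBI "TCAG" ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGACAGATA AACGCAAAGA TGGCTCAGGC AAATTGCTGT ATTGCTCTTT TTGCGGCAAA"
last_row = "GCATCTGGTG AATAA"

print(translate(first_row))  # MTDKRKDGSGKLLYCSFCGK
print(translate(last_row))   # ASGE
```

The two outputs match the first 20 and last 4 residues of the protein sequence, and the final codon of the gene, TAA, is a stop.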