Gene EcSMS35_0481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0481 
SymbolclpX 
ID6147502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp485728 
End bp487002 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content52% 
IMG OID641615375 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001742582 
Protein GI170682782 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000637699 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATA AACGCAAAGA TGGCTCAGGC AAATTGCTGT ATTGCTCTTT TTGCGGCAAA 
AGCCAGCATG AAGTGCGCAA GCTGATTGCC GGTCCATCCG TGTATATCTG CGACGAATGT
GTTGATTTAT GTAACGACAT CATTCGCGAA GAGATTAAAG AAGTTGCACC GCATCGTGAA
CGCAGTGCGC TACCGACGCC GCATGAAATT CGCAACCACC TGGACGATTA CGTTATCGGT
CAGGAACAGG CGAAAAAAGT GCTGGCGGTC GCGGTATACA ACCACTACAA ACGTCTGCGC
AACGGAGATA CCAGCAATGG CGTCGAGTTG GGCAAAAGTA ACATTCTGCT GATCGGTCCG
ACCGGTTCCG GTAAAACGCT GCTGGCCGAA ACGCTGGCGC GCCTGCTGGA CGTTCCGTTC
ACCATGGCTG ACGCAACCAC GCTGACCGAA GCCGGTTATG TGGGCGAAGA CGTTGAAAAC
ATCATTCAGA AGCTGTTGCA GAAGTGCGAT TACGACGTAC AGAAAGCGCA GCGCGGGATT
GTCTACATCG ATGAAATTGA CAAGATTTCT CGTAAGTCAG ACAACCCGTC CATTACCCGA
GACGTTTCCG GTGAAGGCGT ACAGCAGGCA CTGTTAAAAC TGATCGAAGG TACGGTAGCT
GCTGTTCCGC CGCAAGGTGG GCGTAAACAT CCGCAGCAGG AATTCTTGCA GGTTGATACC
TCTAAGATCC TGTTTATCTG TGGCGGTGCG TTTGCCGGTC TGGATAAAGT GATTTCCCAT
CGAGTAGAAA CCGGTTCCGG CATTGGTTTT GGCGCGACGG TAAAAGCGAA GTCCGACAAA
GCAAGCGAAG GTGAACTGCT GGCGCAGGTT GAACCGGAAG ATCTGATCAA GTTTGGTCTG
ATCCCTGAGT TCATTGGTCG TCTGCCGGTT GTCGCAACGT TGAATGAACT GAGCGAAGAA
GCTCTGATTC AGATCCTCAA AGAGCCGAAA AACGCCCTGA CCAAGCAGTA TCAGGCGCTG
TTTAATCTGG AAGGTGTGGA TCTGGAATTC CGTGACGAGG CGCTGGATGC TATCGCTAAG
AAAGCGATGG CGCGTAAAAC CGGTGCCCGT GGCCTGCGTT CCATCGTAGA AGCCGCACTG
CTCGATACCA TGTACGATCT GCCGTCCATG GAAGACGTCG AAAAAGTGGT TATCGACGAA
TCGGTAATTG ATGGTCAAAG CAAGCCGTTG CTGATTTATG GCAAGCCGGA AGCGCAACAG
GCATCTGGTG AATAA
 
Protein sequence
MTDKRKDGSG KLLYCSFCGK SQHEVRKLIA GPSVYICDEC VDLCNDIIRE EIKEVAPHRE 
RSALPTPHEI RNHLDDYVIG QEQAKKVLAV AVYNHYKRLR NGDTSNGVEL GKSNILLIGP
TGSGKTLLAE TLARLLDVPF TMADATTLTE AGYVGEDVEN IIQKLLQKCD YDVQKAQRGI
VYIDEIDKIS RKSDNPSITR DVSGEGVQQA LLKLIEGTVA AVPPQGGRKH PQQEFLQVDT
SKILFICGGA FAGLDKVISH RVETGSGIGF GATVKAKSDK ASEGELLAQV EPEDLIKFGL
IPEFIGRLPV VATLNELSEE ALIQILKEPK NALTKQYQAL FNLEGVDLEF RDEALDAIAK
KAMARKTGAR GLRSIVEAAL LDTMYDLPSM EDVEKVVIDE SVIDGQSKPL LIYGKPEAQQ
ASGE