Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0481 |
Symbol | clpX |
ID | 6147502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 485728 |
End bp | 487002 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615375 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001742582 |
Protein GI | 170682782 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000637699 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGATA AACGCAAAGA TGGCTCAGGC AAATTGCTGT ATTGCTCTTT TTGCGGCAAA AGCCAGCATG AAGTGCGCAA GCTGATTGCC GGTCCATCCG TGTATATCTG CGACGAATGT GTTGATTTAT GTAACGACAT CATTCGCGAA GAGATTAAAG AAGTTGCACC GCATCGTGAA CGCAGTGCGC TACCGACGCC GCATGAAATT CGCAACCACC TGGACGATTA CGTTATCGGT CAGGAACAGG CGAAAAAAGT GCTGGCGGTC GCGGTATACA ACCACTACAA ACGTCTGCGC AACGGAGATA CCAGCAATGG CGTCGAGTTG GGCAAAAGTA ACATTCTGCT GATCGGTCCG ACCGGTTCCG GTAAAACGCT GCTGGCCGAA ACGCTGGCGC GCCTGCTGGA CGTTCCGTTC ACCATGGCTG ACGCAACCAC GCTGACCGAA GCCGGTTATG TGGGCGAAGA CGTTGAAAAC ATCATTCAGA AGCTGTTGCA GAAGTGCGAT TACGACGTAC AGAAAGCGCA GCGCGGGATT GTCTACATCG ATGAAATTGA CAAGATTTCT CGTAAGTCAG ACAACCCGTC CATTACCCGA GACGTTTCCG GTGAAGGCGT ACAGCAGGCA CTGTTAAAAC TGATCGAAGG TACGGTAGCT GCTGTTCCGC CGCAAGGTGG GCGTAAACAT CCGCAGCAGG AATTCTTGCA GGTTGATACC TCTAAGATCC TGTTTATCTG TGGCGGTGCG TTTGCCGGTC TGGATAAAGT GATTTCCCAT CGAGTAGAAA CCGGTTCCGG CATTGGTTTT GGCGCGACGG TAAAAGCGAA GTCCGACAAA GCAAGCGAAG GTGAACTGCT GGCGCAGGTT GAACCGGAAG ATCTGATCAA GTTTGGTCTG ATCCCTGAGT TCATTGGTCG TCTGCCGGTT GTCGCAACGT TGAATGAACT GAGCGAAGAA GCTCTGATTC AGATCCTCAA AGAGCCGAAA AACGCCCTGA CCAAGCAGTA TCAGGCGCTG TTTAATCTGG AAGGTGTGGA TCTGGAATTC CGTGACGAGG CGCTGGATGC TATCGCTAAG AAAGCGATGG CGCGTAAAAC CGGTGCCCGT GGCCTGCGTT CCATCGTAGA AGCCGCACTG CTCGATACCA TGTACGATCT GCCGTCCATG GAAGACGTCG AAAAAGTGGT TATCGACGAA TCGGTAATTG ATGGTCAAAG CAAGCCGTTG CTGATTTATG GCAAGCCGGA AGCGCAACAG GCATCTGGTG AATAA
|
Protein sequence | MTDKRKDGSG KLLYCSFCGK SQHEVRKLIA GPSVYICDEC VDLCNDIIRE EIKEVAPHRE RSALPTPHEI RNHLDDYVIG QEQAKKVLAV AVYNHYKRLR NGDTSNGVEL GKSNILLIGP TGSGKTLLAE TLARLLDVPF TMADATTLTE AGYVGEDVEN IIQKLLQKCD YDVQKAQRGI VYIDEIDKIS RKSDNPSITR DVSGEGVQQA LLKLIEGTVA AVPPQGGRKH PQQEFLQVDT SKILFICGGA FAGLDKVISH RVETGSGIGF GATVKAKSDK ASEGELLAQV EPEDLIKFGL IPEFIGRLPV VATLNELSEE ALIQILKEPK NALTKQYQAL FNLEGVDLEF RDEALDAIAK KAMARKTGAR GLRSIVEAAL LDTMYDLPSM EDVEKVVIDE SVIDGQSKPL LIYGKPEAQQ ASGE
|
| |