Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3194 |
Symbol | clpX |
ID | 6066621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3500945 |
End bp | 3502219 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641602609 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001726143 |
Protein GI | 170021189 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000323878 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000107792 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGATA AACGCAAAGA TGGCTCAGGC AAATTGCTGT ATTGCTCTTT TTGCGGCAAA AGCCAGCATG AAGTGCGCAA GCTGATTGCC GGTCCATCCG TGTATATCTG CGACGAATGT GTTGATTTAT GTAACGACAT CATTCGCGAA GAGATTAAAG AAGTTGCACC GCATCGTGAA CGCAGTGCGC TACCGACGCC GCATGAAATT CGCAACCACC TGGACGATTA CGTTATCGGC CAGGAACAGG CGAAAAAAGT GCTGGCGGTC GCGGTATACA ACCATTACAA ACGTCTGCGC AACGGCGATA CCAGCAATGG CGTCGAGTTG GGCAAAAGTA ACATTCTGCT GATCGGTCCG ACCGGTTCCG GTAAAACGCT GCTGGCTGAA ACGCTGGCGC GCCTGCTGGA TGTTCCGTTC ACCATGGCCG ACGCGACTAC ACTGACCGAA GCCGGTTATG TGGGTGAAGA CGTTGAAAAC ATCATTCAGA AGCTGTTGCA GAAATGCGAC TACGATGTCC AGAAAGCACA GCGTGGTATT GTCTACATCG ATGAAATCGA CAAGATTTCT CGTAAGTCAG ACAACCCGTC CATTACCCGA GACGTTTCCG GTGAAGGCGT ACAGCAGGCA CTGTTGAAAC TGATCGAAGG TACGGTAGCT GCTGTTCCAC CGCAAGGTGG GCGTAAACAT CCGCAGCAGG AATTCTTGCA GGTTGATACC TCTAAGATCC TGTTTATTTG TGGCGGTGCG TTTGCCGGTC TGGATAAAGT GATTTCCCAC CGTGTAGAAA CCGGCTCCGG CATTGGTTTT GGCGCGACGG TAAAAGCGAA GTCCGACAAA GCAAGCGAAG GCGAGCTGCT GGCGCAGGTT GAACCGGAAG ATCTGATCAA GTTTGGTCTT ATCCCTGAGT TTATTGGTCG TCTGCCGGTT GTCGCAACGT TGAATGAACT GAGCGAAGAA GCTCTGATTC AGATCCTCAA AGAGCCGAAA AACGCCCTGA CCAAGCAGTA TCAGGCGCTG TTTAATCTGG AAGGCGTGGA TCTGGAATTC CGTGACGAGG CGCTGGATGC TATCGCTAAG AAAGCGATGG CGCGTAAAAC CGGTGCCCGT GGCCTGCGTT CCATCGTAGA AGCCGCACTG CTCGATACCA TGTACGATCT GCCGTCCATG GAAGACGTCG AAAAAGTGGT TATCGACGAG TCGGTAATTG ATGGTCAAAG CAAACCGTTG CTGATTTATG GCAAGCCGGA AGCGCAACAG GCATCTGGTG AATAA
|
Protein sequence | MTDKRKDGSG KLLYCSFCGK SQHEVRKLIA GPSVYICDEC VDLCNDIIRE EIKEVAPHRE RSALPTPHEI RNHLDDYVIG QEQAKKVLAV AVYNHYKRLR NGDTSNGVEL GKSNILLIGP TGSGKTLLAE TLARLLDVPF TMADATTLTE AGYVGEDVEN IIQKLLQKCD YDVQKAQRGI VYIDEIDKIS RKSDNPSITR DVSGEGVQQA LLKLIEGTVA AVPPQGGRKH PQQEFLQVDT SKILFICGGA FAGLDKVISH RVETGSGIGF GATVKAKSDK ASEGELLAQV EPEDLIKFGL IPEFIGRLPV VATLNELSEE ALIQILKEPK NALTKQYQAL FNLEGVDLEF RDEALDAIAK KAMARKTGAR GLRSIVEAAL LDTMYDLPSM EDVEKVVIDE SVIDGQSKPL LIYGKPEAQQ ASGE
|
| |