Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2417 |
Symbol | clpX |
ID | 5834043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2722189 |
End bp | 2723460 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641368217 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001639883 |
Protein GI | 163851840 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGA CAGGCGGCAA CGACTCGAAG AGCACGCTGT ACTGCTCGTT CTGCGGCAAG AGCCAGCACG AGGTTCGCAA GCTGATTGCG GGCCCGACGG TGTTCATCTG CGACGAGTGC GTCGAGCTGT GCATGGATAT CATCCGCGAG GAATCGAAGT CCTCGCTGGT GAAGTCCCGC GACGGGGTGC CGACCCCGAA GGAGATCCGG CGCGTCCTCG ACGACTACGT CATCGGCCAG GACTTCGCCA AGAAGGTCCT CTCGGTCGCC GTGCACAACC ACTACAAGCG GCTGGCCCAC GCGACGAAGC ACAACGACGT CGAACTGGCT AAGTCCAACA TCATGCTGAT CGGGCCGACG GGCTCGGGCA AGACGCTGCT CGCGCAGACG CTCGCCCGCA TCCTCGACGT GCCCTTCACC ATGGCCGACG CCACCACGCT GACCGAAGCG GGCTATGTCG GCGAGGACGT CGAGAACATC ATCCTCAAGC TGCTCCAGGC CTCCGACTAC AACGTCGAGC GGGCGCAGCG CGGCATCGTC TACATCGACG AGATCGACAA GATCTCCCGC AAGTCGGACA ACCCCTCGAT CACCCGCGAC GTCTCGGGCG AGGGCGTGCA GCAGGCACTC CTGAAGATCA TGGAAGGCAC CGTCGCCTCC GTGCCTCCGC AGGGCGGCCG CAAGCACCCG CAGCAGGAGT TCCTGCAGGT CGACACCACG AACATCCTGT TCATCTGCGG CGGCGCCTTC GCCGGGCTGG AGCGCATCAT CTCCCAGCGC GGCAAGGGGA CCTCGATCGG CTTCGGTGCC AGCGTCCAGG CGCCCGACGA TCGCCGCACC GGCGAGGTGT TCCGCTCGGT CGAGCCCGAG GATCTGCTGA AGTTCGGCCT GATCCCGGAA TTCGTCGGCC GTCTGCCGGT TCTGGCGACG CTGGAGGATC TCGATGAGGA GGCCCTCAAG AAGATCCTGC AGGAGCCGAA GAACGCGCTG GTCAAGCAGT ACCAGCGGCT GTTCGAGATG GAGAACGTCG AGCTGACCTT CCAGGACGAG GCGCTCAGCC TCGTCGCCCG CAAGGCCATC GAGCGCAAGA CCGGCGCCCG CGGCCTCCGG TCGATCCTGG AGACCATCCT CCTCGACACG ATGTACGACC TGCCCGGCCT CGAATCCGTC GAGCAGGTGG TCATCGGCCC GGAGGTGGTC GAGGGCAAAT CCAGGCCGCT CTTCATCCAC GGCGACCGCA ACAAGGAAGC CCCGGCCAGC GTCAGCGCCT GA
|
Protein sequence | MSKTGGNDSK STLYCSFCGK SQHEVRKLIA GPTVFICDEC VELCMDIIRE ESKSSLVKSR DGVPTPKEIR RVLDDYVIGQ DFAKKVLSVA VHNHYKRLAH ATKHNDVELA KSNIMLIGPT GSGKTLLAQT LARILDVPFT MADATTLTEA GYVGEDVENI ILKLLQASDY NVERAQRGIV YIDEIDKISR KSDNPSITRD VSGEGVQQAL LKIMEGTVAS VPPQGGRKHP QQEFLQVDTT NILFICGGAF AGLERIISQR GKGTSIGFGA SVQAPDDRRT GEVFRSVEPE DLLKFGLIPE FVGRLPVLAT LEDLDEEALK KILQEPKNAL VKQYQRLFEM ENVELTFQDE ALSLVARKAI ERKTGARGLR SILETILLDT MYDLPGLESV EQVVIGPEVV EGKSRPLFIH GDRNKEAPAS VSA
|
| |