Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_18471 |
Symbol | clpX |
ID | 4911049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1572250 |
End bp | 1573617 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640161452 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001092071 |
Protein GI | 126697185 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAT TCGACGCCCA TCTTAAATGT TCATTTTGCG GGAAATCACA AGACCAAGTA AGAAAGCTTA TTGCTGGTCC TGGGGTTTAT ATCTGTGATG AGTGCATAGA TCTTTGTAAT GAAATTCTCG ATGAAGAACT ACTTGATAAT CAAGCGAACA CTAATAACTC TCCGCAAGTA AAAAAGAAAT TACCAACTGA TAATCCAAAA AAATCTGTTC CTTTAGAATT AACCTCTATT CCTAAGCCAT TAGAAATTAA AAGTTTTCTA GATAATCAAG TTGTTGGACA AGAATCTGCA AAAAAAATAT TATCTGTAGC CGTATACAAT CACTACAAGC GATTAGCTTG GAAATTTAAA GAAGAAAATA AAAATAGCAA TTCAAAAGAT TCACAAGCAA CTAAATTACA AAAATCAAAT ATTTTACTCA TCGGCCCTAC TGGAAGTGGG AAAACATTAT TGGCGCAAAC TTTAGCAGAG TTTCTGGATG TTCCTTTTGC AGTAGCTGAT GCAACGACTT TGACAGAAGC TGGATATGTT GGGGAGGATG TTGAAAATAT ACTTTTAAGA CTTCTACAAA AATCAGAAAT GAATGTAGAA CTAGCTCAAA AAGGAATAAT TTATATTGAT GAAATAGATA AAATTGCGAG AAAAAGCGAA AATCCTTCAA TTACTAGAGA TGTCTCTGGT GAAGGAGTAC AGCAAGCATT ATTAAAAATG CTTGAAGGAA CAATTGCTAA TGTGCCACCA CAAGGAGGAA GAAAACATCC TTATCATGAC TGCATCCAAA TTGATACGAG TCAAATATTA TTTATTTGTG GGGGAGCTTT TATAGGTTTA GAGGATATCG TTCAAAAGCG AATGGGTAAA CACTCTATAG GATTTACCAC CAATTCAGAT CAAAACAAAG TTGATACAAA AAAAATAGTA GATCCAAGAG ATGCCCTGAA AAATCTAGAA TTAGATGACT TAGTGAAATA TGGCCTAATT CCAGAATTTA TTGGAAGAAT TCCGGTTTGT GCTGTATTAG ATCGTCTTAC AAAGGAAACT TTAGAATCTA TTTTGACTCA ACCAAGAGAT GCATTAGTAA AGCAATTCAA AACTTTGCTA AGTATGGATA ATGTTGAATT ATCGTTTGAG CCTGATTCTG TTGAAGCAAT AGCAAATGAG GCATATAAAA GAAAAACAGG TGCAAGAGCA TTAAGGTCAA TAATTGAAGA GCTAATGCTA GACATAATGT ACACTTTGCC TTCTGAAGAA AATGTAAAAG AATTCACAAT TACGAAAAAA ATGGTAGATA ATTTGTTCTC ATCTAAAATT GTTAAACTAC CTTCAGGATC AAAAAGAGTC ATTAAAGAGT CTGCATAA
|
Protein sequence | MAKFDAHLKC SFCGKSQDQV RKLIAGPGVY ICDECIDLCN EILDEELLDN QANTNNSPQV KKKLPTDNPK KSVPLELTSI PKPLEIKSFL DNQVVGQESA KKILSVAVYN HYKRLAWKFK EENKNSNSKD SQATKLQKSN ILLIGPTGSG KTLLAQTLAE FLDVPFAVAD ATTLTEAGYV GEDVENILLR LLQKSEMNVE LAQKGIIYID EIDKIARKSE NPSITRDVSG EGVQQALLKM LEGTIANVPP QGGRKHPYHD CIQIDTSQIL FICGGAFIGL EDIVQKRMGK HSIGFTTNSD QNKVDTKKIV DPRDALKNLE LDDLVKYGLI PEFIGRIPVC AVLDRLTKET LESILTQPRD ALVKQFKTLL SMDNVELSFE PDSVEAIANE AYKRKTGARA LRSIIEELML DIMYTLPSEE NVKEFTITKK MVDNLFSSKI VKLPSGSKRV IKESA
|
| |