Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_18661 |
Symbol | clpX |
ID | 4718604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1600265 |
End bp | 1601632 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640079600 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001010256 |
Protein GI | 123969398 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAT TCGACGCCCA TCTTAAATGT TCATTTTGCG GTAAATCACA AGACCAAGTA AGAAAGCTTA TAGCTGGTCC TGGGGTTTAT ATCTGTGATG AGTGCATAGA TCTTTGTAAT GAAATTCTCG ATGAAGAACT ACTTGATAAT CAAGCGAACA CAAACAACCC TCCGCAAGTA AAAAAGAAAT TACCAACTGA TAATCCAAAA AAATCTGTTC CTTTAGAATT AACCTCAATT CCTAAGCCAT TAGAAATTAA AAGTTTTCTA GATAATCAAG TTGTTGGACA AGAATCCGCA AAAAAAATAT TATCAGTCGC CGTATACAAT CACTACAAGC GATTAGCTTG GAAAGTTAAA GAAGATAGTA AAAATAACAA TGCAACAGAT TCACAAGCAA CTAAATTACA AAAATCAAAT ATTTTACTCA TCGGCCCTAC TGGAAGTGGA AAAACATTAT TGGCGCAAAC TTTAGCAGAG TTTTTAGATG TTCCTTTTGC AGTAGCTGAT GCAACGACTT TGACAGAAGC TGGATATGTT GGGGAGGATG TTGAAAACAT ACTTTTAAGA CTTCTACAGA AATCAGAAAT GAATGTAGAA CTAGCTCAAA AAGGAATTAT TTATATTGAT GAAATAGATA AAATTGCAAG AAAAAGCGAG AATCCCTCAA TTACTAGAGA TGTCTCTGGT GAAGGAGTAC AGCAAGCTTT ATTAAAAATG CTTGAAGGAA CAATTGCTAA TGTGCCACCA CAAGGCGGAA GAAAACATCC GTATCATGAC TGTATCCAAA TTGATACGAG TCAAATACTA TTTATTTGTG GGGGAGCTTT TATAGGTTTA GAGGATATCG TTCAAAAGCG TATGGGTAAA CACTCTATAG GTTTTACCAC CAATTCAGAT CAAAACAAAG TTGATACAAA AAAAATAGTA GACCCAAGAG ATTCCCTGAA AAATTTAGAA TTAGATGACT TAGTGAAATA TGGCCTAATT CCAGAATTTA TTGGAAGAAT TCCGGTTTGT GCTGTATTAG ATCGTCTTAC TAAAGAAACT TTAGAATCTA TTTTGACTCA ACCAAGAGAT GCATTAGTAA AGCAATTCAA AACTTTGCTA AGTATGGATA ATGTTGAATT ATCATTTGAG CCTGATTCTG TTGAAGCGAT AGCAAATGAA GCATACAAAA GAAAAACAGG TGCAAGAGCA TTAAGATCAA TAATTGAAGA GCTAATGCTA GACATAATGT ATACTTTGCC TTCTGAAGAA AATGTAAAAG AATTCACAAT TACGAAAAAA ATGGTAGATA ATTTGTTCTC ATCTAAAATT GTTAAACTAC CTTCAGGATC AAAAAGAATC ATTAAAGAGT CTGCATAA
|
Protein sequence | MAKFDAHLKC SFCGKSQDQV RKLIAGPGVY ICDECIDLCN EILDEELLDN QANTNNPPQV KKKLPTDNPK KSVPLELTSI PKPLEIKSFL DNQVVGQESA KKILSVAVYN HYKRLAWKVK EDSKNNNATD SQATKLQKSN ILLIGPTGSG KTLLAQTLAE FLDVPFAVAD ATTLTEAGYV GEDVENILLR LLQKSEMNVE LAQKGIIYID EIDKIARKSE NPSITRDVSG EGVQQALLKM LEGTIANVPP QGGRKHPYHD CIQIDTSQIL FICGGAFIGL EDIVQKRMGK HSIGFTTNSD QNKVDTKKIV DPRDSLKNLE LDDLVKYGLI PEFIGRIPVC AVLDRLTKET LESILTQPRD ALVKQFKTLL SMDNVELSFE PDSVEAIANE AYKRKTGARA LRSIIEELML DIMYTLPSEE NVKEFTITKK MVDNLFSSKI VKLPSGSKRI IKESA
|
| |