Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_17831 |
Symbol | clpX |
ID | 5730177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1611189 |
End bp | 1612550 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641286169 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001551668 |
Protein GI | 159904324 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGT TTGAAGCCCA TCTCAAATGC TCTTTTTGTG GCAAATCCCA AGAGCAAGTT AGGAAACTCA TTGCAGGTCC AGGAGTCTAT ATCTGTGATG AATGTATAGA TCTCTGTAAT GAGATCTTGG ATGAAGAACT GATAGATAAT CAAACAAAAA CAAGAGAAGC AAACAATCCA TCAAATAAAA ATCACACCGC CATTACAAGT ACTAGCAAGC CTGCTCCTAC CCTAGCGACC ATCCCCAAAC CAATTGATAT TAAAGACTTC TTAGATAAAC AAGTAGTTGG ACAAGAGGTT GCAAAAAAAA TCCTTTCTGT GGCTGTCTAT AACCATTACA AACGACTTGC TTGGCAAGGC GATGGTAATC AGGAAACGGA CCTAACAGCA ACAAGATTAC ATAAGTCAAA CATTCTCCTT ATTGGGCCTA CTGGTAGTGG GAAAACTCTT CTCGCGCAAA CTTTGGCAGA GCTGCTAGAT GTACCATTTG CAGTAGCAGA TGCAACGACA TTAACCGAAG CAGGATATGT AGGAGAAGAT GTTGAAAATA TTTTGCTTAG ACTCTTGCAG AAAGCCGATA TGGATATTGA ACTTGCGCAA AGAGGAATTA TTTATATCGA TGAAATAGAT AAAATCGCTA GAAAAAGTGA AAACCCTTCT ATAACAAGAG ATGTATCTGG AGAAGGAGTG CAACAAGCTT TGTTAAAGAT GCTAGAAGGC ACTGTTGCCA ACGTACCGCC TCAGGGAGGC AGGAAACATC CATATCATGA TTGTATTCAA ATAGATACCA GCCAAATACT CTTTATTTGT GGAGGTGCTT TTGTTGGTCT AGAAGATATT GTTCAAAAAA GATTAGGCAA AAACTCTATT GGCTTCATGC CTACAGATAG CCGAGGGCAA AATCATCTCA ATAGGGACTT AGACAATAAT CAAATGATTA ACAATCTCGA GCCAGATGAT TTAATACGTT ATGGTCTAAT TCCTGAATTT ATTGGAAGAA TGCCAGTAAC AGCAGTCCTA GAACCATTGA ATTCAGAAGC TTTAGAGGCA ATTCTAAAAG AACCTAGGGA TGCCGTAATT AAACAATTTA GAACCCTAAT GAGTATGGAT AATGTAAAAC TTGAATTTGA CGAAGGTGCA GTCACGGCAA TTGCTCAAGA AGCATTTAGA AGAAAAACTG GTGCAAGAGC CCTAAGAGGA ATAGTTGAAG AATTAATGGT TGATCTTATG TACAAACTTC CTTCCGAAAA AAATGTCAGT GATTTTACAG TGACTAAAAA AATGGTGGAT GAAATGATTA TCGGTGGGAA GGTATTAAAA CTGCCCTCTA ACGAAAAAAT AGATCACCCT GAATCTGCCT AA
|
Protein sequence | MAKFEAHLKC SFCGKSQEQV RKLIAGPGVY ICDECIDLCN EILDEELIDN QTKTREANNP SNKNHTAITS TSKPAPTLAT IPKPIDIKDF LDKQVVGQEV AKKILSVAVY NHYKRLAWQG DGNQETDLTA TRLHKSNILL IGPTGSGKTL LAQTLAELLD VPFAVADATT LTEAGYVGED VENILLRLLQ KADMDIELAQ RGIIYIDEID KIARKSENPS ITRDVSGEGV QQALLKMLEG TVANVPPQGG RKHPYHDCIQ IDTSQILFIC GGAFVGLEDI VQKRLGKNSI GFMPTDSRGQ NHLNRDLDNN QMINNLEPDD LIRYGLIPEF IGRMPVTAVL EPLNSEALEA ILKEPRDAVI KQFRTLMSMD NVKLEFDEGA VTAIAQEAFR RKTGARALRG IVEELMVDLM YKLPSEKNVS DFTVTKKMVD EMIIGGKVLK LPSNEKIDHP ESA
|
| |