Gene P9301_18471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_18471 
SymbolclpX 
ID4911049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1572250 
End bp1573617 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content33% 
IMG OID640161452 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001092071 
Protein GI126697185 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAT TCGACGCCCA TCTTAAATGT TCATTTTGCG GGAAATCACA AGACCAAGTA 
AGAAAGCTTA TTGCTGGTCC TGGGGTTTAT ATCTGTGATG AGTGCATAGA TCTTTGTAAT
GAAATTCTCG ATGAAGAACT ACTTGATAAT CAAGCGAACA CTAATAACTC TCCGCAAGTA
AAAAAGAAAT TACCAACTGA TAATCCAAAA AAATCTGTTC CTTTAGAATT AACCTCTATT
CCTAAGCCAT TAGAAATTAA AAGTTTTCTA GATAATCAAG TTGTTGGACA AGAATCTGCA
AAAAAAATAT TATCTGTAGC CGTATACAAT CACTACAAGC GATTAGCTTG GAAATTTAAA
GAAGAAAATA AAAATAGCAA TTCAAAAGAT TCACAAGCAA CTAAATTACA AAAATCAAAT
ATTTTACTCA TCGGCCCTAC TGGAAGTGGG AAAACATTAT TGGCGCAAAC TTTAGCAGAG
TTTCTGGATG TTCCTTTTGC AGTAGCTGAT GCAACGACTT TGACAGAAGC TGGATATGTT
GGGGAGGATG TTGAAAATAT ACTTTTAAGA CTTCTACAAA AATCAGAAAT GAATGTAGAA
CTAGCTCAAA AAGGAATAAT TTATATTGAT GAAATAGATA AAATTGCGAG AAAAAGCGAA
AATCCTTCAA TTACTAGAGA TGTCTCTGGT GAAGGAGTAC AGCAAGCATT ATTAAAAATG
CTTGAAGGAA CAATTGCTAA TGTGCCACCA CAAGGAGGAA GAAAACATCC TTATCATGAC
TGCATCCAAA TTGATACGAG TCAAATATTA TTTATTTGTG GGGGAGCTTT TATAGGTTTA
GAGGATATCG TTCAAAAGCG AATGGGTAAA CACTCTATAG GATTTACCAC CAATTCAGAT
CAAAACAAAG TTGATACAAA AAAAATAGTA GATCCAAGAG ATGCCCTGAA AAATCTAGAA
TTAGATGACT TAGTGAAATA TGGCCTAATT CCAGAATTTA TTGGAAGAAT TCCGGTTTGT
GCTGTATTAG ATCGTCTTAC AAAGGAAACT TTAGAATCTA TTTTGACTCA ACCAAGAGAT
GCATTAGTAA AGCAATTCAA AACTTTGCTA AGTATGGATA ATGTTGAATT ATCGTTTGAG
CCTGATTCTG TTGAAGCAAT AGCAAATGAG GCATATAAAA GAAAAACAGG TGCAAGAGCA
TTAAGGTCAA TAATTGAAGA GCTAATGCTA GACATAATGT ACACTTTGCC TTCTGAAGAA
AATGTAAAAG AATTCACAAT TACGAAAAAA ATGGTAGATA ATTTGTTCTC ATCTAAAATT
GTTAAACTAC CTTCAGGATC AAAAAGAGTC ATTAAAGAGT CTGCATAA
 
Protein sequence
MAKFDAHLKC SFCGKSQDQV RKLIAGPGVY ICDECIDLCN EILDEELLDN QANTNNSPQV 
KKKLPTDNPK KSVPLELTSI PKPLEIKSFL DNQVVGQESA KKILSVAVYN HYKRLAWKFK
EENKNSNSKD SQATKLQKSN ILLIGPTGSG KTLLAQTLAE FLDVPFAVAD ATTLTEAGYV
GEDVENILLR LLQKSEMNVE LAQKGIIYID EIDKIARKSE NPSITRDVSG EGVQQALLKM
LEGTIANVPP QGGRKHPYHD CIQIDTSQIL FICGGAFIGL EDIVQKRMGK HSIGFTTNSD
QNKVDTKKIV DPRDALKNLE LDDLVKYGLI PEFIGRIPVC AVLDRLTKET LESILTQPRD
ALVKQFKTLL SMDNVELSFE PDSVEAIANE AYKRKTGARA LRSIIEELML DIMYTLPSEE
NVKEFTITKK MVDNLFSSKI VKLPSGSKRV IKESA