Gene P9211_17831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_17831 
SymbolclpX 
ID5730177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1611189 
End bp1612550 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content38% 
IMG OID641286169 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001551668 
Protein GI159904324 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAGT TTGAAGCCCA TCTCAAATGC TCTTTTTGTG GCAAATCCCA AGAGCAAGTT 
AGGAAACTCA TTGCAGGTCC AGGAGTCTAT ATCTGTGATG AATGTATAGA TCTCTGTAAT
GAGATCTTGG ATGAAGAACT GATAGATAAT CAAACAAAAA CAAGAGAAGC AAACAATCCA
TCAAATAAAA ATCACACCGC CATTACAAGT ACTAGCAAGC CTGCTCCTAC CCTAGCGACC
ATCCCCAAAC CAATTGATAT TAAAGACTTC TTAGATAAAC AAGTAGTTGG ACAAGAGGTT
GCAAAAAAAA TCCTTTCTGT GGCTGTCTAT AACCATTACA AACGACTTGC TTGGCAAGGC
GATGGTAATC AGGAAACGGA CCTAACAGCA ACAAGATTAC ATAAGTCAAA CATTCTCCTT
ATTGGGCCTA CTGGTAGTGG GAAAACTCTT CTCGCGCAAA CTTTGGCAGA GCTGCTAGAT
GTACCATTTG CAGTAGCAGA TGCAACGACA TTAACCGAAG CAGGATATGT AGGAGAAGAT
GTTGAAAATA TTTTGCTTAG ACTCTTGCAG AAAGCCGATA TGGATATTGA ACTTGCGCAA
AGAGGAATTA TTTATATCGA TGAAATAGAT AAAATCGCTA GAAAAAGTGA AAACCCTTCT
ATAACAAGAG ATGTATCTGG AGAAGGAGTG CAACAAGCTT TGTTAAAGAT GCTAGAAGGC
ACTGTTGCCA ACGTACCGCC TCAGGGAGGC AGGAAACATC CATATCATGA TTGTATTCAA
ATAGATACCA GCCAAATACT CTTTATTTGT GGAGGTGCTT TTGTTGGTCT AGAAGATATT
GTTCAAAAAA GATTAGGCAA AAACTCTATT GGCTTCATGC CTACAGATAG CCGAGGGCAA
AATCATCTCA ATAGGGACTT AGACAATAAT CAAATGATTA ACAATCTCGA GCCAGATGAT
TTAATACGTT ATGGTCTAAT TCCTGAATTT ATTGGAAGAA TGCCAGTAAC AGCAGTCCTA
GAACCATTGA ATTCAGAAGC TTTAGAGGCA ATTCTAAAAG AACCTAGGGA TGCCGTAATT
AAACAATTTA GAACCCTAAT GAGTATGGAT AATGTAAAAC TTGAATTTGA CGAAGGTGCA
GTCACGGCAA TTGCTCAAGA AGCATTTAGA AGAAAAACTG GTGCAAGAGC CCTAAGAGGA
ATAGTTGAAG AATTAATGGT TGATCTTATG TACAAACTTC CTTCCGAAAA AAATGTCAGT
GATTTTACAG TGACTAAAAA AATGGTGGAT GAAATGATTA TCGGTGGGAA GGTATTAAAA
CTGCCCTCTA ACGAAAAAAT AGATCACCCT GAATCTGCCT AA
 
Protein sequence
MAKFEAHLKC SFCGKSQEQV RKLIAGPGVY ICDECIDLCN EILDEELIDN QTKTREANNP 
SNKNHTAITS TSKPAPTLAT IPKPIDIKDF LDKQVVGQEV AKKILSVAVY NHYKRLAWQG
DGNQETDLTA TRLHKSNILL IGPTGSGKTL LAQTLAELLD VPFAVADATT LTEAGYVGED
VENILLRLLQ KADMDIELAQ RGIIYIDEID KIARKSENPS ITRDVSGEGV QQALLKMLEG
TVANVPPQGG RKHPYHDCIQ IDTSQILFIC GGAFVGLEDI VQKRLGKNSI GFMPTDSRGQ
NHLNRDLDNN QMINNLEPDD LIRYGLIPEF IGRMPVTAVL EPLNSEALEA ILKEPRDAVI
KQFRTLMSMD NVKLEFDEGA VTAIAQEAFR RKTGARALRG IVEELMVDLM YKLPSEKNVS
DFTVTKKMVD EMIIGGKVLK LPSNEKIDHP ESA