Gene P9303_00681 details


Gene Information

Locus tag: P9303_00681
Symbol: clpX
ID: 4777999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 66333
End bp: 67691
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 54%
IMG OID: 640085568
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_001016090
Protein GI: 124021783
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
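
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 66333
end_bp = 67691
gene_length_bp = 1359
protein_length_aa = 452

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence of N bases holds N / 3 codons; the final codon is the stop.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 1359 453 452
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.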


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.320437
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAT TCGACGCCCA TCTAAAGTGC TCTTTCTGCG GCAAGTCTCA GGACCAAGTA 
CGCAAATTGA TTGCCGGGCC AGGTGTCTAC ATCTGCGACG AATGCATCGA TCTCTGCACC
GAGATCCTGG ATGAAGAGCT TGTCGATAGC CAAGGCAACC CACGTCACAG CAGCGAGTCC
AATCGCAAGT CTGCAACTGC TTCCCATAAA AGTGGAAAGC CAGCACCCAC ACTCGCCACC
ATACCCAAAC CTCAGGAGAT CAAAAACTTC CTAGATAAGC AAGTAGTGGG GCAAGAAGCT
GCCAAAAAAG TACTTTCAGT CGCCGTCTAT AACCACTACA AACGCCTGGC TTGGCAAGGT
GACGGCCAAG GTGAAACCGA TCTCTCTGCA ACCAGGCTCC ACAAGTCAAA CATCCTGCTC
ATCGGTCCAA CAGGCTGTGG CAAAACCCTA TTAGCCCAGA CCCTGGCCGA ACTCCTGGAC
GTGCCCTTCG CCGTTGCTGA TGCAACCACA CTGACCGAAG CCGGCTACGT GGGCGAGGAT
GTCGAAAACA TTTTGCTGCG GCTATTGCAA AAAGCCGACA TGGACGTGGA GCATGCCCAG
CGCGGCATCA TTTACGTCGA TGAGATCGAT AAGATCGCCC GAAAAAGTGA AAACCCCTCC
ATCACCCGCG ACGTATCCGG TGAAGGAGTA CAGCAAGCCC TGCTAAAGAT GCTCGAAGGC
ACAGTCGCCA ACGTGCCCCC CCAAGGAGGG CGTAAACACC CCTATCAAGA CTGCATCCAA
ATCGACACCA GCCAGATCCT CTTCATCTGC GGTGGTGCCT TCATTGGCCT CGAAGATGTG
GTGCAACGGC GACTAGGCCG TAATGCCATC GGTTTTATGC CCAGCGATGG TCGAGGGCGC
AGCCGTGCCA ATCGTGATCT AAAGGCCTCA CAGGTGCTTC ACCACCTTGA AGCCGACGAC
CTCGTGCGTT ATGGCTTGAT CCCTGAATTC ATTGGCCGCA TTCCCGTGAG TGCAGTGCTT
GAACCACTTG ATTCACAAGC ACTCGAATCG ATTCTCACGG AGCCTCGTGA CGCACTGGTC
AAGCAATTCA GCACCCTGCT GAGCATGGAC AACGTGCAGC TTGAGTTTGA ATCAGATGCC
GTAGAAGCTA TCGCCCAGGA AGCACATCGG CGCAAGACCG GGGCCCGAGC ACTACGAGGC
ATCATCGAAG AGTTAATGCT GGATTTGATG TACGACCTTC CCTCCAAGAA GAACGTGAAG
AAATTCACCG TGACCCGCAC GATGGTTGAT GAACATACCG GCGGCAAGGT TCTGCCTCTA
CCAGCCAACG ACGAGCGATC GCATAAGGAA TCAGCCTGA
 
Protein sequence
MAKFDAHLKC SFCGKSQDQV RKLIAGPGVY ICDECIDLCT EILDEELVDS QGNPRHSSES 
NRKSATASHK SGKPAPTLAT IPKPQEIKNF LDKQVVGQEA AKKVLSVAVY NHYKRLAWQG
DGQGETDLSA TRLHKSNILL IGPTGCGKTL LAQTLAELLD VPFAVADATT LTEAGYVGED
VENILLRLLQ KADMDVEHAQ RGIIYVDEID KIARKSENPS ITRDVSGEGV QQALLKMLEG
TVANVPPQGG RKHPYQDCIQ IDTSQILFIC GGAFIGLEDV VQRRLGRNAI GFMPSDGRGR
SRANRDLKAS QVLHHLEADD LVRYGLIPEF IGRIPVSAVL EPLDSQALES ILTEPRDALV
KQFSTLLSMD NVQLEFESDA VEAIAQEAHR RKTGARALRG IIEELMLDLM YDLPSKKNVK
KFTVTRTMVD EHTGGKVLPL PANDERSHKE SA
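
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks must be removed before any analysis. The following is a minimal sketch of how that could be done, together with a GC-percentage helper and a basic coding-sequence sanity check. The function names are hypothetical. The check accepts only an ATG start, which is a simplifying assumption: translation table 11 also permits alternative start codons.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_cds(dna: str) -> bool:
    """Check for a length divisible by 3, an ATG start, and a terminal stop codon.

    Assumption: only ATG is accepted as a start codon here.
    """
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# Only the first and last printed lines of the gene sequence, joined for a
# short demonstration; this is not the full gene.
first_line = "ATGGCCAAAT TCGACGCCCA TCTAAAGTGC TCTTTCTGCG GCAAGTCTCA GGACCAAGTA"
last_line = "CCAGCCAACG ACGAGCGATC GCATAAGGAA TCAGCCTGA"
fragment = clean_sequence(first_line + " " + last_line)

print(len(fragment), looks_like_cds(fragment))
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` would allow the listed length and GC content to be compared against the printed sequence.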