Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_00681 |
Symbol | clpX |
ID | 4777999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 66333 |
End bp | 67691 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640085568 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001016090 |
Protein GI | 124021783 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.320437 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAT TCGACGCCCA TCTAAAGTGC TCTTTCTGCG GCAAGTCTCA GGACCAAGTA CGCAAATTGA TTGCCGGGCC AGGTGTCTAC ATCTGCGACG AATGCATCGA TCTCTGCACC GAGATCCTGG ATGAAGAGCT TGTCGATAGC CAAGGCAACC CACGTCACAG CAGCGAGTCC AATCGCAAGT CTGCAACTGC TTCCCATAAA AGTGGAAAGC CAGCACCCAC ACTCGCCACC ATACCCAAAC CTCAGGAGAT CAAAAACTTC CTAGATAAGC AAGTAGTGGG GCAAGAAGCT GCCAAAAAAG TACTTTCAGT CGCCGTCTAT AACCACTACA AACGCCTGGC TTGGCAAGGT GACGGCCAAG GTGAAACCGA TCTCTCTGCA ACCAGGCTCC ACAAGTCAAA CATCCTGCTC ATCGGTCCAA CAGGCTGTGG CAAAACCCTA TTAGCCCAGA CCCTGGCCGA ACTCCTGGAC GTGCCCTTCG CCGTTGCTGA TGCAACCACA CTGACCGAAG CCGGCTACGT GGGCGAGGAT GTCGAAAACA TTTTGCTGCG GCTATTGCAA AAAGCCGACA TGGACGTGGA GCATGCCCAG CGCGGCATCA TTTACGTCGA TGAGATCGAT AAGATCGCCC GAAAAAGTGA AAACCCCTCC ATCACCCGCG ACGTATCCGG TGAAGGAGTA CAGCAAGCCC TGCTAAAGAT GCTCGAAGGC ACAGTCGCCA ACGTGCCCCC CCAAGGAGGG CGTAAACACC CCTATCAAGA CTGCATCCAA ATCGACACCA GCCAGATCCT CTTCATCTGC GGTGGTGCCT TCATTGGCCT CGAAGATGTG GTGCAACGGC GACTAGGCCG TAATGCCATC GGTTTTATGC CCAGCGATGG TCGAGGGCGC AGCCGTGCCA ATCGTGATCT AAAGGCCTCA CAGGTGCTTC ACCACCTTGA AGCCGACGAC CTCGTGCGTT ATGGCTTGAT CCCTGAATTC ATTGGCCGCA TTCCCGTGAG TGCAGTGCTT GAACCACTTG ATTCACAAGC ACTCGAATCG ATTCTCACGG AGCCTCGTGA CGCACTGGTC AAGCAATTCA GCACCCTGCT GAGCATGGAC AACGTGCAGC TTGAGTTTGA ATCAGATGCC GTAGAAGCTA TCGCCCAGGA AGCACATCGG CGCAAGACCG GGGCCCGAGC ACTACGAGGC ATCATCGAAG AGTTAATGCT GGATTTGATG TACGACCTTC CCTCCAAGAA GAACGTGAAG AAATTCACCG TGACCCGCAC GATGGTTGAT GAACATACCG GCGGCAAGGT TCTGCCTCTA CCAGCCAACG ACGAGCGATC GCATAAGGAA TCAGCCTGA
|
Protein sequence | MAKFDAHLKC SFCGKSQDQV RKLIAGPGVY ICDECIDLCT EILDEELVDS QGNPRHSSES NRKSATASHK SGKPAPTLAT IPKPQEIKNF LDKQVVGQEA AKKVLSVAVY NHYKRLAWQG DGQGETDLSA TRLHKSNILL IGPTGCGKTL LAQTLAELLD VPFAVADATT LTEAGYVGED VENILLRLLQ KADMDVEHAQ RGIIYVDEID KIARKSENPS ITRDVSGEGV QQALLKMLEG TVANVPPQGG RKHPYQDCIQ IDTSQILFIC GGAFIGLEDV VQRRLGRNAI GFMPSDGRGR SRANRDLKAS QVLHHLEADD LVRYGLIPEF IGRIPVSAVL EPLDSQALES ILTEPRDALV KQFSTLLSMD NVQLEFESDA VEAIAQEAHR RKTGARALRG IIEELMLDLM YDLPSKKNVK KFTVTRTMVD EHTGGKVLPL PANDERSHKE SA
|
| |