Gene Information |
Locus tag | NATL1_02831 |
Symbol | clpB2 |
ID | 4779077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 260411 |
End bp | 263206 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083548 |
Product | putative ATP-dependent Clp protease, Hsp 100, ATP-binding subunit ClpB |
Protein accession | YP_001014112 |
Protein GI | 124024996 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | |
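The coordinate and length fields above agree with each other, and a short sketch can confirm this. It assumes 1-based inclusive coordinates (a common convention for such records, not stated on this page) and that the listed gene length includes a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(260411, 263206)
print(length, protein_length(length))  # prints "2796 931"
```

Under these assumptions the result matches the Gene Length (2796 bp) and Protein Length (931 aa) fields. The strand does not affect the length calculation.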
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACTT TGAATCAAAA GTCATCAAAA ATGAATGGAA GTCTCACTAC AGAACCAGAT TCGTTTAGCG ATGAAGCTTG GAGTCTTTTA TTAATAGCTG AACAATCAGC CAGAAGATGG AGACATAAGA ACTTAGATGT TGAGCATCTT ATTGAAGTGC TTTTTAGAAA TAAAAAATAT CAAAAATATA CAAATTCCTT ACCCATAAAC CATAAAGAAT TAAATGAAAT CTTAGAAAAC TTTATCGCTG AACTACCAAT AAACAACCAA CCAGATTTAT TTATTGGAGA AGACTTAGAA ATTCTTCTTG AGGTCGCTGA TGATTTTCGC TCTCGATGGG GATCTAATCA AATAGAAATA TCTCATATCC TCATCGCAAT TGGAAGAGAT AATCGCTTAG GAGAAGATCT TTTTTATCAA GCAGGTCTAC CGAGTGAAAT TCTCGAAGCG GAATTGAGAC GACTACCAGC ACCGAAATCA TTTAAACAAT CAAAAAGAAA TCAAAACAAA CCAATAACAA ATCGACCACA GAAAGATTCA CAGTCTTTTA TGCCTACCGA AACCACTGCA AAAGATCCAA AACCCGAGCC ACTTCCTCCT CTCTCTAAAG AAGAAATCAC ATCAAAGCAA GAACCTTTAA GCCTTAATGA GGCACCAAGT GCCTTAGATT TATACTGCAA AGATCTTACA ACTGAAGCTG AAAATGGAAC ATTAGACCCT GTGATTGGCA GAGAGTCTGA AATAAAAGCA ATTACAAAAG TTTTATCTAG AAGAGGTAAA AACAATCCAG TACTAATTGG TGCTCCTGGT GTTGGAAAAA CAGCAATTGC AGAATTATTA GCTCAAAAAA TTGTAGATAA CGAACTTCCT GAATCTCTTC AAGGTCTAAG GCTAATTTCA CTTGATATCG GTGCATTAAT TGCTGGAGCT AAATTCCGAG GACAATTTGA AGAACGCTTT AGATCATTAT TAAGTGAAAT CAACAATAGC GAAAAAGGGG TAATCCTATT CATAGATGAA TTACACACAA TTGTAAGCAA AGACAGATCA AATACTGATG CTGGTAGTCT ATTAAAACCA TTATTAGCAA GCGGAGACTT AAGATGCATT GGTGCAACTA CTCCAGACAA TTATAGACGT ACGATCGAAA AAGACCTCGC TCTAAATAGA CGATTTCAGC AAGTATCAAT CAAAGAACCA AGCTTAGATT TAAGCTTGGA AATCTTAAAA GGACTTAAAG AAAATTACGA GGTTCATCAT GGCGTAATTA TTACCGATGA AGCACTAATT ACAGCAAATC GTTTAGCCTA TAGATATATA AGTGATAGAT GCCTACCAGA TAAAGCTATT GACTTAATCG ACGAGGCTTC AGCTCAAGTA AGAATAGAAT CTGCATCAAA ACCAAAAATC ATAGAAGAGA AAGAATCTCA GGTTAATCAT TTAGAGTCAT CAATAATAAA TGCAGATAAA GATACAACTT TAGAGACTAT AAATAATCTT CAAGAAAAGA AAGAATTGCT ACTCTTTGAG TTGGCTGAGA TTAAACAAAA ATGGCAAGAT CAAATTGATA AATCAGCTGA ATTACAAGAA CTAAAAATAA GCTTGAAAGA ATTAAAAAAT TTAATAAGAG AAGTGGAAAT CTCTGGTGAT ATGGAAGAAG TAGAAAAACT TAAATACGAC CAACTCTACC AATTACAAGA AAGAATAGAA GAAATAGAAG TTTCTATTCG AGAAGATAAT GAGTATGGTA ATTCCTTACT AAAAGATAAA GTCAATCCAG AAGACATCGC TGATGTTGTC 
TCAAGATGGA CAGGAATTCC TGTTAGAAAG GTTGTATCGG GTGAAAGACA GAAACTTTTA AAGTTAGAAC AAGACTTAGG GAAAAAAGTT ATTGGCCAAT TAAATGCTGT TCAAGCAGTC TCAGCAGCAA TTCGCAGAGC AAGAGCTGGG ATGCAGGATA TAAAAAGGCC CATTGGATCC TTTCTTTTTC TAGGCCCTAC AGGTGTTGGA AAAACTGAAC TTGCTAAATC ACTAGCAAGT TCTTTGTTTG ATGAAGAGGA CGCTTTGTTG AGACTTGATA TGAGTGAATA TATGGAGAGA AATGCTGTTT CAAGACTCTT AGGTGCACCG CCTGGATACG TGGGTTACGA AGAAGGTGGT CAATTAACAG AGGCAATTAG AAAAAGACCT TATGCAGTTT TGCTTCTTGA TGAGATTGAA AAAGCTCATC AAGAAGTTTT CAACATCCTA TTACAGGTAT TAGATGATGG AAGACTCACC GATTCTCAAG GTCGAACAGT AGATTTTAGA AATACAGTTA TTGTTATGAC AAGCAATCTT GCTAGCAAAG CAATCTTAAA TAATTCACTT CAACTTCAAA GCGAAAATTC AAATAAAAAT ATTCTTTTAC AAGAATTAGA TCAAAAAATC AACGAAGCTC TAACAAAACA TTTTCGACCT GAATTTTTGA ACCGCATTGA TGAAGTAATA AAATTCAACC CACTTAAACC TGACAGCTTA GAGCAAATAG TTCGACTTCA ACTTGATGAA TTAAAGAAGC TTCTAAAGCA CCAAGGTTTA GACCTTTATG TTGACGAAAA TACTATTAAA ATTCTTGCTG AAGAAGGCTA TGAGCCTGAA TACGGGGCTA GACCGCTCAG AAGAGTGATT AGAAGAAGAT TAGAAAACCC ACTGGCCACA CAAATTCTAG AAGAGGCTTT TCAAGGTGCA AAATCAATAA GGGTTGAGAC TAAAGAGGAT GATTCAGAAA AACTTCTTTT TTTAATAGAT AACTAA
|
Protein sequence | MTTLNQKSSK MNGSLTTEPD SFSDEAWSLL LIAEQSARRW RHKNLDVEHL IEVLFRNKKY QKYTNSLPIN HKELNEILEN FIAELPINNQ PDLFIGEDLE ILLEVADDFR SRWGSNQIEI SHILIAIGRD NRLGEDLFYQ AGLPSEILEA ELRRLPAPKS FKQSKRNQNK PITNRPQKDS QSFMPTETTA KDPKPEPLPP LSKEEITSKQ EPLSLNEAPS ALDLYCKDLT TEAENGTLDP VIGRESEIKA ITKVLSRRGK NNPVLIGAPG VGKTAIAELL AQKIVDNELP ESLQGLRLIS LDIGALIAGA KFRGQFEERF RSLLSEINNS EKGVILFIDE LHTIVSKDRS NTDAGSLLKP LLASGDLRCI GATTPDNYRR TIEKDLALNR RFQQVSIKEP SLDLSLEILK GLKENYEVHH GVIITDEALI TANRLAYRYI SDRCLPDKAI DLIDEASAQV RIESASKPKI IEEKESQVNH LESSIINADK DTTLETINNL QEKKELLLFE LAEIKQKWQD QIDKSAELQE LKISLKELKN LIREVEISGD MEEVEKLKYD QLYQLQERIE EIEVSIREDN EYGNSLLKDK VNPEDIADVV SRWTGIPVRK VVSGERQKLL KLEQDLGKKV IGQLNAVQAV SAAIRRARAG MQDIKRPIGS FLFLGPTGVG KTELAKSLAS SLFDEEDALL RLDMSEYMER NAVSRLLGAP PGYVGYEEGG QLTEAIRKRP YAVLLLDEIE KAHQEVFNIL LQVLDDGRLT DSQGRTVDFR NTVIVMTSNL ASKAILNNSL QLQSENSNKN ILLQELDQKI NEALTKHFRP EFLNRIDEVI KFNPLKPDSL EQIVRLQLDE LKKLLKHQGL DLYVDENTIK ILAEEGYEPE YGARPLRRVI RRRLENPLAT QILEEAFQGA KSIRVETKED DSEKLLFLID N
|
| |
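The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, with a simple GC percentage calculation. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not the full record.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = normalize(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGACCACTT TGAATCAAAA"
print(len(normalize(first_blocks)), gc_percent(first_blocks))  # prints "20 30.0"
```

Applied to the complete gene sequence, `len(normalize(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field.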