Gene Information |
Locus tag | P9303_05041 |
Symbol | groEL |
ID | 4776858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 495716 |
End bp | 497350 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086008 |
Product | chaperonin GroEL |
Protein accession | YP_001016521 |
Protein GI | 124022214 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAGC GCATTATTTA CAACGAGCAA GCTCGACGTG CCCTTGAGCG CGGCATTGAC ATCCTGGCCG AATCCGTGGC CGTCACACTG GGGCCCAAAG GCCGCAATGT GGTGCTTGAG AAAAAGTTTG GTGCACCCCA AATCATCAAT GATGGCGTCA CCATCGCCAA AGAGATTGAG CTTGAAGATC ACATTGAGAA CACCGGTGTT GCTCTGATCC GTCAGGCAGC TTCGAAGACC AATGACGCTG CAGGTGATGG CACGACCACA GCCACTGTTT TGGCCCATGC CATGGTGAAG GCCGGCTTGC GCAATGTGGC TGCTGGTGCC AATGCAATCT CCTTGAAGAA GGGGATCGAC AAGGCTTCCG ACTTCCTCGT TTCCAAGATT GAGGAGCTGG CCAAGCCGAT CAGTGACAGC AACGCCATTG CTCAGTGTGG AACCATTGCT GCCGGCAACG ACGAGGAAGT GGGTCAGATG ATTGCTGACG CCATGGACAA AGTTGGCAAG GAAGGTGTGA TCTCTCTTGA AGAGGGCAAG TCGATGACCA CTGAACTGGA GGTCACCGAG GGGATGCGTT TCGATAAGGG CTACATCTCC CCTTACTTCG CCACTGACAC TGAGCGGATG GAGGCTGTGC TGGATGAGCC CTACATTCTC CTCACTGATA AGAAGATTGG CTTGGTACAG GATCTGGTGC CTGTGCTTGA ACAGATTGCC CGCACTGGCA AGCCCCTGCT GATCATTGCT GAGGACATTG AGAAGGAAGC TCTCGCAACC TTGGTGGTCA ATCGTCTGCG TGGTGTGCTG AATGTGGCTG CCGTGAAGGC GCCTGGGTTT GGTGATCGCC GTAAGGCCAT GCTCGAAGAC ATGGCAGTGC TGACCAATGG TCAGTTGATC ACTGAAGACG CCGGTCTTAA ATTGGACAAT GCCAAGCTGG AAATGCTTGG TACAGCGCGC CGTGTGACCA TTAATAAGGA CACCACAACC ATCGTTGCCG AAGGCAATGA GACGGCGGTG AAAGGGCGTT GCGAGCAGAT CAAAAAGCAG ATGGACGAGA CTGACTCGAC CTACGACAAG GAGAAGCTGC AGGAACGACT GGCCAAGTTG GCCGGTGGCG TTGCTGTTGT GAAGGTGGGT GCTGCTACCG AAACCGAGAT GAAGGACAAG AAGCTCCGTC TTGAAGACGC CATCAACGCC ACCAAGGCAG CAGTTGAGGA AGGCATCGTC CCAGGTGGTG GCACCACACT GACTCATCTC GCTGCTGATC TGCAGAAGTG GGCCAATAGC AACCTCAGCG GTGAGGAGCT GATCGGTGCC AACATCGTGG AGGCTTCCCT GGCTGCTCCC TTGATGCGCA TTGCTGAGAA TGCCGGCGCC AATGGTGCTG TCGTGGCCGA GAACGTTAAG AGCAGGCCCA TCAGTGACGG CTATAACGCT GCTACTGGGG ATTACATCGA CATGCTTGCT GCAGGCATTG TTGACCCTGC CAAAGTCACC CGTTCTGGTT TGCAGAATGC GGCCTCGATT GCTGGCATGG TGCTGACCAC TGAGTGCATC GTTGCTGATT TGCCAGAGAA GAAGGAGGCA GCTCCTGCAG GCGGTGGCGG CATGGGTGGT GACTTCGACT ACTGA
|
Protein sequence | MAKRIIYNEQ ARRALERGID ILAESVAVTL GPKGRNVVLE KKFGAPQIIN DGVTIAKEIE LEDHIENTGV ALIRQAASKT NDAAGDGTTT ATVLAHAMVK AGLRNVAAGA NAISLKKGID KASDFLVSKI EELAKPISDS NAIAQCGTIA AGNDEEVGQM IADAMDKVGK EGVISLEEGK SMTTELEVTE GMRFDKGYIS PYFATDTERM EAVLDEPYIL LTDKKIGLVQ DLVPVLEQIA RTGKPLLIIA EDIEKEALAT LVVNRLRGVL NVAAVKAPGF GDRRKAMLED MAVLTNGQLI TEDAGLKLDN AKLEMLGTAR RVTINKDTTT IVAEGNETAV KGRCEQIKKQ MDETDSTYDK EKLQERLAKL AGGVAVVKVG AATETEMKDK KLRLEDAINA TKAAVEEGIV PGGGTTLTHL AADLQKWANS NLSGEELIGA NIVEASLAAP LMRIAENAGA NGAVVAENVK SRPISDGYNA ATGDYIDMLA AGIVDPAKVT RSGLQNAASI AGMVLTTECI VADLPEKKEA APAGGGGMGG DFDY
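
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the sequence above ends in TGA). It also shows one way to strip the 10-character block spacing used in the Sequence fields.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS that includes its stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split a displayed sequence into blocks of 10."""
    return "".join(seq.split())


if __name__ == "__main__":
    length = gene_length(495716, 497350)   # Start bp / End bp from the record
    print(length)                          # compare with "Gene Length"
    print(protein_length(length))          # compare with "Protein Length"
    print(strip_blocks("ATGGCTAAGC GCATTATTTA"))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1635 bp) and Protein Length (544 aa).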