Gene Information |
Locus tag | RPB_3234 |
Symbol | groEL |
ID | 3911035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3697445 |
End bp | 3699088 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885136 |
Product | chaperonin GroEL |
Protein accession | YP_486841 |
Protein GI | 86750345 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
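The coordinate and length fields above are tied together by simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and an unspliced CDS ending in a single stop codon, which is consistent with the values in this record but is not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers chosen for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3697445, 3699088)
print(length)                  # 1644
print(protein_length(length))  # 547
```

Both results agree with the Gene Length (1644 bp) and Protein Length (547 aa) fields in the record.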
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.554891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGCTA AAGAAGTCAA ATTCGGCGTC GACGCCCGCG ACCGCATGCT GCGCGGCGTG GACATTCTCG CCAATGCCGT GAAGGTCACG CTCGGCCCGA AGGGCCGCAA CGTCGTGCTC GACAAGTCGT TCGGCGCGCC CCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC GAGCTCGAGG ACAAGTTCGA GAACATGGGC GCCCAGATGG TGCGCGAAGT GGCCTCGAAG TCCGCCGATC TCGCCGGCGA CGGCACCACT ACCGCGACCG TGCTGGCCGC GGCGATCGTA CGTGAAGGCG CCAAGTCGGT GGCCGCCGGC ATGAACCCGA TGGATCTGAA GCGCGGCATC GACCTGGCTG TGGAAGCCGT GGTCGCCGAC CTCGTCAAGA ACTCCAAGAA GGTCACCTCG AACGACGAGA TCGCCCAGGT CGGCACCATC TCGGCCAATG GCGACGCGGA AATCGGCAAG TTCCTCGCCG ACGCGATGAA GAAGGTCGGC AACGAGGGTG TCATCACCGT CGAGGAAGCC AAGTCGCTCG AGACCGAACT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACATC TCGCCCTACT TCGTCACCAA CGCCGACAAG ATGCGCGTCG AATTCGACGA CGCCTACATC CTGATCAACG AGAAGAAGCT CTCCAACCTC AACGAACTGC TGCCGCTGCT CGAAGCCGTG GTGCAGACCG GCAAGCCGCT GGTGATCGTC GCTGAGGACG TCGAAGGCGA AGCGCTCGCC ACCCTCGTCG TCAACCGCCT GCGTGGCGGC CTGAAGGTCG CCGCCGTCAA GGCGCCGGGC TTCGGCGATC GCCGCAAGGC GATGCTGCAG GACATCGCGA TCCTGACCGG CGGCCAGGCG ATCTCGGAAG ACCTCGGCAT CAAGATGGAG AACGTCACGC TCCAGATGCT CGGTCGCGCC AAGAAGGTGA TGATCGACAA GGAGAACACC ACGATCGTCA ACGGCGCCGG TAAGAAGGTC GACATCGAAG CCCGCGTCGC CCAGATCAAG GCGCAGATCG AGGAGACCAC CTCGGACTAC GATCGCGAGA AGCTGCAGGA GCGCCTGGCC AAGCTCGCCG GCGGCGTCGC GGTGATCCGC GTCGGCGGCG CCACCGAGAT CGAGGTCAAG GAGCGCAAGG ATCGCGTTGA CGACGCGATG CACGCCACCC GCGCTGCGGT CGAGGAAGGC ATCGTCCCGG GCGGCGGCGT CGCTCTGCTG CGCGCCTCCG AGCAGCTCAA GCGCATCAAG ACCGCGAACG ACGACCAGAA GACCGGCGTC GAGATCGTGC GCAAGGCGCT CTCCGCCCCG GCCCGCCAGA TCGCCATCAA CGCCGGCGAA GACGGTTCGG TGATCGTCGG CAAGGTGCTG GAGAAGGATC AGTACAACTA CGGCTTCGAC AGCCAGACCG GCGAATACGG CGACATGGTC AAGAAGGGCA TCATCGACCC GACCAAGGTC GTGCGTGCGG CGATCCAGAA CGCGGCCTCG GTCGCGGCGC TCTTGATCAC CACCGAAGCC ATGATCGCGG AGCTGCCCAA GAAGGGCGGC GCCGGCGCCG GCGGCATGCC CCCGGGCGGC GGCATGGGCG GCATGGATTT CTGA
Protein sequence | MSAKEVKFGV DARDRMLRGV DILANAVKVT LGPKGRNVVL DKSFGAPRIT KDGVTVAKEI ELEDKFENMG AQMVREVASK SADLAGDGTT TATVLAAAIV REGAKSVAAG MNPMDLKRGI DLAVEAVVAD LVKNSKKVTS NDEIAQVGTI SANGDAEIGK FLADAMKKVG NEGVITVEEA KSLETELDVV EGMQFDRGYI SPYFVTNADK MRVEFDDAYI LINEKKLSNL NELLPLLEAV VQTGKPLVIV AEDVEGEALA TLVVNRLRGG LKVAAVKAPG FGDRRKAMLQ DIAILTGGQA ISEDLGIKME NVTLQMLGRA KKVMIDKENT TIVNGAGKKV DIEARVAQIK AQIEETTSDY DREKLQERLA KLAGGVAVIR VGGATEIEVK ERKDRVDDAM HATRAAVEEG IVPGGGVALL RASEQLKRIK TANDDQKTGV EIVRKALSAP ARQIAINAGE DGSVIVGKVL EKDQYNYGFD SQTGEYGDMV KKGIIDPTKV VRAAIQNAAS VAALLITTEA MIAELPKKGG AGAGGMPPGG GMGGMDF
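The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal Python sketch of how such a field could be cleaned, checked for GC content, and translated is shown here. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and the example uses only the first 30 bases of the gene sequence rather than the full field.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
excerpt = clean_sequence("ATGTCCGCTA AAGAAGTCAA ATTCGGCGTC")
print(translate(excerpt))  # MSAKEVKFGV
```

The output matches the first block of the protein sequence field. Passing the complete gene sequence to `gc_content` gives a value to compare against the reported GC content of 66%.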