Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3172 |
Symbol | groEL |
ID | 3972608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 3514657 |
End bp | 3516300 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637926282 |
Product | chaperonin GroEL |
Protein accession | YP_533033 |
Protein GI | 90424663 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.252237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCTA AAGAAGTCAA ATTCGGCGTC GACGCTCGCG ATCGCATGCT GCGCGGTGTC GACATTCTCG CCAACGCGGT CAAGGTGACG CTCGGCCCGA AGGGCCGCAA CGTCGTGCTC GACAAGTCGT TCGGCGCGCC CCGCATTACC AAGGACGGCG TCACCGTCGC CAAGGAAATC GAGCTCGAGG ACAAGTTCGA GAACATGGGC GCCCAGATGG TGCGCGAAGT GGCCTCGAAG TCCGCGGATC TCGCCGGCGA CGGCACCACC ACCGCGACCG TGCTGGCCGC GGCGATCGTC CGTGAAGGCG CCAAGTCGGT TGCCGCCGGC ATGAACCCGA TGGATCTGAA GCGCGGCATC GACCTCGCGG TGGAAGCCGT CGTCGCCGAT CTCGTCAAGA ACTCCAAGAA GGTCACCTCG AACGAGGAGA TCGCCCAGGT CGGCACCATC TCCGCCAATG GCGACGCCGA AATCGGCAAG TTCCTGTCGG ACGCGATGAA GAAGGTCGGC AACGAGGGCG TCATCACCGT CGAGGAAGCC AAGTCGCTGG AAACCGAACT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACATC TCGCCCTACT TCGTCACCAA CGCCGACAAG ATGCGCGTTG AATTCGACGA CGCCTACATC CTGATCAACG AGAAGAAGCT CTCCAACCTC AACGAGCTGC TGCCGCTGCT CGAGGCCGTG GTGCAGACCG GCAAGCCGCT GGTGATCGTC GCTGAAGACG TCGAGGGCGA AGCCCTCGCC ACCCTCGTCG TCAACCGCCT GCGCGGTGGT CTGAAGGTTG CCGCCGTCAA GGCGCCGGGC TTCGGCGATC GCCGCAAGGC GATGCTGCAG GACATCGCGA TCCTGACCGG CGGCCAGGCG ATCTCGGAAG ATCTCGGCAT CAAGATGGAG AACGTCACCC TGGCCATGCT CGGCAAGGCC AAGAAGGTGA TGATCGACAA GGAAAACACC ACCATCGTCA ACGGCGCCGG CAAAAAGGCC GACATCGAAG CCCGCGTCGC CCAGATCAAG GCGCAGATCG AAGAGACCAC GTCGGACTAC GACCGTGAGA AGCTGCAGGA GCGTCTCGCC AAGCTCGCCG GTGGCGTTGC GGTGATCCGC GTCGGTGGTG CCACCGAGAT CGAAGTCAAG GAGCGCAAGG ACCGCGTCGA CGACGCGATG CATGCGACCC GTGCGGCGGT CGAGGAAGGC ATTCTACCGG GCGGCGGCGT CGCTTTGCTG CGTGCTTCCG AGCAGCTGAA GCGCATCAAG ACCCAGAACG ACGACCAGAA GACCGGCGTC GAAATCGTCC GCAAGGCTTT GTCCTGGCCG GCTCGCCAGA TCGCCATCAA CGCCGGCGAA GACGGCTCGG TGATCGTCGG CAAGATCCTC GAGAAGGATC AGTATTCGTA CGGCTTCGAC TCGCAGTCCG GCGAATATGG CGACATGGTC AAGAAGGGCA TCATCGACCC CACCAAGGTG GTGCGTGCGG CGATCCAGAA CGCGGCCTCG GTCGCGGCGC TCTTGATCAC CACCGAAGCG ATGATCGCTG AGCTGCCGAA GAAGGGCAAC GCCGGCGGCG GTATGCCCCC CGGTGGCGGC GGCATGGGCG GCATGGATTT CTGA
|
Protein sequence | MSAKEVKFGV DARDRMLRGV DILANAVKVT LGPKGRNVVL DKSFGAPRIT KDGVTVAKEI ELEDKFENMG AQMVREVASK SADLAGDGTT TATVLAAAIV REGAKSVAAG MNPMDLKRGI DLAVEAVVAD LVKNSKKVTS NEEIAQVGTI SANGDAEIGK FLSDAMKKVG NEGVITVEEA KSLETELDVV EGMQFDRGYI SPYFVTNADK MRVEFDDAYI LINEKKLSNL NELLPLLEAV VQTGKPLVIV AEDVEGEALA TLVVNRLRGG LKVAAVKAPG FGDRRKAMLQ DIAILTGGQA ISEDLGIKME NVTLAMLGKA KKVMIDKENT TIVNGAGKKA DIEARVAQIK AQIEETTSDY DREKLQERLA KLAGGVAVIR VGGATEIEVK ERKDRVDDAM HATRAAVEEG ILPGGGVALL RASEQLKRIK TQNDDQKTGV EIVRKALSWP ARQIAINAGE DGSVIVGKIL EKDQYSYGFD SQSGEYGDMV KKGIIDPTKV VRAAIQNAAS VAALLITTEA MIAELPKKGN AGGGMPPGGG GMGGMDF
|
| |