Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1836 |
Symbol | groEL |
ID | 3908995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2102255 |
End bp | 2103907 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883730 |
Product | chaperonin GroEL |
Protein accession | YP_485455 |
Protein GI | 86748959 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.253263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0261483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCCA AGGACGTCAA ATTCGGTGGA GACGCGCGCG ATCGGATGCT GCGCGGCGTC GACATCCTCG CCAATGCGGT CAAGGTCACG CTCGGCCCGA AGGGCCGGAA CGTGCTGATC GAGAAGAGCT TCGGCGCTCC CCGCATCACC AAGGACGGCG TCACCGTCGC CAAGGAGATC GAGCTCGACG ACAAGTTCGA GAACATGGGC GCGCAGATGC TGCGCGAAGT CGCCTCCAAG ACCAACGACC TCGCCGGTGA CGGCACCACC ACAGCGACCG TGCTGGCCCA GGCGATCGTC CGCGAAGGCG CCAAGTCGGT GGCCGCCGGC ATGAACCCGA TGGATCTGCG CCGCGGCATC GAGATCGCGG TCCAGGCCGT GGTCAAGGAC ATCCAGAAGC GCGCCCGTCC GGTCGCCTCC TCGGCCGAGA TCGCCCAGGT CGGTACCATC TCGGCCAATG GCGACGCGCC GATCGGCAAG ATGATCGCCC AGGCGATGCA GAAGGTCGGC AACGAGGGCG TCATCACCGT CGAAGAGAAC AAGTCGCTCG AGACCGAAGT CGACATCGTC GAGGGCATGA AGTTCGATCG CGGCTACCTG TCGCCCTATT TCGTCACCAA CGCCGAGAAG ATGACCGTCG AGCTCGACGA CGTCTACATC CTGCTGCACG AGAAGAAGGT GTCGGGCCTG CAGTCGATGC TGCCGGTGCT CGAAGCCGTG GTGCAGTCTG GCAAGCCGCT GCTGATCATC GCCGAGGATG TCGAAGGCGA AGCGCTGGCG ACGCTGGTGG TCAACCGGCT GCGCGGCGGC CTCAAGGTCT CGGCCGTCAA GGCGCCGGGC TTCGGCGATC GCCGCAAGGC GATGCTGGAA GACATCGCGA TCCTGACCGG CGGTCAGCTG ATCTCGGAAG AAATCGGCAT CAAGCTCGAG AGCGTCACGC TGAAGATGCT CGGCCGCGCC AAGAAGGTGG TGATCGACAA GGAGAACACC ACCATCGTCG GCGGCGCCGG CAAGAAGCCG GACATCGAGG CCCGCGTCCA GCAGATCAAG GCGCAGATCG AGGAGACCTC CTCGGACTAC GACCGTGAGA AGCTGCAGGA GCGTCTGGCC AAGCTCGCCG GCGGCGTCGC GGTGATCCGC GTCGGCGGCG CCACCGAGGT CGAGGTCAAG GAGAAGAAGG ACCGTGTCGA GGACGCGCTG AACGCGACCC GCGCCGCGGT GCAGGAAGGC ATCGTCCCGG GCGGCGGCGT CGCGCTGCTG CGCGCCAAGA AGGCGGTCGG CCGCATCCAC AACGACAATG CCGACGTCCA GGCCGGCATC AACATCGTGC TGAAGGCGCT GGAAGCTCCG ATCCGCCAGA TCGCCGAGAA CGCCGGCGTC GAAGGCTCGA TCGTGGTCGG CAAGATCCTC GAGAACAAGT CGGAGACGTT CGGCTTCGAC GCCCAGACCG AGGACTATGT CGACATGCTC GCCAAGGGCA TCGTCGATCC GGCCAAGGTG GTCCGCACCG CGCTGCAGGA CGCCTCGTCG GTCGCGGCGC TGCTGGTGAC CACCGAAGCC ATGGTCGCCG AACTGCCGAA GGAAGCCGCG CCGGCGATGC CGGGTGGCGG CGGCATGGGC GGAATGGGGG GCATGGGCGG CATGGGCTTC TGA
|
Protein sequence | MSAKDVKFGG DARDRMLRGV DILANAVKVT LGPKGRNVLI EKSFGAPRIT KDGVTVAKEI ELDDKFENMG AQMLREVASK TNDLAGDGTT TATVLAQAIV REGAKSVAAG MNPMDLRRGI EIAVQAVVKD IQKRARPVAS SAEIAQVGTI SANGDAPIGK MIAQAMQKVG NEGVITVEEN KSLETEVDIV EGMKFDRGYL SPYFVTNAEK MTVELDDVYI LLHEKKVSGL QSMLPVLEAV VQSGKPLLII AEDVEGEALA TLVVNRLRGG LKVSAVKAPG FGDRRKAMLE DIAILTGGQL ISEEIGIKLE SVTLKMLGRA KKVVIDKENT TIVGGAGKKP DIEARVQQIK AQIEETSSDY DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRVEDAL NATRAAVQEG IVPGGGVALL RAKKAVGRIH NDNADVQAGI NIVLKALEAP IRQIAENAGV EGSIVVGKIL ENKSETFGFD AQTEDYVDML AKGIVDPAKV VRTALQDASS VAALLVTTEA MVAELPKEAA PAMPGGGGMG GMGGMGGMGF
|
| |