Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_pSN254_0147 |
Symbol | groEL |
ID | 4929507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_009140 |
Strand | + |
Start bp | 128544 |
End bp | 130181 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642572445 |
Product | chaperonin GroEL |
Protein accession | YP_001102020 |
Protein GI | 134047129 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCA AGGATATTCG TTTCGGTGAA GACGCCCGCG CCCGCATGGT CCGCGGCGTG AACGTGCTCG CCAACGCCGT CAAGGCGACC CTGGGCCCGA AGGGCCGCAA CGTCGTGCTC GAGAAGAGCT TCGGCGCCCC GACGATCACC AAGGACGGCG TGTCCGTCGC CAAGGAGATC GAACTGGCCG ACAAGTTCGA GAACATGGGC GCGCAGATGG TCAAGGAAGT CGCTTCCAAG ACCTCCGACA ACGCCGGCGA CGGCACCACC ACCGCCACCG TGCTGGCCCA GGCCCTGATC CGCGAGGGCA TGAAGGCCGT GGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC GACAAGGCCG TCACCTCGGC CGTCGAGGAG CTGAAGAAGA TCTCCAAGCC CTGCTCGACC AGCAAGGAGA TCGCCCAGGT CGGTTCGATC TCGGCCAACT CCGACACCGA CATCGGCGAG CTGATCGCCA AGGCCATGGA CAAGGTCGGC AAGGAAGGCG TGATCACCGT CGAGGAGGGC TCGGGCCTGG AGAACGAGCT CGACGTCGTC GAGGGCATGC AGTTCGACCG CGGCTACCTG TCGCCGTACT TCATCAACAA CCCGCAGTCG ATGCAGGCCG AGCTGGAGGA TCCGTTCATC CTGCTGCACG ACAAGAAGAT CTCGAACGTC CGCGACCTGC TGCCGATCCT CGAGGGCGTG GCCAAGGCCG GCAAGCCGCT GCTGATCGTC GCCGAGGACG TCGAGGGCGA GGCGCTGGCC ACGCTGGTGG TCAACACCAT CCGCGGCATC GTGAAGGTCT GCGCGGTCAA GGCCCCGGGC TTCGGCGACC GCCGCAAGGC GATGCTGGAG GACATGGCCA TCCTCACCGG TGGCACCGTG ATCTCCGAGG AAGTCGGCCT CTCGCTCGAG AAGGCGACCA TCAACGACCT CGGCCGCGCG AAGAAGGTGC AGGTCTCGAA GGAGAACACC ACCATCATCG ACGGCGCCGG CGACACCGCG GACATCGAAG CCCGCATCAA GCAGATCAAG GCGCAGATCG AGGAGACCAC CTCGGACTAC GACCGCGAGA AGCTGCAGGA GCGCGTGGCC AAGCTGGCCG GGGGCGTTGC GGTGATCAAG GTCGGCGCCG CCACCGAAGT CGAGATGAAG GAAAAGAAGG CGCGCGTCGA AGACGCCCTG CATGCCACCC GTGCGGCGGT CGAGGAAGGC ATCGTCCCGG GCGGCGGCGT CGCCCTGATC CGTGCCAAGG CCGCGATCGC CGAGCTGAAG GGCGCCAACG AGGACCAGAA CCACGGCATC GCGATCGCCC TGCGCGCGAT GGAAGCCCCG CTGCGCGAGA TCGTCACCAA CGCCGGCGAC GAGCCGAGCG TGGTGCTGAA CCGCGTCGCC GAAGGCACCG GCGCGTTCGG CTACAACGCC GCCAACGGCG AGTTCGGCGA CATGATCGAG TTCGGCATCC TGGACCCGAC CAAGGTCACG CGCTCCGCGC TGCAGAACGC GGCGTCCATC GCCGGCCTGA TGATCACCAC CGAAGCGATG GTGGCCGAAG CGCCGAAGAA GGAAGAGCCG GCCGCTCCGG GCGGCGGCAT GGGCGGCATG GGCGGCATGG ATTTCTAA
|
Protein sequence | MAAKDIRFGE DARARMVRGV NVLANAVKAT LGPKGRNVVL EKSFGAPTIT KDGVSVAKEI ELADKFENMG AQMVKEVASK TSDNAGDGTT TATVLAQALI REGMKAVAAG MNPMDLKRGI DKAVTSAVEE LKKISKPCST SKEIAQVGSI SANSDTDIGE LIAKAMDKVG KEGVITVEEG SGLENELDVV EGMQFDRGYL SPYFINNPQS MQAELEDPFI LLHDKKISNV RDLLPILEGV AKAGKPLLIV AEDVEGEALA TLVVNTIRGI VKVCAVKAPG FGDRRKAMLE DMAILTGGTV ISEEVGLSLE KATINDLGRA KKVQVSKENT TIIDGAGDTA DIEARIKQIK AQIEETTSDY DREKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG IVPGGGVALI RAKAAIAELK GANEDQNHGI AIALRAMEAP LREIVTNAGD EPSVVLNRVA EGTGAFGYNA ANGEFGDMIE FGILDPTKVT RSALQNAASI AGLMITTEAM VAEAPKKEEP AAPGGGMGGM GGMDF
|
| |