Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4612 |
Symbol | groEL |
ID | 6146522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4715992 |
End bp | 4717638 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619428 |
Product | chaperonin GroEL |
Protein accession | YP_001746539 |
Protein GI | 170684024 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.562715 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCTA AAGACGTAAA ATTCGGTAAC GACGCTCGTG TGAAAATGCT GCGCGGCGTA AACGTACTGG CAGATGCAGT GAAAGTTACC CTCGGTCCGA AAGGCCGTAA CGTAGTTCTG GATAAATCTT TCGGTGCACC GACCATCACC AAAGATGGTG TTTCCGTTGC TCGTGAAATC GAACTGGAAG ACAAGTTCGA AAATATGGGT GCGCAGATGG TGAAAGAAGT TGCCTCTAAA GCGAACGACG CTGCAGGCGA CGGTACCACC ACTGCAACCG TACTGGCTCA GGCTATCATC ACTGAAGGTC TGAAAGCTGT TGCTGCGGGC ATGAACCCGA TGGACCTGAA ACGTGGTATC GACAAAGCTG TTACCGCTGC AGTTGAAGAA CTGAAAGCGC TGTCCGTACC GTGCTCTGAC TCTAAAGCGA TTGCTCAGGT TGGTACCATC TCCGCTAACT CCGACGAAAC CGTAGGTAAA CTGATCGCTG AAGCGATGGA CAAAGTCGGT AAAGAAGGTG TTATCACCGT TGAAGACGGT ACCGGTCTGC AGGACGAACT GGACGTGGTT GAAGGTATGC AGTTCGACCG TGGCTACCTG TCTCCTTACT TCATCAACAA GCCGGAAACT GGCGCAGTAG AACTGGAAAG CCCGTTCATC CTGCTGGCTG ACAAGAAAAT CTCCAACATC CGCGAAATGC TGCCGGTTCT GGAAGCTGTT GCCAAAGCAG GCAAACCGCT GCTGATCATC GCTGAAGATG TTGAAGGCGA AGCGCTGGCA ACTCTGGTTG TTAACACCAT GCGTGGCATC GTGAAAGTTG CTGCGGTTAA AGCTCCGGGC TTCGGCGATC GTCGTAAAGC TATGCTGCAG GATATCGCAA CCCTGACTGG CGGTACCGTA ATCTCTGAAG AGATCGGTAT GGAGCTGGAA AAAGCAACCC TGGAAGACCT GGGTCAGGCT AAACGTGTTG TGATCAACAA AGACACTACC ACCATCATCG ATGGCGTGGG TGAAGAAGCT GCAATCCAGG GCCGTGTTGC TCAGATCCGT CAGCAGATTG AAGAAGCAAC TTCTGACTAC GACCGTGAAA AACTGCAGGA GCGCGTAGCG AAACTGGCAG GCGGCGTTGC AGTTATCAAA GTAGGTGCTG CTACCGAAGT TGAAATGAAA GAGAAAAAAG CACGCGTTGA AGACGCCCTG CACGCGACCC GTGCTGCGGT AGAAGAAGGC GTGGTTGCTG GTGGTGGTGT TGCGCTGATC CGCGTAGCGT CTAAACTGGC TGACCTGCGT GGTCAGAACG AAGACCAGAA CGTGGGTATC AAAGTTGCAC TGCGTGCAAT GGAAGCTCCG CTGCGTCAGA TCGTCCTGAA CTGCGGCGAA GAACCGTCTG TTGTTGCTAA CACCGTTAAA GGCGGCGACG GCAACTACGG TTACAACGCA GCAACCGAAG AATACGGCAA CATGATCGAC ATGGGTATCC TGGATCCAAC TAAAGTAACC CGTTCTGCTC TGCAGTACGC GGCTTCTGTG GCTGGCCTGA TGATCACCAC CGAGTGCATG GTTACCGACC TGCCGAAAAA CGATGCAGCT GACTTAGGCG CTGCTGGCGG TATGGGCGGC ATGGGTGGCA TGGGCGGCAT GATGTAA
|
Protein sequence | MAAKDVKFGN DARVKMLRGV NVLADAVKVT LGPKGRNVVL DKSFGAPTIT KDGVSVAREI ELEDKFENMG AQMVKEVASK ANDAAGDGTT TATVLAQAII TEGLKAVAAG MNPMDLKRGI DKAVTAAVEE LKALSVPCSD SKAIAQVGTI SANSDETVGK LIAEAMDKVG KEGVITVEDG TGLQDELDVV EGMQFDRGYL SPYFINKPET GAVELESPFI LLADKKISNI REMLPVLEAV AKAGKPLLII AEDVEGEALA TLVVNTMRGI VKVAAVKAPG FGDRRKAMLQ DIATLTGGTV ISEEIGMELE KATLEDLGQA KRVVINKDTT TIIDGVGEEA AIQGRVAQIR QQIEEATSDY DREKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG VVAGGGVALI RVASKLADLR GQNEDQNVGI KVALRAMEAP LRQIVLNCGE EPSVVANTVK GGDGNYGYNA ATEEYGNMID MGILDPTKVT RSALQYAASV AGLMITTECM VTDLPKNDAA DLGAAGGMGG MGGMGGMM
|
| |