Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0019 |
Symbol | groEL |
ID | 4269550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 22813 |
End bp | 24474 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638124746 |
Product | chaperonin GroEL |
Protein accession | YP_740868 |
Protein GI | 114319185 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.557028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.807584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCTA AAGAGATCCG TTTCTCCGAC GACGCCCGCC AGCGCATGAT GAAGGGCGTG AACACCCTGG CCAACGCGGT CAAGGCCACC CTGGGCCCGC GCGGCCGCAA CGCCGTGCTG GACAAGTCCT TCGGCGCCCC CACCGTCACC AAGGACGGCG TCTCCGTGGC CAAGGAGATC GAGCTGGAGG ACAAGTTCGA GAACATGGGC GCGCAGATGC TCAAAGAGGT CTCCAGCCAG ACCTCCGACA TCGCCGGTGA CGGCACCACC ACGGCCACCG TGCTGGCCCA GGCCATCCTG CGCGAAGGCA TGAAGGCCGT GGCCGCCGGC ATGAACCCCA TGGACCTCAA GCGCGGCATC GACAAGGGCG TCTCCGCGGC CACCAAGTAC CTGGCCGACG AACTCTCCAA GCCCTGCGAG ACCGACACGT CCATCGCCCA GGTGGGCTCC ATCTCCGCCA ACTCCGACGA GTCCGTGGGT CGCATCATCG CCGACGCCAT GCAGAAGGTG GGCAAGGAAG GCGTCATCAC CGTGGAAGAG GGCTCCGGCC TGGAGAACGA GCTGGACGTG GTTGAGGGCA TGCAGTTCGA CCGCGGCTAC CTCAGCCCCT ACTTCATCAA CAACCAGCAG TCCATGAAGG CCGAGCTGGA AGACGCCTTC ATCCTGCTGC ACGACAAGAA GATCTCCAAC ATCCGCGACC TGCTGCCGCT GCTGGAGAAT GTCGCCAAGG CCAACAAGCC GCTGCTGATC ATCTCCGAGG ACATCGAGGG CGAGGCCCTG GCCACCCTGG TGGTCAACAG CATCCGTGGC ATCGTCAAGG TGGCTGCGGT CAAGGCCCCC GGCTTCGGTG ACCGCCGCAA GGCCATGCTG CAGGACATCG CCGTGCTCAC CGGCGGCACC GTGATCTCCG AGGAAGTGGG TCTGTCCCTG GAGAAGGCCA CCCTGGACGA CCTGGGCCAG GCCAAGAAGG TGGACGTCTC CAAGGAAGAG ACCACCATCG TCGGCGGCGC CGGCCGTCAC GACGACATCA TGGCCCGCGT CGAGCAGATC CGTGCCCAGA TCGAGGAGAG CACCTCCGAG TACGACAAGG AGAAGCTGCA GGAGCGCGTG GCCAAGCTGG CTGGCGGCGT GGCCGTCATC AAGGTGGGCG CCACCTCCGA GATCGAGATG AAGGAGAAGA AGGCCCGCGT GGAGGACGCC CTGCACGCCA CCCGCGCCGC GGTGGAAGAG GGCATCGTCC CCGGTGGTGG CACCGCGCTG CTGCGCGCCC AGGCCTCCCT CGACGGCTTG GAGTATGCCA ACCACGACCA GGAGGTGGGC ATCAACATCG TCCGCCGCGC CATGGAAGAG CCCCTGCGCC AGATCGTCTA CAACGCCGGT GGTGATGGCG CCGTGGTGGT CAACGAAGTG CGCAACGGCG AGGGCAACTA CGGCTACAAC GCCCAGAGCG GTGAGTACGG TGACTTGGTC GAGATGGGCA TTCTCGATCC CACCAAGGTG ACCCGCACCG CGCTGCAGAA CGCCGCCTCC GTGGCCGCGC TGATGATCAC CACCGAGGTG ATGGTCGCCG ACCTGCCCAA GGACGACGAC GCCGGTGCCG GCGGCGGCAT GGGCGACATG GGTGGTATGG GCGGCATGGG GGGCATGGGC GGCATGATGT AA
|
Protein sequence | MAAKEIRFSD DARQRMMKGV NTLANAVKAT LGPRGRNAVL DKSFGAPTVT KDGVSVAKEI ELEDKFENMG AQMLKEVSSQ TSDIAGDGTT TATVLAQAIL REGMKAVAAG MNPMDLKRGI DKGVSAATKY LADELSKPCE TDTSIAQVGS ISANSDESVG RIIADAMQKV GKEGVITVEE GSGLENELDV VEGMQFDRGY LSPYFINNQQ SMKAELEDAF ILLHDKKISN IRDLLPLLEN VAKANKPLLI ISEDIEGEAL ATLVVNSIRG IVKVAAVKAP GFGDRRKAML QDIAVLTGGT VISEEVGLSL EKATLDDLGQ AKKVDVSKEE TTIVGGAGRH DDIMARVEQI RAQIEESTSE YDKEKLQERV AKLAGGVAVI KVGATSEIEM KEKKARVEDA LHATRAAVEE GIVPGGGTAL LRAQASLDGL EYANHDQEVG INIVRRAMEE PLRQIVYNAG GDGAVVVNEV RNGEGNYGYN AQSGEYGDLV EMGILDPTKV TRTALQNAAS VAALMITTEV MVADLPKDDD AGAGGGMGDM GGMGGMGGMG GMM
|
| |