Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1420 |
Symbol | groEL |
ID | 7083502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1582097 |
End bp | 1583746 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643698437 |
Product | chaperonin GroEL |
Protein accession | YP_002355075 |
Protein GI | 217969841 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.038504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCTA AAGAAGTCAA GTTTGGTGAT TCCGCCCGCG AGCGCATGGT CGCCGGCATC AACATCCTCG CCAACGCGGT CAAGGTGACC CTGGGCCCGA AGGGCCGCAA CGTCGTGCTC GAGCGCTCGT TCGGCGCCCC GACCGTGACC AAGGACGGCG TCTCCGTCGC CAAGGAAATC GAGCTGAAGG ACAAGTTCGA GAACATGGGC GCGCAGATGG TCAAGGAAGT CGCTTCCAAG ACCTCGGACA TCGCCGGTGA CGGCACCACC ACCGCGACCG TGCTGGCGCA GTCGATCGTG CGCGAAGGCA TGAAGTTCGT CGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC GACAAGGCCG TCGTCGCCAC CATCGACGAG CTCAAGAAGC TGTCGAAGCC CTGCTCGACC AACAAGGAGA TCGCCCAGGT CGGCTCGATC TCGGCCAACT CCGACAGCGA CATCGGCGAC ATCATCGCCC GTGCGATGGA CAAGGTCGGC AAGGAAGGCG TGATTACCGT CGAAGACGGC AAGTCGCTGC AGAACGAACT CGACGTGGTC GAGGGCATGC AGTTCGACCG CGGTTACCTG TCGCCCTACT TCATCAACAA CCCGGACAAG CAGGTTGCCA TCCTCGAGCA GCCCTTCGTC CTGCTCTTCG ACAAGAAGAT CTCCAACATC CGCGACCTCC TGCCGGTGCT CGAGCAGGTC GCCAAGTCGG GCCGTCCGCT GCTGATCATC GCCGAGGACG TCGAAGGCGA AGCGCTCGCC ACCCTGGTGG TGAACAACAT CCGCGGCATC CTCAAGACCT GCGCCGTCAA GGCCCCGGGC TTCGGCGACC GTCGCAAGGC CATGCTGGAA GACATCGCCA TCCTGACCGG CGGCCAGGTC ATCGCCGAAG AAGTCGGCCT GACCCTCGAG AAGGCCACCC TGGACGACCT CGGCCAGGCT GCCCGCATCG AAGTCGGCAA GGAAAACACC ATCATCATCG ACGGCGCCGG CCAGGCCGAC CGCATCGAGG CGCGCGTCAA GCAGATCCGC GTGCAGATCG AGGAAGCCAG CTCCGACTAC GACCGCGAGA AGCTGCAGGA ACGCGTGGCC AAGCTGGCCG GCGGTGTTGC GGTGATCAAG GTCGGTGCCG CCACCGAAGT CGAGATGAAG GAGAAGAAGG CCCGCGTCGA GGACGCCCTG CACGCCACCC GCGCTGCGGT GGAAGAGGGC ATCGTCCCCG GCGGCGGCGT CGCGCTGCTG CGTGCCCGCG CCGCGCTGGG CGAGCTCAAG GGCGACAACC ACGACCAGGA TGCCGGCATC AAGATCGTGC TGCGCGCCAT GGAACAGCCC CTGCGCGAGA TCGTCGCCAA CGCCGGCGAC GAGCCGAGCG TGGTGGTGAA CAAGGTCGTC GAGGGCTCGG GCAACTACGG CTTCAACGCC GCCACCGGCG AGTACGGCGA CATGGTCGAG ATGGGCGTGC TGGATCCGAC CAAGGTCACC CGCACCGCGC TGCAGAACGC CGCGTCCGTC GCCGGCCTGA TGCTGACCAC CGACTGCATG GTCGGCGAGC TGGCCGAGGA CAAGCCGATG GGCGGCATGC CCGACATGGG CGGCATGGGT GGCATGGGCG GCATGGGCAT GGGCATGTAA
|
Protein sequence | MAAKEVKFGD SARERMVAGI NILANAVKVT LGPKGRNVVL ERSFGAPTVT KDGVSVAKEI ELKDKFENMG AQMVKEVASK TSDIAGDGTT TATVLAQSIV REGMKFVAAG MNPMDLKRGI DKAVVATIDE LKKLSKPCST NKEIAQVGSI SANSDSDIGD IIARAMDKVG KEGVITVEDG KSLQNELDVV EGMQFDRGYL SPYFINNPDK QVAILEQPFV LLFDKKISNI RDLLPVLEQV AKSGRPLLII AEDVEGEALA TLVVNNIRGI LKTCAVKAPG FGDRRKAMLE DIAILTGGQV IAEEVGLTLE KATLDDLGQA ARIEVGKENT IIIDGAGQAD RIEARVKQIR VQIEEASSDY DREKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG IVPGGGVALL RARAALGELK GDNHDQDAGI KIVLRAMEQP LREIVANAGD EPSVVVNKVV EGSGNYGFNA ATGEYGDMVE MGVLDPTKVT RTALQNAASV AGLMLTTDCM VGELAEDKPM GGMPDMGGMG GMGGMGMGM
|
| |