Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0546 |
Symbol | |
ID | 3830931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 567766 |
End bp | 569385 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828487 |
Product | chaperonin GroEL |
Protein accession | YP_429419 |
Protein GI | 83589410 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000760166 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.951326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCTA AACAGCTGGC CTTTGATGTA GAAGCCAGGC GGGCCCTGGA AAAGGGCGTC AGCACCGTTG CCCAAGCAGT GAAGGTGACC TTGGGCCCCA AGGGACGCAA TGTGGTTTTG GAGCGTAAAT TCGGTTCCCC GGTAATTACC AAAGACGGGG TAACCGTTGC TAAAGAAATC GAATTAAAGG ATCCCTACGA GAACATGGGT GCCCAGCTCT GCCGGGAAGT GGCCTCCAAG ACCAACGACG TGGCGGGCGA TGGGACAACT ACAGCTACCG TCCTGGCCCA GGCCATTATG CTGGAGGGCT TAAAGAATGT AGCCGCCGGT GCCAATCCCA TTTTCGTCAA GAAGGGTATT GACCGGGCAG TTGAAACCGT AGTAGACGAA ATCAAGAAGA TCAGCATCCC GGTGGAGTCC AAGGAAAGTA TCGCCCATGT AGCCTCCATT GCTGCCAACG AACGTGAGAT CGGCGAACTC ATTGCCGATG CCATGGAGAA GGTTGGCAAA GACGGCGTCA TCACCGTGGA AGAATCCAAG GGTACTGCTA CCACTGTTGA GGTAGTAGAA GGTATGGAAT TCGACCGCGG TTATGTATCA CCGTACTTTG TGACCAATAC TGAAGCCATG GAAGCTGAGT TTGAGGAACC CTATATACTT ATCCATGAAA AGAAGATCTC GGCCATCAAC GACCTCCTGC CCCTGCTGGA GAAAGTCGTC CGTACCGGCA AACCCCTGGT AATTATTGCC GAGGACATTG AAGGCGAGGC CCTCGCCACC CTGGTGGTCA ACAAACTGCG GGGCACCCTG AACTGCGCTG CCGTCAAAGC CCCTGGTTTT GGCGATCGCC GCAAGGCCAT GATGGAGGAT ATCGCCATCC TCACCGGCGG CACCTTCCTC TCCGAAGACC TGGGGGTCAA GCTGGAGAAC GTCGACCTGA ATATGCTTGG TCGGGCCAAG AAGGTTAAAA TTGCCAAGGA GAAGACCACC ATCGTTGAGG GCTACGGCAA GAAAGAGGCT GTTGACGGCC GGATAGCCCA GATTAAGAAA CAAATCGAAG AAACCGACTC CGATTACGAC CGCGAGAAAT TGCAGGAGCG TCTGGCCAAG CTGGCCGGTG GCGTGGCCGT CATCCGTGTT GGTGCGGCTA CCGAAACTGA ACTGAAGGAA AAGAAACACC GGGTTGAAGA CGCCCTGGCA GCTACCCGGG CGGCCGTTGA AGAGGGTATC GTTCCCGGTG GCGGTGCTAC CCTGGTACAC GCCATCCCGG CCGTGGAAAA GATCCAGGCC GAGGGTGACG AGGCTGTCGG TGTCAGGATT GTCCGCCGGG CTCTGGAAGA ACCCCTGCGC CAGATTGCAG CCAATGCTGG TCTGGAAGGT TCGGTTATTG TTGAGCGGGT ACGCAGCGAG CAACCCGGTA TCGGCTTTGA CGCCGTGAAG GAGGAGTATG TGGACATGAT TAAGGCCGGT ATCGTTGACC CGGCCAAGGT CACCCGCAGC GCCCTCCAGA ACGCGGCCAG CATCGCCTCC ATGCTCTTGA CTACCGAGGC CATTATCGCC GAAATTCCCA AGGAAGAAAA AGCGCCTGCC ATGCCGCCCG GTGGCGGAAT GGATTACTAA
|
Protein sequence | MAAKQLAFDV EARRALEKGV STVAQAVKVT LGPKGRNVVL ERKFGSPVIT KDGVTVAKEI ELKDPYENMG AQLCREVASK TNDVAGDGTT TATVLAQAIM LEGLKNVAAG ANPIFVKKGI DRAVETVVDE IKKISIPVES KESIAHVASI AANEREIGEL IADAMEKVGK DGVITVEESK GTATTVEVVE GMEFDRGYVS PYFVTNTEAM EAEFEEPYIL IHEKKISAIN DLLPLLEKVV RTGKPLVIIA EDIEGEALAT LVVNKLRGTL NCAAVKAPGF GDRRKAMMED IAILTGGTFL SEDLGVKLEN VDLNMLGRAK KVKIAKEKTT IVEGYGKKEA VDGRIAQIKK QIEETDSDYD REKLQERLAK LAGGVAVIRV GAATETELKE KKHRVEDALA ATRAAVEEGI VPGGGATLVH AIPAVEKIQA EGDEAVGVRI VRRALEEPLR QIAANAGLEG SVIVERVRSE QPGIGFDAVK EEYVDMIKAG IVDPAKVTRS ALQNAASIAS MLLTTEAIIA EIPKEEKAPA MPPGGGMDY
|
| |