Gene Moth_2129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2129 
Symbol 
ID3833280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2227221 
End bp2228831 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content61% 
IMG OID637830054 
Productchaperonin GroEL 
Protein accessionYP_430964 
Protein GI83590955 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000897626 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAC AAGTAGTTTT CGACCGCGAA GCCAGGGAGG CCCTGGAAAA AGGTATTACC 
AAACTCACCG AAGCCGTCCG GGTCACCCTG GGACCCCGGG GACGCAACGT GGTCTTGGAA
AAGAAATTCG GGGCCCCTAC CATTACCAAC GACGGCGTGA CCATTGCCAA GGAGGTTGAG
CTGGAAGACC CCCTTGAGAA TGTGGGCGCG CTGCTGGTGC GGGAAGTAGC CTCCAAGACC
AACGATGTCG CCGGCGATGG GACTACTACC GCCTGTGTTC TGGCCCAGGC TATTGTCCGG
GAGGGCATGA AAAACGTGGC CGCCGGGGCC AATCCTATGT TCATGAAGCG GGGTATTGAA
AAGGCGGTGG CGGCCGTGGT GGAGAACCTT AAGGCCCAGG CCCGGCCGGT GGAAACCAAG
GACTCCATCA GCCAGGTAGC CTCCATTTCT GCCAATGACC CCCAGATCGG CGCCCTGGTG
GCCGACGCCA TGGAAAAGGT GGGGAAGGAC GGCGTCATAA CCGTGGAGGA ATCCAAAGGT
ATGGAAACCG CCGTGGACGT CGTGGAAGGC ATGCAGTTCG ATCGCGGCTA TATCTCTCCG
TATATGGTCA CCGACAACGA GCGCATGGAA GCCGTCCTGG AAGAGCCCTA CATCCTCATC
ACCGATAAAA AGATTACCGC CGTGGCCGAC CTGGTACCCG TCCTGGAACG GGTAGTACGG
ACGGGTAAAC CCCTGCTTAT AATCTGCGAG GATATGGAGG GTGAAGCTCT GGCCACCCTG
GTGGTCAATA AGATCCGGGG CACCTTTACC TGCGTGGCCG TCAAGGCGCC GGCCTTCGGC
GATCGGCGCA AGGCCATGCT ACAGGACATC GCCATCCTGA CGGGTGGCCA GGTTATTACC
GAAGAAGCCG GCTTGAAACT GGAGAACACC ACCCTGGATA TGCTGGGCCA GGCGCGCCAG
GTCCGGGTGG GTAAAGAGGA AACCACCATC GTCGAAGGTC GCGGCAAAGA AGAGGCCATT
GAAGCCCGGA TAGCCCAGAT TCGCCGCGAG TACGAGGAGT CGACCTCGGA CTACGACCGG
GAGAAACTCC AGGAACGCCT GGCCAAACTG GCCGGCGGTG TGGCGGTCAT TAAAGTCGGG
GCCGCTACCG AGACGGAAAT GAAAGAAAAG AAAATGCGCA TCGAAGACGC CCTCGCAGCC
ACCCGGGCGG CGGTGGAGGA GGGCATTGTC CCCGGCGGCG GCACCGCCCT GGTACGCGCC
CAGACGGCCC TGGATGGTGT CCAGGCCCAG GGTGACGAAC TGACGGGGGT GCGTCTGGTC
TACCGCGCCC TGGAAGAACC CATGCGCCAG ATTGCGGCCA ATGCCGGCGT TGATGGATCG
GTGGTAGTGG AGAAGGTGCG CCAGAGCGGT GACAGCATGG GCTTTAACGC CGCTACCCGG
GAGTATGTCA ACCTCTTTGA AGCGGGTATT GTCGATCCCT TGAAGGTGAC CCGTTCCGCC
CTGGAGAATG CTGCCAGCAT TGCCTCCCTG GTCCTGACCA CTGAGAGTCT AATAGCCGAC
ATTCCGGAGG AAGAACCGCC CGTTCCCGGC GGCGGTATGC CTCCCATGTA A
 
Protein sequence
MAKQVVFDRE AREALEKGIT KLTEAVRVTL GPRGRNVVLE KKFGAPTITN DGVTIAKEVE 
LEDPLENVGA LLVREVASKT NDVAGDGTTT ACVLAQAIVR EGMKNVAAGA NPMFMKRGIE
KAVAAVVENL KAQARPVETK DSISQVASIS ANDPQIGALV ADAMEKVGKD GVITVEESKG
METAVDVVEG MQFDRGYISP YMVTDNERME AVLEEPYILI TDKKITAVAD LVPVLERVVR
TGKPLLIICE DMEGEALATL VVNKIRGTFT CVAVKAPAFG DRRKAMLQDI AILTGGQVIT
EEAGLKLENT TLDMLGQARQ VRVGKEETTI VEGRGKEEAI EARIAQIRRE YEESTSDYDR
EKLQERLAKL AGGVAVIKVG AATETEMKEK KMRIEDALAA TRAAVEEGIV PGGGTALVRA
QTALDGVQAQ GDELTGVRLV YRALEEPMRQ IAANAGVDGS VVVEKVRQSG DSMGFNAATR
EYVNLFEAGI VDPLKVTRSA LENAASIASL VLTTESLIAD IPEEEPPVPG GGMPPM