Gene Moth_0546 details


Gene Information

Locus tag: Moth_0546
Symbol:
ID: 3830931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 567766
End bp: 569385
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 56%
IMG OID: 637828487
Product: chaperonin GroEL
Protein accession: YP_429419
Protein GI: 83589410
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
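
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (both are assumptions about this record's conventions, not stated in it). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(567766, 569385)
print(length, protein_length_aa(length))  # 1620 539
```

Under these assumptions the computed values agree with the reported gene length (1620 bp) and protein length (539 aa).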


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000760166
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.951326
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCTA AACAGCTGGC CTTTGATGTA GAAGCCAGGC GGGCCCTGGA AAAGGGCGTC 
AGCACCGTTG CCCAAGCAGT GAAGGTGACC TTGGGCCCCA AGGGACGCAA TGTGGTTTTG
GAGCGTAAAT TCGGTTCCCC GGTAATTACC AAAGACGGGG TAACCGTTGC TAAAGAAATC
GAATTAAAGG ATCCCTACGA GAACATGGGT GCCCAGCTCT GCCGGGAAGT GGCCTCCAAG
ACCAACGACG TGGCGGGCGA TGGGACAACT ACAGCTACCG TCCTGGCCCA GGCCATTATG
CTGGAGGGCT TAAAGAATGT AGCCGCCGGT GCCAATCCCA TTTTCGTCAA GAAGGGTATT
GACCGGGCAG TTGAAACCGT AGTAGACGAA ATCAAGAAGA TCAGCATCCC GGTGGAGTCC
AAGGAAAGTA TCGCCCATGT AGCCTCCATT GCTGCCAACG AACGTGAGAT CGGCGAACTC
ATTGCCGATG CCATGGAGAA GGTTGGCAAA GACGGCGTCA TCACCGTGGA AGAATCCAAG
GGTACTGCTA CCACTGTTGA GGTAGTAGAA GGTATGGAAT TCGACCGCGG TTATGTATCA
CCGTACTTTG TGACCAATAC TGAAGCCATG GAAGCTGAGT TTGAGGAACC CTATATACTT
ATCCATGAAA AGAAGATCTC GGCCATCAAC GACCTCCTGC CCCTGCTGGA GAAAGTCGTC
CGTACCGGCA AACCCCTGGT AATTATTGCC GAGGACATTG AAGGCGAGGC CCTCGCCACC
CTGGTGGTCA ACAAACTGCG GGGCACCCTG AACTGCGCTG CCGTCAAAGC CCCTGGTTTT
GGCGATCGCC GCAAGGCCAT GATGGAGGAT ATCGCCATCC TCACCGGCGG CACCTTCCTC
TCCGAAGACC TGGGGGTCAA GCTGGAGAAC GTCGACCTGA ATATGCTTGG TCGGGCCAAG
AAGGTTAAAA TTGCCAAGGA GAAGACCACC ATCGTTGAGG GCTACGGCAA GAAAGAGGCT
GTTGACGGCC GGATAGCCCA GATTAAGAAA CAAATCGAAG AAACCGACTC CGATTACGAC
CGCGAGAAAT TGCAGGAGCG TCTGGCCAAG CTGGCCGGTG GCGTGGCCGT CATCCGTGTT
GGTGCGGCTA CCGAAACTGA ACTGAAGGAA AAGAAACACC GGGTTGAAGA CGCCCTGGCA
GCTACCCGGG CGGCCGTTGA AGAGGGTATC GTTCCCGGTG GCGGTGCTAC CCTGGTACAC
GCCATCCCGG CCGTGGAAAA GATCCAGGCC GAGGGTGACG AGGCTGTCGG TGTCAGGATT
GTCCGCCGGG CTCTGGAAGA ACCCCTGCGC CAGATTGCAG CCAATGCTGG TCTGGAAGGT
TCGGTTATTG TTGAGCGGGT ACGCAGCGAG CAACCCGGTA TCGGCTTTGA CGCCGTGAAG
GAGGAGTATG TGGACATGAT TAAGGCCGGT ATCGTTGACC CGGCCAAGGT CACCCGCAGC
GCCCTCCAGA ACGCGGCCAG CATCGCCTCC ATGCTCTTGA CTACCGAGGC CATTATCGCC
GAAATTCCCA AGGAAGAAAA AGCGCCTGCC ATGCCGCCCG GTGGCGGAAT GGATTACTAA
 
Protein sequence
MAAKQLAFDV EARRALEKGV STVAQAVKVT LGPKGRNVVL ERKFGSPVIT KDGVTVAKEI 
ELKDPYENMG AQLCREVASK TNDVAGDGTT TATVLAQAIM LEGLKNVAAG ANPIFVKKGI
DRAVETVVDE IKKISIPVES KESIAHVASI AANEREIGEL IADAMEKVGK DGVITVEESK
GTATTVEVVE GMEFDRGYVS PYFVTNTEAM EAEFEEPYIL IHEKKISAIN DLLPLLEKVV
RTGKPLVIIA EDIEGEALAT LVVNKLRGTL NCAAVKAPGF GDRRKAMMED IAILTGGTFL
SEDLGVKLEN VDLNMLGRAK KVKIAKEKTT IVEGYGKKEA VDGRIAQIKK QIEETDSDYD
REKLQERLAK LAGGVAVIRV GAATETELKE KKHRVEDALA ATRAAVEEGI VPGGGATLVH
AIPAVEKIQA EGDEAVGVRI VRRALEEPLR QIAANAGLEG SVIVERVRSE QPGIGFDAVK
EEYVDMIKAG IVDPAKVTRS ALQNAASIAS MLLTTEAIIA EIPKEEKAPA MPPGGGMDY
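
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCTGCTA AACAGCTGGC CTTTGATGTA"))  # MAAKQLAFDV
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the reported GC content.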