Gene Moth_0529 details

Gene Information

Locus tag: Moth_0529
Symbol: clpX
ID: 3830914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 548157
End bp: 549416
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 54%
IMG OID: 637828470
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_429402
Protein GI: 83589393
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
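
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 548157
END_BP = 549416
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not counted."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span is 1260 bp, and 1260 / 3 = 420 codons, one of which is the stop codon, leaving 419 residues.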


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.128458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAAAT ACACCGATGA CAAAGGCCAG TTGAAGTGTT CCTTCTGCGG GAAACTCCAG 
GACCAGGTCA AGAAACTGGT TGCTGGCCCG GGTGTATATA TCTGTGACGA GTGCATCGAG
CTGTGCAATG AGATAATCGA AGAGGAGCTA AGCGAGGACC TGAACCTGGA AATGGGTGAA
CTGCCCAAGC CCAAGGAGAT CCGGGAGATC CTCGATCAGT ACGTCATAAG CCAGGATCAA
GCCAAAAAGG CCCTGGCTGT AGCCGTTTAT AACCATTATA AACGCATCAA CCTGGGTATG
AAGATGGATG ATGTTGAACT CCAGAAGAGC AACATCATCA TGCTGGGTCC CACCGGTAGC
GGTAAGACCC TGCTGGCCCA GACGCTGGCC AAGATTCTCA ATGTGCCCTT CGCCATCGCC
GACGCCACCT CCCTGACGGA AGCCGGCTAT GTGGGTGAGG ATGTGGAGAA TATTCTCCTC
AAGCTCATCC AGGCGGCCGA CTATGATGTG GAGAAGGCAG AGAAGGGTAT TGTCTATATC
GACGAGATCG ACAAGATAGC CCGGAAATCA GAGAACCCAT CCATTACCCG GGACGTATCC
GGGGAAGGGG TCCAGCAGGC CCTGCTCAAG ATCCTGGAAG GCACCATTGC CAGCGTGCCG
CCCCAGGGAG GCCGCAAGCA CCCTCACCAG GAGTTTATCC AGCTGGATAC GACCAATATC
CTCTTTATTT GTGGAGGGGC CTTTGACGGC CTGGACAAGA TCATCAAGAA TCGCATCTCC
CAGAAAACCA TGGGCTTTGG CGCCGAGATC CGGGGCAAGA ACGACGTCCA GGTGGGGGAT
ATCCTGAAGC AGGTCCTGCC CGTTGACCTG CTGAAGTACG GCCTGATCCC CGAGTTTGTT
GGCCGCCTGC CGGTGATTGT CACCCTGGAC GCCCTGGACG AGACTGCCCT GATCAGGGTC
CTTACGGAAC CCCGCAACGC CCTGGTGAAA CAGTACCAGA AGCTCTTTGA AATGGACGGG
GTAACCCTGG AATTTAAAGA AGACGCCCTG GTTACTATCG CCAGGGAAGC TATCAAGCGA
GAAACCGGGG CCCGGGGCCT GCGGGCCATC CTGGAGGAGA TCATGCTCGA CGTTATGTAC
GAGATCCCCT CCCGGAACAA TATCTCCAAG TGTATAATCA CTAAAGATGT TGTCCTGCGC
AAGGAAGAAC CCTTGCTCCT TACGGTAGAA AGGAAAAAGA AAAAAGAAGA AACAGCCTGA
 
Protein sequence
MYKYTDDKGQ LKCSFCGKLQ DQVKKLVAGP GVYICDECIE LCNEIIEEEL SEDLNLEMGE 
LPKPKEIREI LDQYVISQDQ AKKALAVAVY NHYKRINLGM KMDDVELQKS NIIMLGPTGS
GKTLLAQTLA KILNVPFAIA DATSLTEAGY VGEDVENILL KLIQAADYDV EKAEKGIVYI
DEIDKIARKS ENPSITRDVS GEGVQQALLK ILEGTIASVP PQGGRKHPHQ EFIQLDTTNI
LFICGGAFDG LDKIIKNRIS QKTMGFGAEI RGKNDVQVGD ILKQVLPVDL LKYGLIPEFV
GRLPVIVTLD ALDETALIRV LTEPRNALVK QYQKLFEMDG VTLEFKEDAL VTIAREAIKR
ETGARGLRAI LEEIMLDVMY EIPSRNNISK CIITKDVVLR KEEPLLLTVE RKKKKEETA
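
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the compact NCBI-style amino acid string for the standard code; translation table 11 assigns the same amino acids to codons and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper written for this example, not part of any database tooling, and it does not handle ambiguous bases.

```python
from itertools import product

# Codon order follows the NCBI layout: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGTACAAAT ACACCGATGA CAAAGGCCAG"))  # MYKYTDDKGQ
```

Applied to the first 30 bases of the gene sequence, this yields MYKYTDDKGQ, which matches the first ten residues of the protein sequence.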