Gene Athe_0710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0710 
SymbolclpX 
ID7407134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp798102 
End bp799403 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content38% 
IMG OID643715082 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_002572598 
Protein GI222528716 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCTAAGT TTGAAGAGAA GAAACAGCTA AGATGTTCAT TCTGTGGGAA GTCACAGGAT 
GAAGTAAGAC GACTTGTTGC TGGTCCTGGC GTTTATATCT GTGATGAGTG TATTGAACTT
TGCTCTGAAA TAATCTCAGA AGATTTTGAA GAAGAAGAAT ACAATGAGTT TGACGACAGA
CTTCCAACTC CAAAGGAGAT AAAAGAGTTT TTAGACCAAT ACGTTGTTGG TCAGGACCAC
GCAAAAAAGA TTTTGTCTGT GGCTGTATAT AACCATTACA AGAGGATATA CTACCACAAC
ACTAAAAAAG ACGATGTTGA ACTTCAAAAG AGCAACATCT TGATGCTTGG ACCAACAGGG
TCTGGTAAAA CATACCTTGC TCAGACTCTT GCAAAGATGT TAAATGTCCC GTTTGCAATA
GCTGATGCAA CAACTTTGAC AGAGGCAGGT TATGTTGGTG AAGACGTTGA AAATATCTTG
CTCAGGCTCA TACAGAATGC TGACTATGAT ATTGAAAGAG CAGAGCGTGG TATAATCTAT
ATAGATGAAA TTGACAAGAT TGCAAGAAAG TCTGACAATC CTTCTATCAC AAGAGATGTT
TCAGGCGAAG GTGTTCAGCA GGCCCTGCTC AAAATATTGG AAGGGACTAT TGCTTCTGTT
CCACCACAGG GTGGGAGAAA ACATCCGCAC CAGGAATTTA TACAGATAGA CACAACAAAC
ATCCTGTTTA TCTGTGGCGG TGCATTTGAG GGTATTGAGA AGATAATTGA AAAGAGAATT
GGTGAAAAGA CACTTGGTTT TAATGCAAAG ATTGAAAGCA AAAAAGAAAA AAAGATTGGG
GATATACTAA GACAAATAAT GCCTCAGGAT CTTCTAAAGT TTGGAATGAT TCCAGAGTTC
ATAGGACGCG TGCCTATAAT AGTTACATTG GATGCACTTG ACAAGGAAGC CCTGATAAAG
ATACTAACAG AGCCCAAAAA TGCTCTTGTA AAACAGTATC AAAAGCTCTT TGCAATGGAT
GGTGTTGAGT TGGAGTTTGA AAAAGATGCA TTAGAAGCGA TTGCTGACAA GGCTATTGAG
CGCAACACAG GTGCAAGAGG TCTTAGAGCT ATAATGGAAG AGATTATGCT TGATGTGATG
TTTGAGATTC CGTCAAATGA TAAGATAGAA AAGGTTATTA TTACAAAAGC TGCCGTTTTA
AAAGAAGACA AACCTATTGT AATAATAAAT GAGAACAAGA AAGCTCAGAA AAGACCAAAA
CTAAAACAGC GACTTCAGGA AAGAAGAGGA AATGTTTCAT AA
 
Protein sequence
MAKFEEKKQL RCSFCGKSQD EVRRLVAGPG VYICDECIEL CSEIISEDFE EEEYNEFDDR 
LPTPKEIKEF LDQYVVGQDH AKKILSVAVY NHYKRIYYHN TKKDDVELQK SNILMLGPTG
SGKTYLAQTL AKMLNVPFAI ADATTLTEAG YVGEDVENIL LRLIQNADYD IERAERGIIY
IDEIDKIARK SDNPSITRDV SGEGVQQALL KILEGTIASV PPQGGRKHPH QEFIQIDTTN
ILFICGGAFE GIEKIIEKRI GEKTLGFNAK IESKKEKKIG DILRQIMPQD LLKFGMIPEF
IGRVPIIVTL DALDKEALIK ILTEPKNALV KQYQKLFAMD GVELEFEKDA LEAIADKAIE
RNTGARGLRA IMEEIMLDVM FEIPSNDKIE KVIITKAAVL KEDKPIVIIN ENKKAQKRPK
LKQRLQERRG NVS