Gene Mlg_2287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2287 
SymbolclpX 
ID4268384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2595034 
End bp2596311 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content65% 
IMG OID638127046 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_743119 
Protein GI114321436 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.16116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.74961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA AAGACAACGG CAGAGACGAC GGCGGCAAGC TCCTTTACTG TTCTTTCTGC 
GGGAAGAGCC AGCACGAGGT CCGCAAGCTG ATCGCCGGGC CCTCGGTGTT TGTGTGTGAC
GAGTGCGTCG AGCTCTGCAA CGACATTATC CGCGAGGAGC TGCAGGAGCA GTCCAGCTCC
GCGGGGGCGG GCCTGCCGCG GCCTCACGAG ATCAACCAGG TGCTGGATGA GTTCGTGGTG
GGGCAGGAGC ACGCCAAGAA GGTGCTCTCG GTGGCCGTCT ACAACCACTA CAAACGCCTC
GAGGCGGGCA GCCGCAAGGA CGAGGTGGAG CTGTCCAAGA GCAATATCCT GCTCATCGGC
CCCACCGGCT CGGGCAAGAC GCTGCTGGCC GAGACGCTGG CGCGCATGCT CAACGTGCCC
TTCACCATTG CCGACGCCAC CACCCTGACC GAGGCGGGGT ACGTCGGTGA GGACGTGGAG
AACATCATCC AGAAGCTGCT GCAAAAGTGC GATTACGATG TGGAGAAGGC CCAGCAGGGC
ATCGTCTACA TCGACGAGAT CGACAAGGTC TCGCGCAAGG CGGACAACCC CTCCATCACC
CGCGACGTCT CGGGCGAGGG GGTGCAGCAG GCGCTGCTCA AGCTGATCGA GGGCACCACG
GCCTCGGTGC CGCCCCAGGG CGGCCGCAAG CACCCGCAGC AGGAGTTCCT GCAGGTGGAC
ACCGGCGGCA TCCTGTTCAT CTGCGGCGGC GCCTTCGCCG GCCTGGACAA GGTCATCCAG
GACCGTTCCG AGAAGGGCGG TATCGGCTTC TCCGCCGAGA TCAAGTCCAA GGACGAGAAG
CGCTCGGTGG GTGAGACCCT GCAGGACGTG GAGCCCGAGG ATCTGGTCAA GTACGGCCTG
ATCCCGGAGT TTGTCGGCCG CCTGCCGGTG GTCGCCACCC TGGAGGAGCT GGACGAGCAG
GCCCTGGTGG AGATCCTCTC CGCGCCCAAG AACGCCCTGG TCAAGCAGTA CCAGAAGCTG
TTCGAGATGG AGGGTGTGGA GCTGGAGTTC CGCGAGGACG CCCTGCGCGC CGTGGCCCGC
AAGGCCATGG ACCGCAAGAC CGGTGCCCGC GGCCTGCGCA CCATCCTCGA GCACGTGCTG
CTGGACACCA TGTACGATCT GCCCTCCATG GAGAACGTTG AAAAGGTGGT GGTGGACGAC
GCCGTGATCC GCGGCGAGAC GCGGCCCTAC ATCATCTACG GCAATCAGGA GCAGTCGCGG
GCGGCTTCCT CCGACTGA
 
Protein sequence
MTDKDNGRDD GGKLLYCSFC GKSQHEVRKL IAGPSVFVCD ECVELCNDII REELQEQSSS 
AGAGLPRPHE INQVLDEFVV GQEHAKKVLS VAVYNHYKRL EAGSRKDEVE LSKSNILLIG
PTGSGKTLLA ETLARMLNVP FTIADATTLT EAGYVGEDVE NIIQKLLQKC DYDVEKAQQG
IVYIDEIDKV SRKADNPSIT RDVSGEGVQQ ALLKLIEGTT ASVPPQGGRK HPQQEFLQVD
TGGILFICGG AFAGLDKVIQ DRSEKGGIGF SAEIKSKDEK RSVGETLQDV EPEDLVKYGL
IPEFVGRLPV VATLEELDEQ ALVEILSAPK NALVKQYQKL FEMEGVELEF REDALRAVAR
KAMDRKTGAR GLRTILEHVL LDTMYDLPSM ENVEKVVVDD AVIRGETRPY IIYGNQEQSR
AASSD