Gene Mjls_3581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3581 
SymbolclpX 
ID4879292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3775571 
End bp3776851 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID640140885 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001071849 
Protein GI126436158 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0142597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCA TTGGAGACGG CGGTGACCTG CTGAAGTGCT CGTTCTGCGG GAAGAGTCAG 
AAGCAGGTCA AGAAGCTGAT CGCCGGCCCC GGCGTGTACA TCTGTGACGA GTGCATCGAT
CTCTGCAACG AGATCATCGA AGAGGAACTG GCCGACGCCG ACGAGGTGAA ACTCGACGAG
CTGCCCAAGC CCGCGGAGAT CCGCGAGTTC CTCGAGAACT ACGTCATCGG ACAGGACACC
GCCAAGCGCA CCCTGGCCGT CGCCGTCTAC AACCATTACA AGCGCATCCA GGCCGGCGAG
AAGAGCCGCG ATTCGCGCAC CGAGCCCGTC GAGTTGACCA AGTCCAACAT CCTGATGCTG
GGCCCGACCG GCTGCGGTAA GACCTACCTC GCCCAGACGC TGGCCAAGAT GCTCAACGTG
CCGTTCGCGA TCGCCGACGC CACGGCGCTG ACCGAAGCCG GCTACGTCGG TGAGGACGTC
GAGAACATCC TGCTGAAGCT GATCCAGGCC GCCGACTACG ACGTCAAGCG CGCCGAGACC
GGCATCATCT ACATCGACGA GGTCGACAAG ATCGCCCGCA AGAGCGAGAA CCCGTCGATC
ACCCGCGACG TCTCCGGCGA AGGTGTGCAG CAGGCGCTGC TCAAGATCCT CGAGGGCACG
CAGGCGTCGG TGCCTCCGCA GGGCGGGCGC AAGCACCCCC ACCAGGAGTT CATCCAGATC
GACACCACCA ACGTGCTGTT CATCGTCGCG GGTGCGTTCG CCGGTCTGGA GAAGATCGTC
TCCGACCGCG TCGGCAAGCG CGGCCTCGGC TTCGGCGCGG AGGTCCGCTC CAAGGCCGAG
ATCGACACCC AGGACCACTT CGCCGAGGTC ATGCCCGAGG ACCTGATCAA GTTCGGTCTG
ATCCCCGAGT TCATCGGCCG GTTGCCGGTC GTGGCGTCGG TGACCAACCT CGACAAGGAG
TCGTTGGTCA AGATCCTGTC GGAGCCGAAG AACGCGTTGG TCAAGCAGTA CACGCGGCTG
TTCGAGATGG ACGGCGTCGA GCTGGAGTTC ACCGGTGACG CGCTGGACGC CATCGCCGAT
CAGGCCATCC ACCGCGGCAC CGGCGCCCGT GGCCTGCGCG CCATCATGGA GGAAGTCCTG
CTGCCGGTGA TGTACGACAT CCCCAGCCGC GACGACGTCG CGAAGGTCGT CGTCACCAAG
GAGACCGTGC AGGACAACGT GCTGCCGACG ATCGTGCCGC GTAAGCCGTC ACGCCCCGAG
CGTCGCGACA AGAGCGCCTA G
 
Protein sequence
MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL ADADEVKLDE 
LPKPAEIREF LENYVIGQDT AKRTLAVAVY NHYKRIQAGE KSRDSRTEPV ELTKSNILML
GPTGCGKTYL AQTLAKMLNV PFAIADATAL TEAGYVGEDV ENILLKLIQA ADYDVKRAET
GIIYIDEVDK IARKSENPSI TRDVSGEGVQ QALLKILEGT QASVPPQGGR KHPHQEFIQI
DTTNVLFIVA GAFAGLEKIV SDRVGKRGLG FGAEVRSKAE IDTQDHFAEV MPEDLIKFGL
IPEFIGRLPV VASVTNLDKE SLVKILSEPK NALVKQYTRL FEMDGVELEF TGDALDAIAD
QAIHRGTGAR GLRAIMEEVL LPVMYDIPSR DDVAKVVVTK ETVQDNVLPT IVPRKPSRPE
RRDKSA