Gene Mvan_4036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4036 
SymbolclpX 
ID4648437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4318802 
End bp4320082 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID639807498 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_954819 
Protein GI120404990 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.330745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.230586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGCA TTGGAGATGG CGGCGACCTG CTGAAGTGCT CGTTCTGCGG CAAGAGCCAA 
AAGCAGGTGA AGAAGCTCAT CGCGGGACCC GGCGTCTACA TCTGCGACGA GTGCATCGAC
CTGTGCAACG AGATCATCGA GGAAGAACTC GCCGACGCCG ACGACGTCAA GCTCGATGAG
CTGCCCAAAC CTGCGGAGAT CCGTGAGTTC CTCGAGGGCT ACGTCATCGG GCAGGACACC
GCCAAGCGCA CGCTGGCCGT GGCCGTCTAC AACCACTACA AGCGCATCCA GGCGGGCGAG
AAGGCCCGCG ACTCGCGCTC GGAGCCCGTC GAGCTGGCCA AGTCCAACAT CCTGATGCTC
GGCCCGACGG GCTGTGGCAA GACCTACCTC GCGCAGACGC TGGCCAAGAT GCTCAACGTC
CCGTTCGCGA TCGCGGATGC GACGGCGCTG ACCGAAGCCG GCTATGTCGG TGAGGACGTC
GAGAACATTC TGCTCAAACT GATCCAGGCC GCCGACTACG ACGTCAAGCG CGCCGAGACG
GGCATCATCT ACATCGACGA GGTCGACAAG ATCGCCCGCA AGAGCGAGAA CCCGTCGATC
ACCCGGGACG TCTCCGGTGA GGGCGTACAG CAGGCGCTGC TGAAGATCCT GGAAGGCACG
CAGGCGTCGG TGCCCCCGCA GGGCGGACGC AAGCACCCGC ACCAGGAGTT CATCCAGATC
GACACCACCA ACGTGCTGTT CATCGTGGCA GGCGCGTTCG CCGGCTTGGA GCGGATCGTG
TCCGACCGCG TCGGCAAGCG TGGCCTGGGC TTCGGCGCCG AGGTGAAGTC CAAGGCCGAG
ATCGACACCC AGGACCACTT CGCCGAGGTG ATGCCCGAGG ATCTGATCAA GTTCGGTCTG
ATCCCCGAGT TCATCGGCCG GCTCCCGGTC GTCGCGTCGG TGACGAACCT GGACAAGGAA
TCGCTCGTGC AGATCCTGTC CCAGCCGAAG AACGCGTTGG TCAAGCAGTA CACCCGGCTG
TTCGAGATGG ACGGTGTGGA GCTGGAGTTC GCCGAAGACG CGCTGGAGGC GATCGCCGAT
CAGGCCATCC ACCGTGGCAC CGGCGCCCGC GGTCTGCGCG CCATCATGGA GGAAGTCCTG
CTGCCGGTGA TGTACGACAT CCCGAGCCGC GACGACGTCG CCAAGGTGGT CGTCACCAAG
GAGACCGTGC TGGACAACGT GCTGCCGACC ATCGTGCCGC GCAAGCCGTC CCGCACCGAG
CGTCGCGACA AGAGCGCCTA G
 
Protein sequence
MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL ADADDVKLDE 
LPKPAEIREF LEGYVIGQDT AKRTLAVAVY NHYKRIQAGE KARDSRSEPV ELAKSNILML
GPTGCGKTYL AQTLAKMLNV PFAIADATAL TEAGYVGEDV ENILLKLIQA ADYDVKRAET
GIIYIDEVDK IARKSENPSI TRDVSGEGVQ QALLKILEGT QASVPPQGGR KHPHQEFIQI
DTTNVLFIVA GAFAGLERIV SDRVGKRGLG FGAEVKSKAE IDTQDHFAEV MPEDLIKFGL
IPEFIGRLPV VASVTNLDKE SLVQILSQPK NALVKQYTRL FEMDGVELEF AEDALEAIAD
QAIHRGTGAR GLRAIMEEVL LPVMYDIPSR DDVAKVVVTK ETVLDNVLPT IVPRKPSRTE
RRDKSA