Gene Arth_2403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2403 
SymbolclpX 
ID4445002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2697216 
End bp2698505 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content62% 
IMG OID639690213 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_831882 
Protein GI116670949 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.313413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGGA TTGGCGAGAG CACGGATCTG CTGAAGTGCT CTTTCTGCGG AAAGAGCCAG 
AAGCAGGTGC GAAAGCTCAT TGCCGGGCCC GGCGTCTACA TCTGCGACGA GTGCATTGAG
CTCTGCAACG AGATCATTGA AGAGGAACTC GCGGAAGTAG CGGACCTTGG CAGCTTCGAA
CTGCCCAAGC CGCGCGAAAT CTACGATTTC CTGCAGGAAT ACGTCATCGG CCAGGAACCG
GCCAAGCGTT CCCTCGCCGT CGCGGTGTAC AACCATTACA AGCGGATCCA GGCCGGCCAC
GCCCCGAAGA GCGGCAGCCT CGCCGAAGGC GTCCATCACG ACGACGTCGA GATCGCCAAA
TCGAACATCC TCCTGATCGG CCCCACCGGT TGCGGTAAGA CCTACCTGGC CCAGACCCTC
GCCCGGCGCC TCAACGTACC GTTTGCCGTC GCGGACGCTA CGGCGCTGAC CGAGGCCGGG
TACGTGGGCG AGGACGTGGA GAACATCCTC CTGAAGCTCA TCCAGGCCGC TGACTACGAC
GTCAAGAAGG CCGAGCAGGG CATCATCTAC ATCGACGAGA TCGACAAGAT CTCCCGAAAG
AGCGAAAACC CTTCCATCAC CCGGGATGTC TCCGGCGAGG GCGTGCAGCA GGCCCTCCTG
AAGATCCTGG AAGGCACGGT GGCCTCGGTG CCCCCGCAGG GCGGCCGGAA ACACCCGCAC
CAGGAATTCA TCCAGATCGA CACCACCAAT GTGCTGTTCA TTGTCGCGGG GGCCTTCGCC
GGGCTCGAGG ACATCATCGG CTCCCGTTCC GGCCGCAAGG GCATAGGCTT CGGCGCTCCG
CTGAACGAAG TCAAGAACAA CTCCGACTCC TACGGCGAGG TCATGCCGGA AGACCTGCTG
AAGTTCGGGC TGATTCCGGA ATTCATCGGC CGCCTTCCCG TCATCACCAC GGTCTCCAAT
TTGGACCGGC CGGCCCTGAT CCAAATTCTG TCCACGCCGA AGAACGCGCT GGTGAAGCAG
TATCAGAAGA TGTTCCAGCT GGATGGTGTG GAGCTGCTCT TTGACGACGA AGCGCTGGAT
GTGATCGCCG ACCAGGCCCT GGAACGGGGC ACCGGCGCCC GCGGGCTCCG CGCCATCATG
GAGGAAGTGC TCCTCCCCGT GATGTTTGAT CTCCCCAGCA GGGACGACAT CGCCAGCGTT
GTTATTACCG CGGATGTGGT GGCAAAGAAG GCGCCGCCCA CCATGATCGC CCACGACGTG
GTGGCGAAGC GGCGGAATAA ATCAGCTTAG
 
Protein sequence
MARIGESTDL LKCSFCGKSQ KQVRKLIAGP GVYICDECIE LCNEIIEEEL AEVADLGSFE 
LPKPREIYDF LQEYVIGQEP AKRSLAVAVY NHYKRIQAGH APKSGSLAEG VHHDDVEIAK
SNILLIGPTG CGKTYLAQTL ARRLNVPFAV ADATALTEAG YVGEDVENIL LKLIQAADYD
VKKAEQGIIY IDEIDKISRK SENPSITRDV SGEGVQQALL KILEGTVASV PPQGGRKHPH
QEFIQIDTTN VLFIVAGAFA GLEDIIGSRS GRKGIGFGAP LNEVKNNSDS YGEVMPEDLL
KFGLIPEFIG RLPVITTVSN LDRPALIQIL STPKNALVKQ YQKMFQLDGV ELLFDDEALD
VIADQALERG TGARGLRAIM EEVLLPVMFD LPSRDDIASV VITADVVAKK APPTMIAHDV
VAKRRNKSA